
What is Reinforcement Learning? | AI 101

Become a Patron! http://www.patreon.com/everydAI

Thank you to Jeff, Gerald, Milan, Ian, Becky, Jino, Daniel, Narskogr, Jason, and Mariano for being $5+/month Patrons!

Follow me on Twitter! http://twitter.com/jordanbharrod

everydAI is a YouTube channel focused on highlighting the ways we interact with artificial intelligence every day.

Sources:

Reinforcement Learning: An Introduction – https://books.google.com/books?hl=en&lr=&id=uWV0DwAAQBAJ&oi=fnd&pg=PR7&dq=REINFORCEMENT+LEARNING&ots=mhuHt133o0&sig=U-0PPoz7blDn0DEWkhvSeEs7Ujw#v=onepage&q=REINFORCEMENT%20LEARNING&f=false

Producing Flexible Behaviors in Simulated Environments (DeepMind) – https://deepmind.com/blog/article/producing-flexible-behaviours-simulated-environments

Solving Rubik’s Cube with a Robot Hand (OpenAI) – https://openai.com/blog/solving-rubiks-cube/ and https://arxiv.org/pdf/1910.07113.pdf

Emergent Tool Use from Multi-Agent Interaction (OpenAI) – https://openai.com/blog/emergent-tool-use/

Apprenticeship Learning via Inverse Reinforcement Learning – https://dl.acm.org/citation.cfm?id=1015430

OpenAI Gym – http://gym.openai.com

Reinforcement Learning (Wikipedia) – https://en.wikipedia.org/wiki/Reinforcement_learning#Deep_reinforcement_learning

Control Theory (Wikipedia) – https://en.wikipedia.org/wiki/Control_theory#Classical_control_theory

Markov Decision Processes (Wikipedia) – https://en.wikipedia.org/wiki/Markov_decision_process#Policy_iteration