AMAZON
Become a Patron! http://www.patreon.com/everydAI
Thank you to Jeff, Gerald, Milan, Ian, Becky, Jino, Daniel, Narskogr, Jason, and Mariano for being $5+/month Patrons!
Follow me on Twitter! http://twitter.com/jordanbharrod
everydAI is a YouTube channel focused on highlighting the ways we interact with artificial intelligence every day.
Sources:
Reinforcement Learning – An Introduction – https://books.google.com/books?hl=en&lr=&id=uWV0DwAAQBAJ&oi=fnd&pg=PR7&dq=REINFORCEMENT+LEARNING&ots=mhuHt133o0&sig=U-0PPoz7blDn0DEWkhvSeEs7Ujw#v=onepage&q=REINFORCEMENT%20LEARNING&f=false
Predicting Flexible Behaviors in Simulated Environments (DeepMind) – https://deepmind.com/blog/article/producing-flexible-behaviours-simulated-environments
Solving a Rubik’s Cube with a Robot Hand (OpenAI) – https://openai.com/blog/solving-rubiks-cube/ and https://arxiv.org/pdf/1910.07113.pdf
Emergent Tool Use from Multi-Agent Interaction (OpenAI) – https://openai.com/blog/emergent-tool-use/
Apprenticeship Learning via Inverse Reinforcement Learning – https://dl.acm.org/citation.cfm?id=1015430
OpenAI Gym – http://gym.openai.com
Reinforcement Learning (Wikipedia) – https://en.wikipedia.org/wiki/Reinforcement_learning#Deep_reinforcement_learning
Control Theory (Wikipedia) – https://en.wikipedia.org/wiki/Control_theory#Classical_control_theory
Markov Decision Processes (Wikipedia) – https://en.wikipedia.org/wiki/Markov_decision_process#Policy_iteration