THE FUTURE IS HERE

Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming

Here we introduce dynamic programming, which is a cornerstone of model-based reinforcement learning. We demonstrate dynamic programming for policy iteration and value iteration, leading to the quality function and Q-learning.

Citable link for this video: https://doi.org/10.52843/cassyni.6fs4s9

This is a lecture in a series on reinforcement learning, following the new Chapter 11 from the 2nd edition of our book “Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control” by Brunton and Kutz

Book Website: http://databookuw.com
Book PDF: http://databookuw.com/databook.pdf

Amazon: https://www.amazon.com/Data-Driven-Science-Engineering-Learning-Dynamical/dp/1108422098/

Brunton Website: eigensteve.com