Bellman Equations, Dynamic Programming, Generalized Policy Iteration

Bellman Equations, Dynamic Programming, Generalized Policy Iteration | Reinforcement Learning Part 2

My consulting company : https://truetheta.io

Part two of a six part series on Reinforcement Learning. We discuss the Bellman Equations, Dynamic Programming and Generalized Policy Iteration.

SOCIAL MEDIA

Twitter : https://twitter.com/DuaneJRich

Github: https://github.com/Duane321

SOURCES

[1] R. Sutton and A. Barto. Reinforcement learning: An Introduction (2nd Ed). MIT Press, 2018.

[2] H. Hasselt, et al. RL Lecture Series, Deepmind and UCL, 2021, https://www.youtube.com/playlist?list=PLqYmG7hTraZDVH599EItlEWsUOsJbAodm

SOURCE NOTES

The video covers the topics of Chapter 3 and 4 from [1]. The whole series teaches from [1]. [2] was a useful secondary resource.

TIMESTAMP
0:00 What We’ll Learn
1:09 Review of Previous Topics
2:46 Definition of Dynamic Programming
3:05 Discovering the Bellman Equation
7:13 Bellman Optimality
8:41 A Grid View of the Bellman Equations
11:24 Policy Evaluation
13:58 Policy Improvement
15:55 Generalized Policy Iteration
17:55 A Beautiful View of GPI
18:14 The Gambler’s Problem
20:42 Watch the Next Video!

THE FUTURE IS HERE

AI Now

Man-Machine Interface (MMI) by 6TL

A Brief History of Brain-Computer Interfaces (BCIs)

Smart Home Demo with ERP Speller based on Brain-Computer Interface (BCI)

Ameca Facial Expressions

This Ai Can Read Your Emotions

BIG DATA – "Slay (feat. L1ZY)"

El Big Data en 3 minutos

MEMS: The Second Silicon Revolution?

STMicroelectronics: The Turnaround That Created a European Semiconductor Giant

Mechanical circuits: electronics without electricity