THE FUTURE IS HERE

Reinforcement Learning: Zero to Hero

Reinforcement Learning 101

ABOUT ME
⭕ Subscribe: https://www.youtube.com/c/CodeEmporium?sub_confirmation=1
📚 Medium Blog: https://medium.com/@dataemporium
💻 Github: https://github.com/ajhalthor
👔 LinkedIn: https://www.linkedin.com/in/ajay-halthor-477974bb/

RESOURCES
[1 📚] Reinforcement Learning book: http://incompleteideas.net/book/RLbook2020.pdf
[2 📚] Paradigms of ML: https://idapgroup.com/blog/types-of-machine-learning-out-there/
[3 📚] ChatGPT blog: https://openai.com/blog/chatgpt
[4 📚] Model Free vs Model Based RL: https://spinningup.openai.com/en/latest/spinningup/rl_intro2.html
[5 📚] Original Code for MAB: https://github.com/ankonzoid/LearningX/blob/master/classical_RL/multiarmed_bandit/multiarmed_bandit.py
[6 📚] My code with some more comments: https://github.com/ajhalthor/multi-armed-bandit
[7 📚] Monte Carlo Video referenced: https://www.youtube.com/watch?v=mECgLyCzTQU

PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd9kS–NgVz0EPNyEmygV1Ha&si=AuThDZJwG19cgTA8
Natural Language Processing: https://youtube.com/playlist?list=PLTl9hO2Oobd_bzXUpzKMKA3liq2kj6LfE&si=LsVy8RDPu8jeO-cc
⭕ Transformers from Scratch: https://youtube.com/playlist?list=PLTl9hO2Oobd_bzXUpzKMKA3liq2kj6LfE
⭕ ChatGPT Playlist: https://youtube.com/playlist?list=PLTl9hO2Oobd9coYT6XsTraTBo4pL1j4HJ
⭕ Convolutional Neural Networks: https://youtube.com/playlist?list=PLTl9hO2Oobd9U0XHz62Lw6EgIMkQpfz74
⭕ The Math You Should Know : https://youtube.com/playlist?list=PLTl9hO2Oobd-_5sGLnbgE8Poer1Xjzz4h
⭕ Probability Theory for Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd9bPcq0fj91Jgk_-h1H_W3V
⭕ Coding Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd82vcsOnvCNzxrZOlrz3RiD

MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: https://imp.i384100.net/MathML
📕 Calculus: https://imp.i384100.net/Calculus
📕 Statistics for Data Science: https://imp.i384100.net/AdvancedStatistics
📕 Bayesian Statistics: https://imp.i384100.net/BayesianStatistics
📕 Linear Algebra: https://imp.i384100.net/LinearAlgebra
📕 Probability: https://imp.i384100.net/Probability

OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: https://imp.i384100.net/Deep-Learning
📕 Python for Everybody: https://imp.i384100.net/python
📕 MLOps Course: https://imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): https://imp.i384100.net/NLP
📕 Machine Learning in Production: https://imp.i384100.net/MLProduction
📕 Data Science Specialization: https://imp.i384100.net/DataScience
📕 Tensorflow: https://imp.i384100.net/Tensorflow

TIMESTAMPS
00:00 Part 1: Elements of Reinforcement Learning
13:17 Part 2: Multi Armed Bandits
23:30 Part 3: Markov Decision Process
31:25 Part 4 Bellman Equation
40:14 Part 5: Q Learning
51:51 Part 6: On policy vs Off Policy
01:04:24 Part 7: Monte Carlo in RL
01:13:55 Part 8: Deep Q Networks
01:21:05 Part 9: Proximal Policy Optimization (PPO)
01:30:25 Part 10: RLHF