💡Enroll to gain access to the full course:
https://deeplizard.com/course/rlcpailzrd
Welcome back to this series on reinforcement learning! In this video, we’ll be introducing the idea of Q-learning with value iteration, which is a reinforcement learning technique used for learning the optimal policy in a Markov Decision Process.
We’ll illustrate how this technique works by introducing a game where a reinforcement learning agent tries to maximize points, and through this, we’ll also learn about Q-tables and the trade-off between exploration and exploitation.
Sources:
Reinforcement Learning: An Introduction, Second Edition by Richard S. Sutton and Andrew G. Bartow
http://incompleteideas.net/book/RLbook2020.pdf
Playing Atari with Deep Reinforcement Learning by Deep Mind Technologies
https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
🕒🦎 VIDEO SECTIONS 🦎🕒
00:00 Welcome to DEEPLIZARD – Go to deeplizard.com for learning resources
00:30 Help deeplizard add video timestamps – See example in the description
08:08 Collective Intelligence and the DEEPLIZARD HIVEMIND
💥🦎 DEEPLIZARD COMMUNITY RESOURCES 🦎💥
👋 Hey, we’re Chris and Mandy, the creators of deeplizard!
👉 Check out the website for more learning material:
🔗 https://deeplizard.com
💻 ENROLL TO GET DOWNLOAD ACCESS TO CODE FILES
🔗 https://deeplizard.com/resources
🧠 Support collective intelligence, join the deeplizard hivemind:
🔗 https://deeplizard.com/hivemind
🧠 Use code DEEPLIZARD at checkout to receive 15% off your first Neurohacker order
👉 Use your receipt from Neurohacker to get a discount on deeplizard courses
🔗 https://neurohacker.com/shop?rfsn=6488344.d171c6
👀 CHECK OUT OUR VLOG:
🔗 https://youtube.com/deeplizardvlog
❤️🦎 Special thanks to the following polymaths of the deeplizard hivemind:
Tammy
Mano Prime
Ling Li
🚀 Boost collective intelligence by sharing this video on social media!
👀 Follow deeplizard:
Our vlog: https://youtube.com/deeplizardvlog
Facebook: https://facebook.com/deeplizard
Instagram: https://instagram.com/deeplizard
Twitter: https://twitter.com/deeplizard
Patreon: https://patreon.com/deeplizard
YouTube: https://youtube.com/deeplizard
🎓 Deep Learning with deeplizard:
Deep Learning Dictionary – https://deeplizard.com/course/ddcpailzrd
Deep Learning Fundamentals – https://deeplizard.com/course/dlcpailzrd
Learn TensorFlow – https://deeplizard.com/course/tfcpailzrd
Learn PyTorch – https://deeplizard.com/course/ptcpailzrd
Natural Language Processing – https://deeplizard.com/course/txtcpailzrd
Reinforcement Learning – https://deeplizard.com/course/rlcpailzrd
Generative Adversarial Networks – https://deeplizard.com/course/gacpailzrd
🎓 Other Courses:
DL Fundamentals Classic – https://deeplizard.com/learn/video/gZmobeGL0Yg
Deep Learning Deployment – https://deeplizard.com/learn/video/SI1hVGvbbZ4
Data Science – https://deeplizard.com/learn/video/d11chG7Z-xk
Trading – https://deeplizard.com/learn/video/ZpfCK_uHL9Y
🛒 Check out products deeplizard recommends on Amazon:
🔗 https://amazon.com/shop/deeplizard
🎵 deeplizard uses music by Kevin MacLeod
🔗 https://youtube.com/channel/UCSZXFhRIx6b0dFX3xS8L1yQ
❤️ Please use the knowledge gained from deeplizard content for good, not evil.



![how i would learn cybersecurity in 2026 if i had to start over [$1,000 GIVEAWAY & BIG ANNOUNCEMENT] how i would learn cybersecurity in 2026 if i had to start over [$1,000 GIVEAWAY & BIG ANNOUNCEMENT]](https://i.ytimg.com/vi/RVPKW--dqhw/hqdefault.jpg)






