An introduction to Policy Gradient methods - Deep Reinforcement Learning -

An introduction to Policy Gradient methods – Deep Reinforcement Learning

In this episode I introduce Policy Gradient methods for Deep Reinforcement Learning.

After a general overview, I dive into Proximal Policy Optimization: an algorithm designed at OpenAI that tries to find a balance between sample efficiency and code complexity. PPO is the algorithm used to train the OpenAI Five system and is also used in a wide range of other challenges like Atari and robotic control tasks.

If you want to support this channel, here is my patreon link:
https://patreon.com/ArxivInsights — You are amazing!! 😉

If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge

Links mentioned in the video:
⦁ PPO paper: https://arxiv.org/abs/1707.06347
⦁ TRPO paper: https://arxiv.org/abs/1502.05477
⦁ OpenAI PPO blogpost: https://blog.openai.com/openai-baselines-ppo/
⦁ Aurelien Geron: KL divergence and entropy in ML: https://youtu.be/ErfnhcEV1O8
⦁ Deep RL Bootcamp – Lecture 5: https://youtu.be/xvRrgxcpaHY
⦁ RL-adventure PyTorch implementation: https://github.com/higgsfield/RL-Adventure-2
⦁ OpenAI Baselines TensorFlow implementation: https://github.com/openai/baselines

THE FUTURE IS HERE

AI Now

#2: Wit.AI NLP-as-a-service – Natural Language Processing Chatbots

What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12

US completes first dogfight between AI-controlled F-16 and human pilot

Introduction to Small Unmanned Aerial System (sUAS-drone) Cybersecurity (video 1 of 3)

LIVE from AUVSI Xponential 2024 – Inside Unmanned Systems – Oren Elkayam – Mobilicom

Unmanned Aircraft Systems | Embry Riddle Aeronautical University

Paving the way for Unmanned Aerial Systems| Embry-Riddle Aeronautical University (ERAU)

Ten Everyday Machine Learning Use Cases

Real project experience: What BA does in Machine learning projects

10 Mind-Blowing Facts About Quantum AI