THE FUTURE IS HERE

SARSA Temporal Difference Learning in Python from Scratch with OpenAI Gym – Reinforcement Learning

#machinelearning #reinforcementlearning #controltheory #robotics #pythontutorial #python #openai #openaigym #gymnasium #mechatronics #controlengineering #controlsystems

It takes a significant amount of time and energy to create these free video tutorials. You can support my efforts by making a PayPal donation or by becoming a Patreon:
PayPal: https://www.paypal.me/AleksandarHaber
Patreon: https://www.patreon.com/user?u=32080176&fan_landing=true

The post accompanying this video tutorial is given here: https://aleksandarhaber.com/explanation-and-python-implementation-of-on-policy-sarsa-temporal-difference-learning-reinforcement-learning-tutorial/

In this reinforcement learning tutorial, we explain how to implement the SARSA temporal difference learning algorithm in Python and how to test the algorithm behavior by using the OpenAI Gym. SARSA stands for State-Action-Reward-State-Action, and SARSE temporal difference learning is a very important algorithm that serves as the basis for the development of other algorithms. We explain how to write a Python class that implements the temporal difference learning algorithm.