What is Q* | Reinforcement learning 101 & Hypothesis
๐ Links
- Jim Fanโs tweet: https://twitter.com/DrJimFan/status/1728100123862004105
- Reinforcement learning deep dive: https://www.youtube.com/watch?v=i7q8bISGwMQ&t=380s
- Github: Q-learning AI to play snake game - https://www.crafters.ai/aitools/teach-ai-to-play-snake-q-learning-practice
- Lets verify step by step: https://arxiv.org/abs/2305.20050
- Tree of thought: https://arxiv.org/abs/2305.10601
- Graph of thought: https://arxiv.org/abs/2308.09687
๐๐ป About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com
#chatgpt #gpt4 #gpt5 #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #agent #reinforcementlearning
๐ Links
– Jim Fanโs tweet: https://twitter.com/DrJimFan/status/1728100123862004105
– Reinforcement learning deep dive: https://www.youtube.com/watch?v=i7q8bISGwMQ&t=380s
– Github: Q-learning AI to play snake game – https://www.crafters.ai/aitools/teach-ai-to-play-snake-q-learning-practice
– Lets verify step by step: https://arxiv.org/abs/2305.20050
– Tree of thought: https://arxiv.org/abs/2305.10601
– Graph of thought: https://arxiv.org/abs/2308.09687
๐๐ป About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com
#chatgpt #gpt4 #gpt5 #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #agent #reinforcementlearning