THE FUTURE IS HERE

Reinforcement Learning from Human Feedback – The success behind ChatGPT

​Agenda of 46th Meetup [ Online ]
​​7:00 PM IST – 7:40 PM IST: Reinforcement Learning from Human Feedback – The success behind ChatGPT by Souradip Chakraborty

​[ Ph.D. CS at the University of Maryland, Past: Walmart Labs, Google Developers Expert-ML ]

​​7:40 PM IST onwards: QnA Session for Souradip’s Talk

​About the Speaker

​Souradip is currently a 2nd-year Ph.D. Computer Science student at the University of Maryland, College Park, working in the Foundations of Reinforcement Learning in Sequential Decision Making. His goal is to develop large-scale robust algorithms for sequential decision-making tasks under practical and challenging limitations to make Safe, Fair, Robust, and Aligned to Human behavior & Preferences – bridge the Gap b/w Theory and Practice.

​He recently received the Outstanding Paper Award, TSRML at Neurips2022 and Outstanding Reviewer Awards, Neurips 2022, AISTATS 2023. As a part of the Ph.D. program, he has published in venues including ICML, Neurips, AAAI, CoRL, and ICRA.

​In the past, Souradip has worked for 3 years as a Research AI Scientist at Walmart Labs, India after completing my Masters from the Indian Statistical Institute in 2018 summa cum laude and also a Google Developers Expert in Machine Learning (2019). Co-authored several US patents and top-tier publications in the field of AI & ML applications in the NLP and Computer Vision domain as a part of Walmart Labs and GDE-ML.

​​Connect with Souradip on LinkedIn here

​​[ All above timings are in IST ( Indian Standard Time ) ]

​​Have any queries regarding the meetup? Drop a mail to kdmdelhincr@gmail.com