How Do AI Models Like Claude & ChatGPT Get So Smart?
Have you ever wondered how AI systems like ChatGPT, Claude, & Gemini get so smart? In this episode, Anthropic engineer Hansohl Kim breaks down reinforcement learning — the training method that helps large language models go beyond data memorization to learn from feedback, and learn right from wrong.
In this video you'll learn: the difference between supervised learning & reinforcement learning, why RL is crucial for alignment and safety, & how it shapes the “personality” of AI systems like Claude.
CHAPTERS
0:00 Introduction
0:12 What is Supervised Learning?
2:23 What is Reinforcement Learning?
4:40 How Supervised & Reinforcement Learning Work Together
KEY INSIGHTS
🎯 Reinforcement learning trains AI with feedback, not just “right answers.”
🎯 RL is essential for alignment, teaching models how to behave responsibly.
🎯 Pre-training gives AI knowledge, while reinforcement learning teaches it how to use that knowledge.
HANSOHL KIM RELATED LINKS
🌐 Anthropic – https://www.anthropic.com/
💼 Hansohl Kim on LinkedIn – linkedin.com/in/hansohl
🔔 DON’T MISS OUT
Subscribe and hit the bell for more game-changing product design & AI insights.
#reinforcementlearning #AI #machinelearning #gamethinking #anthropic
------------------------------------------------------------------------------------
📚 ABOUT OUR CHANNEL📚
We deconstruct breakout games & apps to help you innovate smarter and find product/market fit. Hosted by Amy Jo Kim - Game Designer & Startup Coach - prev. Rock Band, The Sims, Covet Fashion Happify, Netflix.
Check out our channel here:
🔔 Don’t forget to subscribe! 🔔
LEARN MORE ABOUT GAME THINKING
Check out our rapid innovation programs for product leaders.👍
https://www.gamethinking.io/programs
Join our free online community 📣 and get in on exclusive free events at
https://gamethinking.io/gschool
Read our Game Thinking book 📘 at
https://gamethinking.io/book/
FIND US AT 👇
https://gamethinking.io/
GET IN TOUCH 👍
support@gamethinking.io
FOLLOW US ON SOCIAL 📱
Get updates or reach out to Get updates on our Social Media Profiles!
https://x.com/amyjokim
https://www.linkedin.com/in/amyjokim/
https://amyjokim.medium.com/
Game Thinking TV