How Do AI Models Like Claude & ChatGPT Get So Smart?

Channel:

Game Thinking TV

Subscribers:

15,500

Published on September 25, 2025 3:13:00 PM ● Video Link: https://www.youtube.com/watch?v=CKN72Rf5yrE

Duration: 0:00

178 views

Have you ever wondered how AI systems like ChatGPT, Claude, & Gemini get so smart? In this episode, Anthropic engineer Hansohl Kim breaks down reinforcement learning — the training method that helps large language models go beyond data memorization to learn from feedback, and learn right from wrong.

In this video you'll learn: the difference between supervised learning & reinforcement learning, why RL is crucial for alignment and safety, & how it shapes the “personality” of AI systems like Claude.

CHAPTERS
0:00 Introduction
0:12 What is Supervised Learning?
2:23 What is Reinforcement Learning?
4:40 How Supervised & Reinforcement Learning Work Together

KEY INSIGHTS
🎯 Reinforcement learning trains AI with feedback, not just “right answers.”
🎯 RL is essential for alignment, teaching models how to behave responsibly.
🎯 Pre-training gives AI knowledge, while reinforcement learning teaches it how to use that knowledge.

HANSOHL KIM RELATED LINKS
🌐 Anthropic – https://www.anthropic.com/
💼 Hansohl Kim on LinkedIn – linkedin.com/in/hansohl

🔔 DON’T MISS OUT
Subscribe and hit the bell for more game-changing product design & AI insights.

#reinforcementlearning #AI #machinelearning #gamethinking #anthropic

------------------------------------------------------------------------------------

📚 ABOUT OUR CHANNEL📚
We deconstruct breakout games & apps to help you innovate smarter and find product/market fit. Hosted by Amy Jo Kim - Game Designer & Startup Coach - prev. Rock Band, The Sims, Covet Fashion Happify, Netflix.

Check out our channel here:

🔔 Don’t forget to subscribe! 🔔

LEARN MORE ABOUT GAME THINKING
Check out our rapid innovation programs for product leaders.👍
https://www.gamethinking.io/programs

Join our free online community 📣 and get in on exclusive free events at
https://gamethinking.io/gschool

Read our Game Thinking book 📘 at
https://gamethinking.io/book/

FIND US AT 👇
https://gamethinking.io/

GET IN TOUCH 👍
support@gamethinking.io

FOLLOW US ON SOCIAL 📱
Get updates or reach out to Get updates on our Social Media Profiles!
https://x.com/amyjokim
https://www.linkedin.com/in/amyjokim/
https://amyjokim.medium.com/

Game Thinking TV

Other Videos By Game Thinking TV

3 days ago	Why AI is So Addictive!
2025-09-30	Can AI Really Think Like Us?
2025-09-25	How Do AI Models Like Claude & ChatGPT Get So Smart?
2025-09-19	What is reinforcement learning? #shorts
2025-09-16	What is supervised learning? #shorts
2025-09-12	Can AI Really Make Products By Itself? #shorts
2025-09-10	How AI is Revolutionizing Product Discovery & Design
2025-09-08	Is AI just fancy autocomplete? #shorts
2025-09-04	How well can AI mimic real people? #shorts
2025-09-02	Are synthetic users a revolution or a scam? #shorts
2025-08-31	Somebody's watchin' me #catlove #catshorts #caturday #cat
2025-08-29	Trust drives long-term retention. #shorts
2025-08-29	Why Fast Research Beats Perfect Research!
2025-08-28	Do SYNTHETIC USERS help or hurt product design?
2025-08-27	Integrity over rewards. #shorts
2025-08-20	Why I Left University Research Behind FOR GOOD
2025-08-18	Real ideas win with integrity. #shorts
2025-08-08	How to build trust with your customers #shorts
2025-08-06	Behavioral NUDGES Work Only If You Know This SECRET \| Amy Bucher
2025-08-04	Behavior change is easy if... #shorts
2025-07-24	The secret to habits that stick. #shorts

Channel	Latest
Easy Shot	6 hours ago
西片	6 hours ago
Harika Panda	6 hours ago
1ShotPlays	6 hours ago
GamePlay Adventure	6 hours ago
laSexta	6 hours ago
NayTendo	6 hours ago
Heves THG	6 hours ago
AlphaReplay	6 hours ago
HelyaLP	6 hours ago
Crazy Dentist	6 hours ago
Frost Prime	6 hours ago
https://www.youtube.com/playlist?list=LL80kutrYd1SAiqrkMVT8k1w	6 hours ago
Scrawl	6 hours ago
La Chaine de Fire	6 hours ago
AidariaLp	6 hours ago
AD The Smeargle	6 hours ago
Mr. Ray	6 hours ago
Точка зрения	6 hours ago
Sneili Sneils	6 hours ago
Foxboy Saga Superstars	6 hours ago
TaoWB	6 hours ago
chibirobo12	6 hours ago
Uno TheFirstOne	6 hours ago
Piotr “Kusiorr” Kłusek	6 hours ago