David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

Channel: Lex Fridman

Subscribers: 4,820,000

Published on April 3, 2020 9:16:46 PM ● Video Link: https://www.youtube.com/watch?v=uPUEq8d73JI

Category: Show

Duration: 1:48:01

365,548 views

8,828

David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero, co-lead on AlphaStar and MuZero, and has done a lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
- MasterClass: https://masterclass.com/lex
- Cash App - use code "LexPodcast" and download:
- Cash App (App Store): https://apple.co/2sPrUHe
- Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

PODCAST INFO:
Podcast website:
https://lexfridman.com/podcast
Apple Podcasts:
https://apple.co/2lwqZIr
Spotify:
https://spoti.fi/2nEwCF8
RSS:
https://lexfridman.com/feed/podcast/
Full episodes playlist:
https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4
Clips playlist:
https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41

OUTLINE:
0:00 - Introduction
4:09 - First program
11:11 - AlphaGo
21:42 - Rules of the game of Go
25:37 - Reinforcement learning: personal journey
30:15 - What is reinforcement learning?
43:51 - AlphaGo (continued)
53:40 - Supervised learning and self play in AlphaGo
1:06:12 - Lee Sedol's retirement from Go play
1:08:57 - Garry Kasparov
1:14:10 - AlphaZero and self play
1:31:29 - Creativity in AlphaZero
1:35:21 - AlphaZero applications
1:37:59 - Reward functions
1:40:51 - Meaning of life

CONNECT:
- Subscribe to this YouTube channel
- Twitter: https://twitter.com/lexfridman
- LinkedIn: https://www.linkedin.com/in/lexfridman
- Facebook: https://www.facebook.com/LexFridmanPage
- Instagram: https://www.instagram.com/lexfridman
- Medium: https://medium.com/@lexfridman
- Support on Patreon: https://www.patreon.com/lexfridman

Other Videos By Lex Fridman

2020-04-15	I'm Most Proud of Trying - Eric Weinstein | AI Podcast Clips
2020-04-14	Take Back MIT | Eric Weinstein and Lex Fridman
2020-04-13	Eric Weinstein: Geometric Unity and the Call for New Ideas & Institutions | Lex Fridman Podcast #88
2020-04-12	Richard Dawkins: Meaning of Life | AI Podcast Clips
2020-04-11	Richard Dawkins: Memes | AI Podcast Clips
2020-04-10	Richard Dawkins: The Programmer of the Simulation Came About Through Evolution | AI Podcast Clips
2020-04-09	Richard Dawkins: Evolution, Intelligence, Simulation, and Memes | Lex Fridman Podcast #87
2020-04-06	The Way Out | Lex Fridman (Original)
2020-04-04	AlphaZero and Self Play (David Silver, DeepMind) | AI Podcast Clips
2020-04-04	Consciousness is Not a Computation (Roger Penrose) | AI Podcast Clips
2020-04-03	David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86
2020-04-02	Forever oscillate between dissatisfaction and gratitude
2020-04-02	Measure passion not progress
2020-04-01	Escaping the Local Optimum of Low Expectation
2020-04-01	Roger Penrose: Infinite Cycles of the Universe Punctuated by Big Bang Singularities
2020-03-31	Roger Penrose: Physics of Consciousness and the Infinite Universe | Lex Fridman Podcast #85
2020-03-30	Nick Bostrom: Superintelligence | AI Podcast Clips
2020-03-28	Nick Bostrom: Experience Machine | AI Podcast Clips
2020-03-27	Nick Bostrom on the Joe Rogan Podcast Conversation About the Simulation | AI Podcast Clips
2020-03-26	Why is the Simulation Interesting to Elon Musk? (Nick Bostrom) | AI Podcast Clips
2020-03-25	Nick Bostrom: Simulation and Superintelligence | Lex Fridman Podcast #83

Tags: david silver, deep rl, deepmind, google, reinforcement learning, machine learning, deep learning, alphazero, muzero, artificial intelligence, agi, ai podcast, artificial intelligence podcast, lex fridman, lex podcast, lex mit, lex ai, lex jre, mit ai
