David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

Channel:
Subscribers:
4,820,000
Published on ● Video Link: https://www.youtube.com/watch?v=uPUEq8d73JI



Category:
Show
Duration: 1:48:01
365,548 views
8,828


David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
- MasterClass: https://masterclass.com/lex
- Cash App - use code "LexPodcast" and download:
- Cash App (App Store): https://apple.co/2sPrUHe
- Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

PODCAST INFO:
Podcast website:
https://lexfridman.com/podcast
Apple Podcasts:
https://apple.co/2lwqZIr
Spotify:
https://spoti.fi/2nEwCF8
RSS:
https://lexfridman.com/feed/podcast/
Full episodes playlist:
https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4
Clips playlist:
https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41

OUTLINE:
0:00 - Introduction
4:09 - First program
11:11 - AlphaGo
21:42 - Rule of the game of Go
25:37 - Reinforcement learning: personal journey
30:15 - What is reinforcement learning?
43:51 - AlphaGo (continued)
53:40 - Supervised learning and self play in AlphaGo
1:06:12 - Lee Sedol retirement from Go play
1:08:57 - Garry Kasparov
1:14:10 - Alpha Zero and self play
1:31:29 - Creativity in AlphaZero
1:35:21 - AlphaZero applications
1:37:59 - Reward functions
1:40:51 - Meaning of life

CONNECT:
- Subscribe to this YouTube channel
- Twitter: https://twitter.com/lexfridman
- LinkedIn: https://www.linkedin.com/in/lexfridman
- Facebook: https://www.facebook.com/LexFridmanPage
- Instagram: https://www.instagram.com/lexfridman
- Medium: https://medium.com/@lexfridman
- Support on Patreon: https://www.patreon.com/lexfridman




Other Videos By Lex Fridman


2020-04-15I'm Most Proud of Trying - Eric Weinstein | AI Podcast Clips
2020-04-14Take Back MIT | Eric Weinstein and Lex Fridman
2020-04-13Eric Weinstein: Geometric Unity and the Call for New Ideas & Institutions | Lex Fridman Podcast #88
2020-04-12Richard Dawkins: Meaning of Life | AI Podcast Clips
2020-04-11Richard Dawkins: Memes | AI Podcast Clips
2020-04-10Richard Dawkins: The Programmer of the Simulation Came About Through Evolution | AI Podcast Clips
2020-04-09Richard Dawkins: Evolution, Intelligence, Simulation, and Memes | Lex Fridman Podcast #87
2020-04-06The Way Out | Lex Fridman (Original)
2020-04-04AlphaZero and Self Play (David Silver, DeepMind) | AI Podcast Clips
2020-04-04Consciousness is Not a Computation (Roger Penrose) | AI Podcast Clips
2020-04-03David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86
2020-04-02Forever oscillate between dissatisfaction and gratitude
2020-04-02Measure passion not progress
2020-04-01Escaping the Local Optimum of Low Expectation
2020-04-01Roger Penrose: Infinite Cycles of the Universe Punctuated by Big Bang Singularities
2020-03-31Roger Penrose: Physics of Consciousness and the Infinite Universe | Lex Fridman Podcast #85
2020-03-30Nick Bostrom: Superintelligence | AI Podcast Clips
2020-03-28Nick Bostrom: Experience Machine | AI Podcast Clips
2020-03-27Nick Bostrom on the Joe Rogan Podcast Conversation About the Simulation | AI Podcast Clips
2020-03-26Why is the Simulation Interesting to Elon Musk? (Nick Bostrom) | AI Podcast Clips
2020-03-25Nick Bostrom: Simulation and Superintelligence | Lex Fridman Podcast #83



Tags:
david silver
deep rl
deepmind
google
reinforcement learning
machine learning
deep learning
alphazero
muzero
artificial intelligence
agi
ai
ai podcast
artificial intelligence podcast
lex fridman
lex podcast
lex mit
lex ai
lex jre
mit ai