DeepMind x UCL RL Lecture Series - Model-free Control [6/13]

Subscribers:
663,000
Published on ● Video Link: https://www.youtube.com/watch?v=t9uf9cuogBo



Duration: 1:40:41
16,637 views
156


Research Scientist Hado van Hasselt covers prediction algorithms for policy improvement, leading to algorithms that can learn good behaviour policies from sampled experience.

Slides: https://dpmd.ai/modelfreecontrol
Full video lecture series: https://dpmd.ai/DeepMindxUCL21




Other Videos By Google DeepMind


2022-02-07Let's get physical - DeepMind: The Podcast (S2, Ep4)
2022-01-31Better together - DeepMind: The Podcast (S2, Ep3)
2022-01-25A breakthrough unfolds - DeepMind: The Podcast (S2, Ep1)
2022-01-25Speaking of intelligence - DeepMind: The Podcast (S2, Ep2)
2022-01-10DeepMind: The Podcast (S2 trailer)
2021-09-09DeepMind x UCL RL Lecture Series - Introduction to Reinforcement Learning [1/13]
2021-09-09DeepMind x UCL RL Lecture Series - Exploration & Control [2/13]
2021-09-09DeepMind x UCL RL Lecture Series - MDPs and Dynamic Programming [3/13]
2021-09-09DeepMind x UCL RL Lecture Series - Theoretical Fund. of Dynamic Programming Algorithms [4/13]
2021-09-09DeepMind x UCL RL Lecture Series - Model-free Prediction [5/13]
2021-09-09DeepMind x UCL RL Lecture Series - Model-free Control [6/13]
2021-09-09DeepMind x UCL RL Lecture Series - Function Approximation [7/13]
2021-09-09DeepMind x UCL RL Lecture Series - Planning & models [8/13]
2021-09-09DeepMind x UCL RL Lecture Series - Policy-Gradient and Actor-Critic methods [9/13]
2021-09-09DeepMind x UCL RL Lecture Series - Approximate Dynamic Programming [10/13]
2021-09-09DeepMind x UCL RL Lecture Series - Multi-step & Off Policy [11/13]
2021-09-09DeepMind x UCL RL Lecture Series - Deep Reinforcement Learning #1 [12/13]
2021-09-09DeepMind x UCL RL Lecture Series - Deep Reinforcement Learning #2 [13/13]
2021-07-27Open-Ended Learning Leads to Generally Capable Agents | Results Showreel
2021-07-22AlphaFold Protein Structure Database
2021-01-05NeurIPS 2020: JAX Ecosystem Meetup