DeepMind x UCL RL Lecture Series - Model-free Control [6/13]
Video Link: https://www.youtube.com/watch?v=t9uf9cuogBo
Research Scientist Hado van Hasselt covers prediction algorithms for policy improvement, leading to algorithms that can learn good behaviour policies from sampled experience.
Slides: https://dpmd.ai/modelfreecontrol
Full video lecture series: https://dpmd.ai/DeepMindxUCL21