DeepMind x UCL RL Lecture Series - Multi-step & Off Policy [11/13]
Video Link: https://www.youtube.com/watch?v=u84MFu1nG4g
Research Scientist Hado van Hasselt discusses multi-step and off-policy algorithms, including various techniques for variance reduction.
Slides: https://dpmd.ai/offpolicy
Full video lecture series: https://dpmd.ai/DeepMindxUCL21