Policy Gradient Methods: Tutorial and New Frontiers

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=y4ci8whvS1E



Category:
Tutorial
Duration: 1:09:20
12,399 views
213


In this tutorial we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion, and flight. They have also achieved the state of the art in discrete action domains such as Atari. We will provide a unifying overview of a variety of different policy gradient methods, and we will also discuss the formalism of stochastic computation graphs for computing gradients of expectations. 

See more on this video at https://www.microsoft.com/en-us/research/video/policy-gradient-methods-tutorial-new-frontiers/







Tags:
microsoft research