Policy Gradient Methods: Tutorial and New Frontiers

Channel:

Microsoft Research

Subscribers:

351,000

Published on August 27, 2017 3:24:33 AM ● Video Link: https://www.youtube.com/watch?v=y4ci8whvS1E

Category:

Tutorial

Duration: 1:09:20

12,399 views

213

In this tutorial we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion, and flight. They have also achieved the state of the art in discrete action domains such as Atari. We will provide a unifying overview of a variety of different policy gradient methods, and we will also discuss the formalism of stochastic computation graphs for computing gradients of expectations.

See more on this video at https://www.microsoft.com/en-us/research/video/policy-gradient-methods-tutorial-new-frontiers/

Other Videos By Microsoft Research

2017-09-07	Fast Quantification of Uncertainty and Robustness with Variational Bayes
2017-09-07	Understanding the Rapidly Developing Field of Mobile Mental Health
2017-09-06	Low Latency Displays for Augmented Reality
2017-09-04	High-Accuracy Neural-Network Models for Speech Enhancement
2017-09-04	Position Tracking for Virtual Reality using Wi-Fi
2017-09-04	Speech Emotion Recognition with Convolutional Neural Networks
2017-08-28	The Malmo Collaborative AI Challenge
2017-08-28	Counterfactual Multi-Agent Policy Gradients
2017-08-28	Probabilistic Machine Learning and AI
2017-08-26	Design - On the Human Side
2017-08-26	Policy Gradient Methods: Tutorial and New Frontiers
2017-08-21	Improving trust in the compilation from F* to C
2017-08-21	Data Science Summer School 2017: Student Trajectories and School Choice in NYC
2017-08-21	Keynote: The Interplay of Agent and Market Design
2017-08-21	From Visual Sensing to Visual Intelligence
2017-08-21	Post-quantum cryptography from supersingular isogeny problems?
2017-08-21	What (and How) Can Linked-View Visualization tell us about the Universe, and Brains?
2017-08-21	Understanding Black-box Predictions via Influence Functions
2017-08-17	Fontlings In Story Baker Demo 9
2017-08-17	Fontlings Demo 8
2017-08-17	Robot Acting Sentences Demo

Tags:

microsoft research

Channel	Latest
Simple Gamer	6 hours ago
RedCaio	6 hours ago
A TUTTO CALCIO⚽	6 hours ago
Zaxx Gaming	6 hours ago
LEO DESANDE E ANA CLÁUDIA	6 hours ago
Starzkil1z	6 hours ago
rickX lods official	6 hours ago
WraggyTheGamer	6 hours ago
Böröcz "DeadFox" Bence	6 hours ago
Joey Fernandez	6 hours ago
Drachinifel	6 hours ago
UmmeBlox	6 hours ago
Hutton	6 hours ago
CANAL DO MARCIO 🎮🕹	6 hours ago
なすななし	6 hours ago
COSEF NASTYA	6 hours ago
จุ่มค่ะ มากับนุ่นแล้วก็มากับโบว์	6 hours ago
ADIT DIAMOND	6 hours ago
D R P O O - FF	6 hours ago
Ini Guru Budi	6 hours ago
HaDDGamer YT	6 hours ago
Gamer of Andhra	6 hours ago
WBG LEADER	6 hours ago
ちょぶり【eFootball解説】	6 hours ago
AB Sujeet	6 hours ago