Temporal Difference Models: Deep Model-free RL for Model-based

Video Link: https://www.youtube.com/watch?v=j-3nUkzMFA8



Duration: 19:50


Deep reinforcement learning (RL) has shown promising results for learning complex sequential decision-making behaviors in various environments. However, most successes have been confined to simulation, and results in real-world applications such as robotics are limited, largely due to the poor sample efficiency of typical deep RL algorithms. I will introduce temporal difference models (TDMs), an extension of goal-conditioned value functions that enables model-based planning at multiple time resolutions. TDMs generalize traditional predictive models, bridge the gap between model-based and off-policy model-free RL, and yield substantial improvements in sample efficiency without sacrificing asymptotic performance.
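To make the idea concrete, here is a minimal sketch of the horizon-conditioned Bellman target that a TDM trains against. This is an illustrative assumption based on the talk's description, not the speaker's exact implementation: the value function Q(s, a, g, tau) is conditioned on a goal g and a planning horizon tau, grounds out in a (negative) distance to the goal at tau = 0, and otherwise bootstraps from the best action at horizon tau - 1. The function name `tdm_target` and its arguments are hypothetical.

```python
import numpy as np

def tdm_target(next_state, goal, tau, q_next_max):
    """Sketch of a TDM Bellman target (hypothetical helper).

    At horizon tau == 0 the target is the negative distance between the
    state actually reached and the goal; for tau > 0 it bootstraps from
    max_a' Q(s', a', g, tau - 1), supplied here as q_next_max.
    """
    if tau == 0:
        # terminal horizon: score how close we got to the goal
        return -float(np.linalg.norm(next_state - goal))
    # nonzero horizon: standard off-policy bootstrap, with tau decremented
    return q_next_max

# toy usage: a 2-D state 0.5 away from the goal
s_next = np.array([1.0, 2.0])
goal = np.array([1.5, 2.0])
print(tdm_target(s_next, goal, tau=0, q_next_max=-3.5))  # -> -0.5
print(tdm_target(s_next, goal, tau=3, q_next_max=-3.5))  # -> -3.5
```

Because the target at tau = 0 is a state-space distance rather than a task reward, the learned Q-function doubles as a multi-step predictive model, which is what lets it be used inside a model-based planner.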

See more at https://www.microsoft.com/en-us/research/video/temporal-difference-models-deep-model-free-rl-model-based/

Tags:
microsoft research