Zap Stochastic Approximation and Implications to Q-Learning

Channel:

Subscribers:

68,700

Published on December 3, 2020 5:32:50 AM ● Video Link: https://www.youtube.com/watch?v=_3CNm-Fc828

Duration: 28:41

564 views

Sean Meyn (University of Florida)
https://simons.berkeley.edu/talks/tbd-244
Reinforcement Learning from Batch Data and Simulation

Other Videos By Simons Institute for the Theory of Computing

2020-12-04	Policy Evaluation under Interference
2020-12-04	Stable Reinforcement Learning with Unbounded State Space
2020-12-04	Multiagent Reinforcement Learning: Rollout and Policy Iteration
2020-12-04	Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
2020-12-04	Nearly Minimax Optimal Reward-Free Reinforcement Learning
2020-12-03	Statistical Efficiency in Offline Reinforcement Learning
2020-12-03	Batch Policy Learning in Average Reward Markov Decision Processes
2020-12-03	Panel Discussion
2020-12-02	The Mean-Squared Error of Double Q-Learning
2020-12-02	Q-learning with Uniformly Bounded Variance
2020-12-02	Zap Stochastic Approximation and Implications to Q-Learning
2020-12-02	Computational/Statistical Gaps for Learning Neural Networks
2020-12-02	Uniform Offline Policy Evaluation (OPE) and Offline Learning in Tabular RL
2020-12-02	Batch Value-function Approximation with Only Realizability
2020-12-01	Reinforcement Learning using Generative Models for Continuous State and Action Space Systems
2020-12-01	Monte Carlo Sampling Approach to Solving Stochastic Multistage Programs
2020-12-01	Robust Learning of Stochastic Dynamical Systems
2020-12-01	Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
2020-12-01	An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
2020-11-30	Beyond Worst-Case: Instance-Dependent Optimality in Reinforcement Learning
2020-11-30	Learning Multi-Agent Collaborations With Decomposition

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Ana Busic

Reinforcement Learning from Batch Data and Simulation

Sean Meyn

Channel	Latest
아루우	6 hours ago
Nostradamus	6 hours ago
OUDO - ON THE RIFT	6 hours ago
Foxline	6 hours ago
S-Tavo Plays	6 hours ago
Ictfix.net	6 hours ago
Winkazi	6 hours ago
Samanta Gamer	7 hours ago
smskcntr	7 hours ago
Texshanfor Ferdi	7 hours ago
AhtmosTV	7 hours ago
ScarletMarisa375	7 hours ago
OtakuPT	7 hours ago
Koragg Wolzard WolfThunderRangerKilleranger34*	7 hours ago
Insert Coin	7 hours ago
Justmaiko Gaming	7 hours ago
Crainer	7 hours ago
Overdrive	7 hours ago
Adri’s On Fire	7 hours ago
Game Guides Channel	7 hours ago
GemplayTV	7 hours ago
Sveneta	7 hours ago
ImpulseDm	7 hours ago
Is It Playable?	7 hours ago
GrizzoUK	7 hours ago