The Mean-Squared Error of Double Q-Learning

Channel: Simons Institute for the Theory of Computing

Subscribers: 68,700

Published on December 3, 2020 6:38:57 AM ● Video Link: https://www.youtube.com/watch?v=pw3g2ARJUz0

Duration: 35:26

412 views

R. Srikant (University of Illinois at Urbana-Champaign)
https://simons.berkeley.edu/talks/tbd-246
Reinforcement Learning from Batch Data and Simulation

Other Videos By Simons Institute for the Theory of Computing

2020-12-05	Offline Reinforcement Learning and Model-Based Optimization
2020-12-04	Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator
2020-12-04	Policy Evaluation under Interference
2020-12-04	Stable Reinforcement Learning with Unbounded State Space
2020-12-04	Multiagent Reinforcement Learning: Rollout and Policy Iteration
2020-12-04	Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
2020-12-04	Nearly Minimax Optimal Reward-Free Reinforcement Learning
2020-12-03	Statistical Efficiency in Offline Reinforcement Learning
2020-12-03	Batch Policy Learning in Average Reward Markov Decision Processes
2020-12-03	Panel Discussion
2020-12-02	The Mean-Squared Error of Double Q-Learning
2020-12-02	Q-learning with Uniformly Bounded Variance
2020-12-02	Zap Stochastic Approximation and Implications to Q-Learning
2020-12-02	Computational/Statistical Gaps for Learning Neural Networks
2020-12-02	Uniform Offline Policy Evaluation (OPE) and Offline Learning in Tabular RL
2020-12-02	Batch Value-function Approximation with Only Realizability
2020-12-01	Reinforcement Learning using Generative Models for Continuous State and Action Space Systems
2020-12-01	Monte Carlo Sampling Approach to Solving Stochastic Multistage Programs
2020-12-01	Robust Learning of Stochastic Dynamical Systems
2020-12-01	Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
2020-12-01	An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay

Tags: Simons Institute, theoretical computer science, UC Berkeley, Computer Science, Theory of Computation, Theory of Computing, R. Srikant, Reinforcement Learning from Batch Data and Simulation
