Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on December 1, 2020 8:10:19 AM ● Video Link: https://www.youtube.com/watch?v=0MYRwW6BdvU

Duration: 41:01

787 views

Ilja Kuzborskij (DeepMind)
https://simons.berkeley.edu/talks/tbd-238
Reinforcement Learning from Batch Data and Simulation

Other Videos By Simons Institute for the Theory of Computing

2020-12-03	Panel Discussion
2020-12-02	The Mean-Squared Error of Double Q-Learning
2020-12-02	Q-learning with Uniformly Bounded Variance
2020-12-02	Zap Stochastic Approximation and Implications to Q-Learning
2020-12-02	Computational/Statistical Gaps for Learning Neural Networks
2020-12-02	Uniform Offline Policy Evaluation (OPE) and Offline Learning in Tabular RL
2020-12-02	Batch Value-function Approximation with Only Realizability
2020-12-01	Reinforcement Learning using Generative Models for Continuous State and Action Space Systems
2020-12-01	Monte Carlo Sampling Approach to Solving Stochastic Multistage Programs
2020-12-01	Robust Learning of Stochastic Dynamical Systems
2020-12-01	Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
2020-12-01	An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
2020-11-30	Beyond Worst-Case: Instance-Dependent Optimality in Reinforcement Learning
2020-11-30	Learning Multi-Agent Collaborations With Decomposition
2020-11-30	Online Learning with A Lot of Batch Data
2020-11-24	Ahmed El Alaoui \| Fellows Lightning Talks \| 6th Annual Industry Day
2020-11-24	Computational Complexity of Statistical Inference \| Program Presentations \| 6th Annual Industry Day
2020-11-24	Computational Innovation and Data-Driven Biology \| Program Presentations \| 6th Annual Industry Day
2020-11-24	JP Morgan \| Industry Partner Lightning Talks \| 6th Annual Industry Day
2020-11-24	Microsoft Research \| Industry Partner Lightning Talks \| 6th Annual Industry Day
2020-11-24	Vidya Muthukumar \| Fellows Lightning Talks \| 6th Annual Industry Day

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Reinforcement Learning from Batch Data and Simulation

Ilja Kuzborskij

Channel	Latest
PurpleVanGo紫飯Go Ch.	6 hours ago
Терра Чорт	6 hours ago
George Arrancar	6 hours ago
X1TheGamer	6 hours ago
Rovix	6 hours ago
ARD Klassik	6 hours ago
The Box Man	6 hours ago
9AL Games	6 hours ago
Szafa Grafa	6 hours ago
Arlott Addict	6 hours ago
Ninjas in Pyjamas	6 hours ago
Axe Mobile Legends	7 hours ago
きりはch	7 hours ago
すだち風味 / Sudachi Piano	7 hours ago
MLBB eSports	7 hours ago
ねみゅさん	7 hours ago
Gamanji	7 hours ago
Phenomenal	7 hours ago
CPCC	7 hours ago
河馬	7 hours ago
ENMINUTOS	7 hours ago
Howhowgoose 皓皓鵝-遊戲頻道	7 hours ago
Robinoyo	7 hours ago
BRKsEDU	7 hours ago
Jack Pattillo	7 hours ago