Confident Off-policy Evaluation and Selection through Self-Normalized Importance Weighting
Video Link: https://www.youtube.com/watch?v=0MYRwW6BdvU
Ilja Kuzborskij (DeepMind)
https://simons.berkeley.edu/talks/tbd-238
Reinforcement Learning from Batch Data and Simulation
Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing