Uniform Offline Policy Evaluation (OPE) and Offline Learning in Tabular RL
Subscribers:
68,700
Published on ● Video Link: https://www.youtube.com/watch?v=7Vam6NVFMII
Yu-Xiang Wang (UC Santa Barbara)
https://simons.berkeley.edu/talks/tbd-243
Reinforcement Learning from Batch Data and Simulation
Other Videos By Simons Institute for the Theory of Computing
Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Yu-Xiang Wang
Reinforcement Learning from Batch Data and Simulation