Batch Policy Learning in Average Reward Markov Decision Processes
Subscribers:
68,700
Published on ● Video Link: https://www.youtube.com/watch?v=UUjcql5__44
Peng Liao (Harvard)
https://simons.berkeley.edu/talks/tbd-247
Reinforcement Learning from Batch Data and Simulation
Other Videos By Simons Institute for the Theory of Computing
Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Peng Liao
Reinforcement Learning from Batch Data and Simulation