Batch Policy Learning in Average Reward Markov Decision Processes

Published on ● Video Link: https://www.youtube.com/watch?v=UUjcql5__44



Duration: 31:35
542 views
11


Peng Liao (Harvard)
https://simons.berkeley.edu/talks/tbd-247
Reinforcement Learning from Batch Data and Simulation







Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Peng Liao
Reinforcement Learning from Batch Data and Simulation