Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator

Video Link: https://www.youtube.com/watch?v=iiE4ijPf27Y



Duration: 32:36


Mihailo Jovanovic (USC)
https://simons.berkeley.edu/talks/tbd-255
Reinforcement Learning from Batch Data and Simulation







Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Mihailo Jovanovic
Reinforcement Learning from Batch Data and Simulation