Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator
Subscribers:
68,700
Published on ● Video Link: https://www.youtube.com/watch?v=iiE4ijPf27Y
Mihailo Jovanovic (USC)
https://simons.berkeley.edu/talks/tbd-255
Reinforcement Learning from Batch Data and Simulation
Other Videos By Simons Institute for the Theory of Computing
Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Mihailo Jovanovic
Reinforcement Learning from Batch Data and Simulation