Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator

Channel:

Subscribers:

68,700

Published on December 5, 2020 6:38:35 AM ● Video Link: https://www.youtube.com/watch?v=iiE4ijPf27Y

Duration: 32:36

880 views

Mihailo Jovanovic (USC)
https://simons.berkeley.edu/talks/tbd-255
Reinforcement Learning from Batch Data and Simulation

Other Videos By Simons Institute for the Theory of Computing

2020-12-09	Price of Active Security in Multiparty Computation
2020-12-08	Game-Theoretically Secure Protocols Inspired by Blockchains
2020-12-08	Lightning Network Economics: Cost-Minimal Channels and their Implications for Network Structure
2020-12-08	Local Proofs Approaching the Witness Length
2020-12-08	Lower Bounds for Off-Chain Protocols: Exploring the Limits of Plasma c
2020-12-08	Central Bank Digital Currency: When Price and Bank Stability Collide
2020-12-08	Blockchain as Regulatory Technology: from Code is law to Law as Code
2020-12-07	Subquadratic SNARGs in the Random Oracle Model
2020-12-05	Panel Discussion
2020-12-05	Offline Reinforcement Learning and Model-Based Optimization
2020-12-04	Convergence and Sample Complexity of Gradient Methods for the Model-Free Linear Quadratic Regulator
2020-12-04	Policy Evaluation under Interference
2020-12-04	Stable Reinforcement Learning with Unbounded State Space
2020-12-04	Multiagent Reinforcement Learning: Rollout and Policy Iteration
2020-12-04	Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
2020-12-04	Nearly Minimax Optimal Reward-Free Reinforcement Learning
2020-12-03	Statistical Efficiency in Offline Reinforcement Learning
2020-12-03	Batch Policy Learning in Average Reward Markov Decision Processes
2020-12-03	Panel Discussion
2020-12-02	The Mean-Squared Error of Double Q-Learning
2020-12-02	Q-learning with Uniformly Bounded Variance

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Mihailo Jovanovic

Reinforcement Learning from Batch Data and Simulation