What is the Statistical Complexity of Reinforcement Learning?

Channel:

Simons Institute for the Theory of Computing

Published on May 3, 2022 7:27:56 AM ● Video Link: https://www.youtube.com/watch?v=IBr8wmEeWnM

Duration: 53:01

1,705 views

Sham Kakade (Harvard and MSR)
https://simons.berkeley.edu/talks/what-statistical-complexity-reinforcement-learning
Multi-Agent Reinforcement Learning and Bandit Learning

A fundamental question in the theory of reinforcement learning is what (representational or structural) conditions govern our ability to generalize and avoid the curse of dimensionality. For supervised learning, these questions are well understood: practically, we have overwhelming evidence of the value of representation learning (say, through modern deep networks) as a means for sample-efficient learning, and, theoretically, there are well-known complexity measures (e.g., the VC dimension and Rademacher complexity) that govern the statistical complexity of learning. Providing an analogous theory for reinforcement learning is far more challenging: even the structural conditions that support sample-efficient generalization are far less well understood. This talk will highlight recent advances towards characterizing when generalization is possible in reinforcement learning (in both online and offline settings), focusing on both necessary and sufficient conditions. In particular, we will introduce a new complexity measure, the Decision-Estimation Coefficient, that is proven to be necessary (and, essentially, sufficient) for sample-efficient interactive learning.

Other Videos By Simons Institute for the Theory of Computing

2022-05-04	Independent Learning in Stochastic Games
2022-05-04	On Rewards in Multi-Agent Systems
2022-05-04	Learning Automata as Building Blocks for MARL
2022-05-04	Efficient Error Correction in Neutral Atoms via Erasure Conversion | Quantum Colloquium
2022-05-04	Multi-Agent Reinforcement Learning in the High Population Regime
2022-05-04	A Regret Minimization Approach to Multi-Agent Control and RL
2022-05-03	The Complexity of Markov Equilibrium in Stochastic Games
2022-05-03	The Complexity of Infinite-Horizon General-Sum Stochastic Games: Turn-Based and Simultaneous Play
2022-05-03	Policy Gradients in General-Sum Dynamic Games: When Do They Even Converge?
2022-05-03	No-Regret Learning in Time-Varying Zero-Sum Games
2022-05-03	What is the Statistical Complexity of Reinforcement Learning?
2022-05-03	V-Learning: Simple, Efficient, Decentralized Algorithm for Multiagent RL
2022-05-02	"Calibeating": Beating Forecasters at Their Own Game
2022-04-30	Adjudicating Between Different Causal Accounts of Bell Inequality Violations
2022-04-30	Why Born Probabilities?
2022-04-30	Causal Discovery in the Quantum Context
2022-04-30	"Fine-Tuned", "Unfaithful", "Unnatural": Abuse of Terminology in Causal Modeling
2022-04-30	Causal Influence in Quantum Theory
2022-04-29	A Dynamic-Epistemic Approach to Conditionals
2022-04-29	A Bayesian Probability Calculus for Density Matrices
2022-04-29	Instrumental Variables in Sparse and Dynamical Settings

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Multi-Agent Reinforcement Learning and Bandit Learning

Sham Kakade
