No-Regret Learning in Time-Varying Zero-Sum Games

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on May 3, 2022 9:41:16 AM ● Video Link: https://www.youtube.com/watch?v=cNyjGj9R2xE

Duration: 35:35

508 views

Haipeng Luo (University of Southern California)
https://simons.berkeley.edu/talks/no-regret-learning-time-varying-zero-sum-games
Multi-Agent Reinforcement Learning and Bandit Learning

Learning from repeated play in a fixed two-player zero-sum game is a classic problem in game theory and online learning. This talk focuses on a natural yet underexplored variant of this problem where the game payoff matrix changes over time, possibly in an adversarial manner. In the first part of the talk, I will discuss what the appropriate performance measures are for this problem (and argue that some measures from prior works might be unreasonable). In the second part of the talk, I will present a new parameter-free algorithm that simultaneously enjoys favorable guarantees under three different performance measures. These guarantees are adaptive to different non-stationarity measures of the payoff matrices and, importantly, recover the best known results when the payoff matrix is fixed. I will conclude the talk by discussing some future directions on the extensions to multi-player bandits and reinforcement learning.

Other Videos By Simons Institute for the Theory of Computing

2022-05-04	Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning
2022-05-04	Independent Learning in Stochastic Games
2022-05-04	On Rewards in Multi-Agent Systems
2022-05-04	Learning Automata as Building Blocks for MARL
2022-05-04	Efficient Error Correction in Neutral Atoms via Erasure Conversion \| Quantum Colloquium
2022-05-04	Multi-Agent Reinforcement Learning in the High Population Regime
2022-05-04	A Regret Minimization Approach to Mutli-Agent Control and RL
2022-05-03	The Complexity of Markov Equilibrium in Stochastic Games
2022-05-03	The Complexity of Infinite-Horizon General-Sum Stochastic Games: Turn-Based and Simultaneous Play
2022-05-03	Policy Gradients in General-Sum Dynamic Games: When Do They Even Converge?
2022-05-03	No-Regret Learning in Time-Varying Zero-Sum Games
2022-05-03	What is the Statistical Complexity of Reinforcement Learning?
2022-05-03	V-Learning: Simple, Efficient, Decentralized Algorithm for Multiagent RL
2022-05-02	"Calibeating": Beating Forecasters at Their Own Game
2022-04-30	Adjudicating Between Different Causal Accounts of Bell Inequality Violations
2022-04-30	Why Born Probabilities?
2022-04-30	Causal Discovery in the Quantum Context
2022-04-30	"Fine-Tuned", "Unfaithful", "Unnatural": Abuse of Terminology in Causal Modeling
2022-04-30	Causal Influence in Quantum Theory
2022-04-29	A Dynamic-Epistemic Approach to Conditionals
2022-04-29	A Bayesian Probability Calculus for Density Matrices

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Multi-Agent Reinforcement Learning and Bandit Learning

Haipeng Luo

Channel	Latest
けい	9 hours ago
The Silly Steve Show	12 hours ago
Shazam Sakazaki	13 hours ago
血夜の檸檬	13 hours ago
Hobbynize Blog	13 hours ago
Nao BGR	13 hours ago
ぴノまるGame	14 hours ago
OPUS ASTORA	14 hours ago
Bring the Asteroid	14 hours ago
Kitab Gaming	14 hours ago
Reyju Gaming	14 hours ago
AussieAntics	14 hours ago
SamuraiTacos1	14 hours ago
VIDΣGΛMMΛ	14 hours ago
Shravan Srinivasan	14 hours ago
Ib Gaming	14 hours ago
Gotenks0002	15 hours ago
アベレージ / Average Channel	15 hours ago
Two Bros' Game Night	15 hours ago
SUPER SOCCER RUBRO NEGRO -ᄅ-	15 hours ago
The Other Guy	15 hours ago
OMNIxEVIL	15 hours ago
Seer CRZ	15 hours ago
Dreezus	15 hours ago
Daryus P	15 hours ago