General Game-Theoretic Multiagent Reinforcement Learning

Published on ● Video Link: https://www.youtube.com/watch?v=lPxFn1-mhVY



Duration: 40:25
1,339 views
30


Marc Lanctot (DeepMind)
https://simons.berkeley.edu/talks/general-game-theoretic-multiagent-reinforcement-learning
Multi-Agent Reinforcement Learning and Bandit Learning

Regret minimizing agents in self-play have been used to learn approximate minimax-optimal strategies with much success, scaling to large hold’em poker games and to super-human level performance in very large multiplayer games. This prescriptive approach has guided the development of algorithms for two-player zero-sum games, and similarly for fully-cooperative games. What about the fully general case– what could a prescriptive agenda look like there? Is there an agent-centric criterion that can be optimized without relying on outside authorities or third parties? In this talk, I will quickly survey the recent approaches to game-theoretic multiagent reinforcement learning in general games, and then focus on ideas that could attempt to answer these open questions in multiagent reinforcement learning.




Other Videos By Simons Institute for the Theory of Computing


2022-05-11Black Holes and the Quantum-Extended Church-Turing Thesis | Quantum Colloquium
2022-05-06The Role of Conventions in Adaptive Human-AI Interaction
2022-05-06Learning Decentralized Policies in Multiagent Systems: How to Learn Efficiently and ...
2022-05-06No-Regret Learning in Extensive-Form Games
2022-05-06Learning and Equilibrium Refinements
2022-05-05Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
2022-05-05Multi-Player Bandits With No Collisions
2022-05-05What Does Machine Learning Offer Game Theory (And Vice Versa)?
2022-05-05Variants and Invariants in No-Regret Algorithms
2022-05-05When Is Offline Two-Player Zero-Sum Markov Game Solvable?
2022-05-05General Game-Theoretic Multiagent Reinforcement Learning
2022-05-05Kernelized Multiplicative Weights for 0/1-Polyhedral Games:...
2022-05-04Multi-Agent Reinforcement Learning Towards Zero-Shot Communication
2022-05-04Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning
2022-05-04Independent Learning in Stochastic Games
2022-05-04On Rewards in Multi-Agent Systems
2022-05-04Learning Automata as Building Blocks for MARL
2022-05-04Efficient Error Correction in Neutral Atoms via Erasure Conversion | Quantum Colloquium
2022-05-04Multi-Agent Reinforcement Learning in the High Population Regime
2022-05-04A Regret Minimization Approach to Mutli-Agent Control and RL
2022-05-03The Complexity of Markov Equilibrium in Stochastic Games



Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Multi-Agent Reinforcement Learning and Bandit Learning
Marc Lanctot