Learning Automata as Building Blocks for MARL
Subscribers:
68,700
Published on ● Video Link: https://www.youtube.com/watch?v=3-_LSemCbSw
Ann Nowe (Vrije Universiteit Brussel)
https://simons.berkeley.edu/talks/learning-automata-building-blocks-marl
Multi-Agent Reinforcement Learning and Bandit Learning
In this talk I will show that Learning Automata (LA), and more precisely Reward in Action update schemes are interesting building blocks for Multi-agent RL, both in bandit settings as well as stateful RL. Based on the theorem of Narendra and Wheeler we have convergence guarantees in n-person non-zero sum games. However, LA have also shown to be robust in more relaxed settings, such as queueing systems, where updates happen asynchronously and the feedback sent to the agents is delayed.
Other Videos By Simons Institute for the Theory of Computing
Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Multi-Agent Reinforcement Learning and Bandit Learning
Ann Nowe