Markov Persuasion Process and its Reinforcement Learning

Video Link: https://www.youtube.com/watch?v=zUmXByRRB0c



Duration: 46:00


Haifeng Xu (University of Chicago)
https://simons.berkeley.edu/talks/tbd-476
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond

A standard Markov decision process (MDP) features a single planner who observes the underlying state of the world and then acts. This talk studies a natural variant of this fundamental model in which one agent observes the state while another agent acts. Such sequential interactions among different agents arise in various recommender systems, such as ride-sharing platforms and content recommendation. When the agents have different incentives, the state-informed agent can partially reveal information about the realized state at each round in order to influence the actor and steer their collective actions toward a desirable outcome. We coin this problem the Markov Persuasion Process (MPP), inspired by the celebrated recent literature on Bayesian persuasion. We will discuss both computational and reinforcement learning questions in MPPs.
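To make the interaction pattern concrete, here is a minimal sketch of a few rounds of such a process. All specifics (the two-state world, the prior, the payoff threshold, the transition probabilities) are illustrative assumptions, not details from the talk; the signaling scheme shown is the textbook Bayesian persuasion construction, where the sender recommends "act" just often enough in the bad state that the recommendation remains credible.

```python
import random

# Illustrative sketch of a sequential persuasion round: one agent (the
# sender) observes the state, another agent (the receiver) acts.
# States, prior, and dynamics below are hypothetical assumptions.

PRIOR_GOOD = 0.3  # receiver's prior probability that the state is "good"

def sender_signal(state, rng):
    """Sender observes the state and sends a signal ("act" or "pass").

    Classic persuasion scheme: always recommend "act" in the good state;
    in the bad state, recommend "act" with probability p chosen so the
    receiver's posterior on "good" given "act" is exactly 1/2:
        0.3 / (0.3 + 0.7 * p) = 0.5  =>  p = 0.3 / 0.7
    """
    p_act_given_bad = PRIOR_GOOD / (1 - PRIOR_GOOD)
    if state == "good":
        return "act"
    return "act" if rng.random() < p_act_given_bad else "pass"

def receiver_action(signal):
    """Receiver best-responds: acting is worthwhile iff the posterior on
    "good" is at least 1/2, which holds exactly on the signal "act"."""
    return "act" if signal == "act" else "pass"

def transition(state, action, rng):
    """Markov transition: the (state, action) pair drives the next state.
    (Purely illustrative dynamics.)"""
    p_good_next = 0.6 if action == "act" else 0.2
    return "good" if rng.random() < p_good_next else "bad"

rng = random.Random(0)
state = "good" if rng.random() < PRIOR_GOOD else "bad"
for t in range(3):
    sig = sender_signal(state, rng)
    act = receiver_action(sig)
    print(t, state, sig, act)
    state = transition(state, act, rng)
```

The sequential twist relative to one-shot persuasion is the last line: the receiver's induced action feeds back into the state transition, so the sender's signaling choices today shape the persuasion problems it faces tomorrow.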

Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond
Haifeng Xu