Best Of Both Worlds: Stochastic & Adversarial Best-Arm Identification

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on September 14, 2022 11:50:37 AM

Video Link: https://www.youtube.com/watch?v=exgLvFglYVQ

Duration: 45:00

373 views

Victor Gabillon (Queensland University of Technology)
https://simons.berkeley.edu/talks/best-both-worlds-stochastic-adversarial-best-arm-identification
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond

We study bandit best-arm identification with arbitrary and potentially adversarial rewards. A simple learner that samples arms uniformly at random obtains the optimal rate of error in the adversarial scenario. However, this type of strategy is suboptimal when the rewards are sampled stochastically. Therefore, we ask: Can we design a learner that performs optimally in both the stochastic and adversarial problems while not being aware of the nature of the rewards? First, we show that designing such a learner is impossible in general. In particular, to be robust to adversarial rewards, we can only guarantee optimal rates of error on a subset of the stochastic problems. We give a lower bound that characterizes the optimal rate in stochastic problems if the strategy is constrained to be robust to adversarial rewards. Finally, we design a simple parameter-free algorithm and show that its probability of error matches (up to log factors) the lower bound in stochastic problems, while remaining robust to adversarial rewards.
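
As a rough illustration of the uniform learner mentioned in the abstract, the sketch below pulls an arm uniformly at random in each round, keeps an importance-weighted estimate of each arm's cumulative reward, and recommends the arm with the largest estimate at the end. The function name `uniform_best_arm`, the reward-table input, and the importance-weighting detail are assumptions made for illustration. This is not the parameter-free algorithm from the talk, which the abstract does not specify.

```python
import random


def uniform_best_arm(rewards, seed=0):
    """Recommend an arm after sampling arms uniformly at random.

    rewards: list of rounds; each round is a list of K rewards in [0, 1],
             fixed in advance (stochastic draws or an oblivious adversary).
    Returns the index of the arm with the largest estimated cumulative reward.
    Illustrative baseline only; names and interface are hypothetical.
    """
    rng = random.Random(seed)
    num_arms = len(rewards[0])
    estimates = [0.0] * num_arms

    for round_rewards in rewards:
        # Pull one arm uniformly at random; only its reward is observed.
        arm = rng.randrange(num_arms)
        # Each arm is pulled with probability 1/K, so multiplying the
        # observed reward by K gives an unbiased estimate of that round's
        # reward for the pulled arm.
        estimates[arm] += round_rewards[arm] * num_arms

    # Recommend the arm with the highest estimated cumulative reward.
    return max(range(num_arms), key=lambda k: estimates[k])


if __name__ == "__main__":
    # Toy stochastic instance (assumed): Bernoulli arms with given means.
    data_rng = random.Random(1)
    means = [0.3, 0.5, 0.7]
    table = [[1.0 if data_rng.random() < m else 0.0 for m in means]
             for _ in range(3000)]
    print("recommended arm:", uniform_best_arm(table))
```

Because the sampling distribution never adapts to the observed rewards, the same code runs unchanged on stochastic and adversarial reward tables. That lack of adaptation is what makes the strategy robust, and, per the abstract, also what makes it suboptimal on stochastic problems.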

Other Videos By Simons Institute for the Theory of Computing

2022-09-16	Dynamic Regret Minimization for Bandits without Prior Knowledge
2022-09-16	Oracle-Efficient Online Learning or: How to Use Non-Robust Optimization for Robust Learning
2022-09-16	Adaptive Monopoly Regulation
2022-09-16	Information Collection Through Strategic Agents
2022-09-16	Incentivized Exploration
2022-09-15	Attributes: Selective Learning and Influence
2022-09-15	Markov Persuasion Process and its Reinforcement Learning
2022-09-15	Parsimonious Learning-Augmented Algorithms
2022-09-14	Machine Learning for Faster Optimization
2022-09-14	Flow Time Scheduling with Uncertain Processing Time
2022-09-14	Best Of Both Worlds: Stochastic & Adversarial Best-Arm Identification
2022-09-14	Best of Both World Algorithms from I.I.D. to Adversarial Data
2022-09-14	Corruption-Robust Contextual Search
2022-09-14	Contextual Inverse Optimization: Offline and Online Learning
2022-09-13	Causal Inference in Complex Systems: Network Interference, Strategic Agents, and Beyond
2022-09-13	Inference and Interference in Marketplace Experimentation
2022-09-13	Causal Matrix Completion: Applications to Offline Causal Reinforcement Learning
2022-09-13	Greedy Approximation Algorithms for Active Sequential Hypothesis Testing
2022-09-13	Expert Advice in Complex Environments
2022-09-12	Retrospective Search: Exploration and Ambition on Uncharted Terrain
2022-09-12	Dynamically Aggregating Diverse Information

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Quantifying Uncertainty: Stochastic Adversarial and Beyond

Victor Gabillon
