Best Of Both Worlds: Stochastic & Adversarial Best-Arm Identification

Published on ● Video Link: https://www.youtube.com/watch?v=exgLvFglYVQ



Duration: 45:00
373 views
8


Victor Gabillon (Queensland University of Technology)
https://simons.berkeley.edu/talks/best-both-worlds-stochastic-adversarial-best-arm-identification
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond

We study bandit best-arm identification with arbitrary and potentially adversarial rewards. A simple random uniform learner obtains the optimal rate of error in the adversarial scenario. However, this type of strategy is suboptimal when the rewards are sampled stochastically. Therefore, we ask: Can we design a learner that performs optimally in both the stochastic and adversarial problems while not being aware of the nature of the rewards? First, we show that designing such a learner is impossible in general. In particular, to be robust to adversarial rewards, we can only guarantee optimal rates of error on a subset of the stochastic problems. We give a lower bound that characterizes the optimal rate in stochastic problems if the strategy is constrained to be robust to adversarial rewards. Finally, we design a simple parameter-free algorithm and show that its probability of error matches (up to log factors) the lower bound in stochastic problems, and it is also robust to adversarial ones.







Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Quantifying Uncertainty: Stochastic Adversarial and Beyond
Victor Gabillon