Contextual Inverse Optimization: Offline and Online Learning

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on September 14, 2022 7:02:24 AM ● Video Link: https://www.youtube.com/watch?v=l8sQfoEGuwM

Duration: 42:31

483 views

Ilan Lobel (NYU Stern)
https://simons.berkeley.edu/talks/contextual-inverse-optimization-offline-and-online-learning
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond

We study the problems of offline and online contextual optimization with feedback information, where instead of observing the loss, we observe, after-the-fact, the optimal action an oracle with full knowledge of the objective function would have taken. We aim to minimize regret, which is defined as the difference between our losses and the ones incurred by an all-knowing oracle. In the offline setting, the decision-maker has information available from past periods and needs to make one decision, while in the online setting, the decision-maker optimizes decisions dynamically over time based a new set of feasible actions and contextual functions in each period. For the offline setting, we characterize the optimal minimax policy, establishing the performance that can be achieved as a function of the underlying geometry of the information induced by the data. In the online setting, we leverage this geometric characterization to optimize the cumulative regret. We develop an algorithm that yields the first regret bound for this problem that is logarithmic in the time horizon. Finally, we show via simulation that our proposed algorithms outperform previous methods from the literature.

Other Videos By Simons Institute for the Theory of Computing

2022-09-16	Information Collection Through Strategic Agents
2022-09-16	Incentivized Exploration
2022-09-15	Attributes: Selective Learning and Influence
2022-09-15	Markov Persuasion Process and its Reinforcement Learning
2022-09-15	Parsimonious Learning-Augmented Algorithms
2022-09-14	Machine Learning for Faster Optimization
2022-09-14	Flow Time Scheduling with Uncertain Processing Time
2022-09-14	Best Of Both Worlds: Stochastic & Adversarial Best-Arm Identification
2022-09-14	Best of Both World Algorithms from I.I.D. to Adversarial Data
2022-09-14	Corruption-Robust Contextual Search
2022-09-14	Contextual Inverse Optimization: Offline and Online Learning
2022-09-13	Causal Inference in Complex Systems: Network Interference, Strategic Agents, and Beyond
2022-09-13	Inference and Interference in Marketplace Experimentation
2022-09-13	Causal Matrix Completion: Applications to Offline Causal Reinforcement Learning
2022-09-13	Greedy Approximation Algorithms for Active Sequential Hypothesis Testing
2022-09-13	Expert Advice in Complex Environments
2022-09-12	Retrospective Search: Exploration and Ambition on Uncharted Terrain
2022-09-12	Dynamically Aggregating Diverse Information
2022-09-10	Simplicity and Optimality in Algorithmic Economics
2022-09-10	Algorithms Using Local Graph Features to Predict Epidemics
2022-09-10	Beyond Worst Case Analysis in ML

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Quantifying Uncertainty: Stochastic Adversarial and Beyond

Ilan Lobel

Channel	Latest
Vardoc1	6 hours ago
Mon Facts	8 hours ago
GeorgeMallouris	9 hours ago
Big punchman	9 hours ago
Jakou	9 hours ago
HOWTONEVOLUTION	9 hours ago
Brunoborne	9 hours ago
Goodblue77	9 hours ago
Stan's Mod Gaming	9 hours ago
OPEN TV	9 hours ago
neXzen MMD & MUSIC	9 hours ago
flipswitch3111	9 hours ago
WalkthroughGuy	9 hours ago
ТРЕНДИ ШОРТС	10 hours ago
eagLe34	10 hours ago
Melody /ميلودي	10 hours ago
Linkwolf	10 hours ago
아루우	10 hours ago
Nostradamus	10 hours ago
Xeres Artrophel Ch.	10 hours ago
Dandy Caballero	10 hours ago
OUDO - ON THE RIFT	10 hours ago
Zaus Eragon	10 hours ago
Ian Harrison	10 hours ago
KiLLiNG MaCHiNE ( ͡° ͜ʖ ͡°)	10 hours ago