The Compensated Coupling (or Why the Future is the Best Guide for the Present)

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on October 8, 2022 9:34:48 AM ● Video Link: https://www.youtube.com/watch?v=wYxq-KJ7v5A

Category:

Guide

Duration: 31:56

338 views

Sid Banerjee (Cornell University)
https://simons.berkeley.edu/talks/compensated-coupling-or-why-future-best-guide-present
Joint IFML/Data-Driven Decision Processes Workshop

What makes online decision-making different from other decision-making/optimization problems? While it seems clear that the unique features are the sequential nature of taking actions and uncertainty in future outcomes, most techniques for solving such problems tend to obfuscate these features - so are these the best ways to think about these settings? I will present the compensated coupling: a simple paradigm for reasoning about and designing online decision-making policies, based on a sample-pathwise accounting of their performance compared to some benchmark policy. This approach generalizes many standard results used in studying Markov decision processes and reinforcement learning, but also gives us new policies which are much simpler and more effective than existing heuristics. For a large class of widely-studied control problems including online resource-allocation, dynamic pricing, generalized assignment, online bin packing, and bandits with knapsacks, I will illustrate how these new algorithms achieve constant regret (i.e., additive loss compared to the hindsight optimal which is independent of the horizon and state-space) under a wide range of conditions. Time permitting, I will try and describe how we can use this technique to incorporate side information and historical data in these settings, and achieve constant regret with as little as a single data trace.

Other Videos By Simons Institute for the Theory of Computing

2022-10-12	Learning Across Bandits in High Dimension via Robust Statistics
2022-10-12	Are Multicriteria MDPs Harder to Solve Than Single-Criteria MDPs?
2022-10-12	A Game-Theoretic Approach to Offline Reinforcement Learning
2022-10-11	The Statistical Complexity of Interactive Decision Making
2022-10-11	A Tutorial on Finite-Sample Guarantees of Contractive Stochastic Approximation With...
2022-10-11	A Tutorial on Finite-Sample Guarantees of Contractive Stochastic Approximation With...
2022-10-11	Stochastic Bin Packing with Time-Varying Item Sizes
2022-10-10	Constant Regret in Exchangeable Action Models: Overbooking, Bin Packing, and Beyond
2022-10-08	On The Exploration In Load-Balancing Under Unknown Service Rates
2022-10-08	Sample Complexity Of Policy-Based Methods Under Off-Policy Sampling And ...
2022-10-08	The Compensated Coupling (or Why the Future is the Best Guide for the Present)
2022-10-08	Higher-Dimensional Expansion of Random Geometric Complexes
2022-10-08	On the Power of Preconditioning in Sparse Linear Regression
2022-10-07	What Functions Do Transformers Prefer to Represent?
2022-10-01	Optimality of Variational Inference for Stochastic Block Model
2022-10-01	Machine Learning on Large-Scale Graphs
2022-10-01	Survey on Sparse Graph Limits + A Toy Example
2022-10-01	Long Range Dependence in Evolving Networks
2022-09-30	Stochastic Processes on Sparse Graphs: Hydrodynamic Limits and Markov Approximations
2022-09-30	Large Deviation Principle for the Norm of the Adjacency Matrix and the Laplacian Matrix of...
2022-09-30	Longitudinal Network Models, Log-Linear Multigraph Models, and Implications to Estimation and...

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Joint IFML/Data-Driven Decision Processes Workshop

Sid Banerjee

Channel	Latest
Anubis	6 hours ago
The Chicken	6 hours ago
NintendoGamerGuide	6 hours ago
4K Gaming	6 hours ago
mikeontheinternet	6 hours ago
KT Spiritual	6 hours ago
DAY NIGHT GAMERZZ	6 hours ago
Gameplay1973Channel	7 hours ago
pytagus	7 hours ago
طبيب الكمبيوتر PCD	7 hours ago
관종대왕	7 hours ago
SMILTREX	7 hours ago
Daidara Games	7 hours ago
Roar 79	7 hours ago
StopGame - All about video games!	7 hours ago
Vishal Gaming	7 hours ago
World Of Games	7 hours ago
HanTzy	7 hours ago
ZON4 G4MER 30	7 hours ago
yosum58 Youtube Channel	7 hours ago
MOBA HPG	7 hours ago
Freakin' Famous Auditions	7 hours ago
Yoyo_L	7 hours ago
hOlyhexOr	7 hours ago
Nintendo Hall	7 hours ago