Adaptive Data Collection via Autoregressive Generation

Video Link: https://www.youtube.com/watch?v=X1eA5sQQ-7g


Hongseok Namkoong (Columbia University)
https://simons.berkeley.edu/talks/hongseok-namkoong-coumbia-university-2024-11-15
Domain Adaptation and Related Areas

Real-world decision-making requires grappling with a perpetual lack of data as environments change; intelligent agents must comprehend uncertainty and actively gather information to resolve it. We propose a new framework for learning adaptive data collection algorithms from massive historical data, which we demonstrate in a cold-start recommendation problem. First, we use historical data to pretrain an autoregressive model to predict a sequence of repeated feedback/rewards (e.g., responses to news articles shown to different users over time). In learning to make accurate predictions, the model implicitly learns an informed prior based on rich action features (e.g., article headlines) and how to sharpen beliefs as more rewards are gathered (e.g., clicks as each article is recommended). At decision-time, we autoregressively sample (impute) an imagined sequence of rewards for each action, and choose the action with the largest average imputed reward. Far from a heuristic, our approach is an implementation of Thompson sampling (with a learned prior), a prominent active exploration algorithm. We prove our pretraining loss directly controls online decision-making performance, and we demonstrate our framework on a news recommendation task where we integrate end-to-end fine-tuning of a pretrained language model to process news article headline text to improve performance.
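The decision-time procedure described in the abstract can be sketched in a few lines: for each action, autoregressively impute an imagined sequence of future rewards from the pretrained sequence model, then act greedily on the imputed averages. The sketch below is illustrative only; `sequence_model` is a toy stand-in for the pretrained autoregressive model, and all names and the `prior_click_rate` feature are assumptions, not the authors' implementation.

```python
import random

def sequence_model(action_features, observed_rewards):
    """Toy stand-in for the pretrained autoregressive model: returns the
    probability that the next reward is 1. It starts from a feature-based
    prior (standing in for what the model learns from, e.g., headline text)
    and sharpens toward the empirical mean as more rewards are observed."""
    prior = action_features["prior_click_rate"]  # hypothetical feature
    n = len(observed_rewards)
    return (prior + sum(observed_rewards)) / (1 + n)

def choose_action(actions, histories, horizon=50, rng=random):
    """For each action, autoregressively sample (impute) `horizon` future
    rewards given its observed history, then choose the action with the
    largest average imputed reward -- per the talk, an implementation of
    Thompson sampling with a learned prior."""
    best_action, best_value = None, float("-inf")
    for a in actions:
        imputed = list(histories[a])  # rewards gathered so far
        for _ in range(horizon):
            p = sequence_model(actions[a], imputed)
            imputed.append(1 if rng.random() < p else 0)  # sample next reward
        value = sum(imputed) / len(imputed)
        if value > best_value:
            best_action, best_value = a, value
    return best_action
```

For example, with two candidate articles where one has both a stronger prior and a better click history, the imputed averages will typically favor it; randomness in the imputation is what drives exploration when histories are short and beliefs are diffuse.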




Other Videos By Simons Institute for the Theory of Computing


2024-11-19  Talk by Mahdi Soltanolkotabi (University of Southern California)
2024-11-18  Some Easy Optimization Problems Have the Overlap-Gap Property
2024-11-18  Understanding Contrastive Learning and Self-training
2024-11-18  Revisiting Scalarization in Multi-Task Learning
2024-11-18  Beyond Decoding: Meta-Generation Algorithms for Large Language Models (Remote Talk)
2024-11-18  Omnipredicting Single-Index Models with Multi-Index Models
2024-11-17  The Truth About Your Lying Calibrated Forecaster: How to Design Truthful Calibration Measures
2024-11-17  A Discrepancy-Based Theory of Adaptation
2024-11-17  Bypassing the Impossibility of Online Learning Thresholds: Unbounded Losses and Transductive Priors
2024-11-17  Learning from Dynamics
2024-11-14  Adaptive Data Collection via Autoregressive Generation
2024-11-13  Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains or the Whole Domain
2024-11-13  Language-guided Adaptation
2024-11-13  On Spurious Associations and LLM Alignment
2024-11-13  Causally motivated robustness to shortcut learning
2024-11-13  Talk by Zachary Lipton
2024-11-12  Distribution shift in ecological data: generalization vs. specialization
2024-11-12  Transfer learning via local convergence rates of the nonparametric least squares estimator
2024-11-12  Transfer learning for weak-to-strong generalization
2024-11-12  User-level and federated local differential privacy
2024-11-11  The Evolution of Domain Transfer in the Era of Pre-trained Language Models