Pseudo-Labeling for Covariate Shift Adaptation

Channel:

Simons Institute for the Theory of Computing

Subscribers:

69,500

Published on November 12, 2024 12:00:00 AM ● Video Link: https://www.youtube.com/watch?v=iB3RNarr4bA

Duration: 0:00

334 views

Kaizheng Wang (Columbia University)
https://simons.berkeley.edu/talks/kaizheng-wang-columbia-university-2024-11-12
Domain Adaptation and Related Areas

We develop and analyze a covariate shift adaptation method based on pseudo-labeling. The goal is to learn a regression function with small mean squared error over a target distribution, based on unlabeled data from there and labeled data that may have a different feature distribution. We propose to split the labeled data into two subsets and run regression on them separately to obtain (1) a collection of candidate models induced by different hyperparameters, and (2) an imputation model. We use the latter to fill the missing labels and then select the best candidate model accordingly. To investigate the influence of pseudo-labels on model selection, we derive a bias-variance decomposition that highlights the importance of using an imputation model with low bias. We demonstrate the efficacy of our approach through kernel ridge regression, proving that our method effectively adapts to the unknown covariate shift.

Other Videos By Simons Institute for the Theory of Computing

2024-11-14	Open-Source and Science in the Era of Foundation Models
2024-11-13	Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains or the Whole Domain
2024-11-13	Language-guided Adaptation
2024-11-13	On Spurious Associations and LLM Alignment
2024-11-13	Causally motivated robustness to shortcut learning
2024-11-13	Talk by Zachary Lipton
2024-11-12	Distribution shift in ecological data: generalization vs. specialization,
2024-11-12	Transfer learning via local convergence rates of the nonparametric least squares estimator
2024-11-12	Transfer learning for weak-to-strong generalization
2024-11-12	User-level and federated local differential privacy
2024-11-11	Pseudo-Labeling for Covariate Shift Adaptation
2024-10-16	The Enigma of LLMs: on Creativity, Compositionality, Pluralism, and Paradoxes
2024-10-02	Let’s Try and Be More Tolerant: On Tolerant Property Testing and Distance Approximation
2024-10-02	A Strong Separation for Adversarially Robust L_0 Estimation for Linear Sketches
2024-10-02	Towards Practical Distribution Testing
2024-10-02	Toward Optimal Semi-streaming Algorithm for (1+ε)-approximate Maximum Matching
2024-10-02	Plenary Talk: Privately Evaluating Untrusted Black-Box Functions
2024-10-02	The long path to \sqrt{d} monotonicity testers
2024-10-02	O(log log n) Passes is Optimal for Semi-Streaming Maximal Independent Set
2024-10-02	Distribution Learning Meets Graph Structure Sampling
2024-10-02	On the instance optimality of detecting collisions and subgraphs

Channel	Latest
nongmoclips	6 hours ago
Samsung	6 hours ago
리그 오브 레전드	6 hours ago
Zelgraz	6 hours ago
EL MALTEADA	6 hours ago
domisumReplay: Trundle	7 hours ago
fuku dubs	7 hours ago
Lukwer TFT	7 hours ago
Personalll	7 hours ago
KarussoTV	7 hours ago
Lack Of Entertainment	7 hours ago
Ungie	7 hours ago
Eagle Zero One	7 hours ago
AguilaCultura	7 hours ago
nathanman213	7 hours ago
PUBG MOBILE LATAM	7 hours ago
Julinurrohman	7 hours ago
STACK Presents	7 hours ago
ecletiverso	7 hours ago
MANTU IDAMAN 24	7 hours ago
Healthy Tips 2.0	7 hours ago
iBridge	7 hours ago
GuideRealm	7 hours ago
Rafaeu	7 hours ago
Enzo Alavaski	7 hours ago