The Devil is in the Tails and Other Stories of Interpolation

Video link: https://www.youtube.com/watch?v=e7Y3hgQlaaE
Duration: 54:41


Niladri Chatterji (Stanford)
https://simons.berkeley.edu/node/21930
Deep Learning Theory Workshop and Summer School

In this talk, I shall present two research vignettes on the generalization of interpolating models.

Prior work has presented strong empirical evidence that importance weights can have little to no effect on interpolating neural networks. We show that importance weighting fails not because of interpolation itself, but because of the use of exponentially tailed losses such as the cross-entropy loss. As a remedy, we show that polynomially tailed losses restore the effect of importance reweighting in correcting distribution shift in interpolating models trained by gradient descent. Surprisingly, our theory reveals that using biased importance weights can improve performance in interpolating models.
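To make the contrast concrete, below is a minimal PyTorch sketch of importance-weighted training with a polynomially tailed loss on a toy linear model. The piecewise loss here (a logistic left piece stitched continuously to a polynomial tail) and all names, weights, and hyperparameters are illustrative assumptions, not necessarily the construction from the talk. The relevant difference is in the tails: with the cross-entropy loss, gradient descent on separable data is known to converge to the max-margin direction regardless of the weights, whereas a polynomial tail keeps the importance weights influential.

```python
import torch
import torch.nn.functional as F

def poly_tail_loss(margin, alpha=1.0, beta=0.0):
    """Logistic loss for margins below beta, a polynomial tail above.

    The two pieces are matched at margin = beta so the loss is continuous.
    For large margins the loss decays like margin**(-alpha) rather than
    exp(-margin), so gradients from well-classified points do not vanish
    exponentially fast. Illustrative form only.
    """
    left = F.softplus(-margin)                          # log(1 + exp(-m)): exponential tail
    scale = F.softplus(torch.tensor(-float(beta)))      # value of `left` at m = beta
    denom = torch.clamp(margin - beta + 1.0, min=1.0)   # clamp keeps the unselected branch finite
    right = scale / denom ** alpha                      # polynomial tail, continuous at beta
    return torch.where(margin < beta, left, right)

# Toy setup: linear model, binary labels in {-1, +1}, one class up-weighted.
torch.manual_seed(0)
n, d = 200, 5
X = torch.randn(n, d)
y = torch.sign(X[:, 0] + 0.1 * torch.randn(n))
importance = torch.where(y > 0, torch.tensor(5.0), torch.tensor(1.0))  # hypothetical weights

w = torch.zeros(d, requires_grad=True)
opt = torch.optim.SGD([w], lr=0.1)
for _ in range(2000):
    margins = y * (X @ w)
    loss = (importance * poly_tail_loss(margins)).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Swapping poly_tail_loss for F.softplus(-margins), i.e. the logistic/cross-entropy loss, in the same loop gives the natural baseline in which the effect of the weights washes out at interpolation.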

Second, I shall present lower bounds on the excess risk of sparse interpolating procedures for linear regression. Our result shows that the excess risk of the minimum L1-norm interpolant can converge at an exponentially slower rate than that of the minimum L2-norm interpolant, even when the ground truth is sparse. Our analysis exposes the benefit of an effect analogous to the "wisdom of the crowd", except that here the harm arising from fitting the noise is ameliorated by spreading it among many directions.
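As a concrete companion to this result, here is a small numpy/scipy sketch that computes both interpolants on synthetic sparse data: the minimum L2-norm interpolant via the pseudoinverse, and the minimum L1-norm interpolant (basis pursuit) via a standard linear-programming reformulation. The dimensions, noise level, and isotropic Gaussian design are illustrative assumptions; a single run shows how the two estimators are computed but cannot exhibit the asymptotic rate separation, which concerns how the excess risks scale as n and d grow.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
n, d, sigma = 50, 500, 0.5                     # illustrative sizes, not the paper's regime
theta_star = np.zeros(d)
theta_star[0] = 1.0                            # sparse ground truth
X = rng.standard_normal((n, d))
y = X @ theta_star + sigma * rng.standard_normal(n)

# Minimum L2-norm interpolant: the pseudoinverse (ridgeless least squares) solution.
theta_l2 = np.linalg.pinv(X) @ y

# Minimum L1-norm interpolant (basis pursuit): min ||theta||_1 s.t. X theta = y.
# Standard LP reformulation with theta = p - q and p, q >= 0.
c = np.ones(2 * d)
res = linprog(c, A_eq=np.hstack([X, -X]), b_eq=y, bounds=[(0, None)] * (2 * d))
theta_l1 = res.x[:d] - res.x[d:]

# With isotropic covariates, the excess risk equals ||theta_hat - theta_star||^2.
print("min-L2 excess risk:", np.sum((theta_l2 - theta_star) ** 2))
print("min-L1 excess risk:", np.sum((theta_l1 - theta_star) ** 2))
```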

Based on joint work with Tatsunori Hashimoto, Saminul Haque, Philip Long, and Alexander Wang.

Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Deep Learning Theory Workshop and Summer School
Niladri Chatterji