Chasing the Long Tail: What Neural Networks Memorize and Why

Video Link: https://www.youtube.com/watch?v=w_BUN5tPiuA



Duration: 51:41
1,712 views


Vitaly Feldman (Apple ML Research)
https://simons.berkeley.edu/node/22921
Societal Considerations and Applications

Deep learning algorithms that achieve state-of-the-art results on image and text recognition tasks tend to fit the entire training dataset (nearly) perfectly, including mislabeled examples and outliers. This propensity to memorize seemingly useless data and the resulting large generalization gap have puzzled many practitioners and are not explained by existing theories of machine learning. We provide a simple conceptual explanation and a theoretical model demonstrating that memorization of outliers and mislabeled examples is necessary for achieving close-to-optimal generalization error when learning from long-tailed data distributions. Image and text data are known to follow such distributions, and therefore our results establish a formal link between these empirical phenomena. We then demonstrate the utility of memorization and support our explanation empirically. These results rely on a new technique for efficiently estimating the memorization and influence of training data points. Our results allow us to quantify the cost of limiting memorization in learning and explain the disparate effects that privacy and model compression have on different subgroups.
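
The estimation technique mentioned in the abstract measures, for each training example, how much the model's prediction on that example depends on whether the example was in the training set. Below is a minimal sketch of a subsampling-based estimator in that spirit (following the general idea in Feldman and Zhang's work): train many models on random subsets of the data, then compare accuracy on each example between models whose subset included it and models whose subset excluded it. The toy dataset, classifier, and parameter values are illustrative placeholders, not the authors' actual implementation.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Sketch of subsampling-based memorization estimation.
# Everything below (dataset, classifier, constants) is a placeholder
# chosen to keep the example self-contained and runnable.

rng = np.random.default_rng(0)

# Toy dataset standing in for the real training set.
X, y = make_classification(n_samples=200, n_features=20, random_state=0)
n = len(y)

n_models = 100      # number of models trained on random subsets
subset_frac = 0.7   # fraction of the training set used per model

# correct[t, i]  = 1 if model t classifies training example i correctly
# included[t, i] = True if example i was in model t's training subset
correct = np.zeros((n_models, n))
included = np.zeros((n_models, n), dtype=bool)

for t in range(n_models):
    idx = rng.choice(n, size=int(subset_frac * n), replace=False)
    included[t, idx] = True
    model = LogisticRegression(max_iter=1000).fit(X[idx], y[idx])
    correct[t] = (model.predict(X) == y)

# Memorization estimate for example i: accuracy on i when i was in the
# training subset minus accuracy on i when it was held out.
eps = 1e-12
acc_in = (correct * included).sum(axis=0) / (included.sum(axis=0) + eps)
acc_out = (correct * ~included).sum(axis=0) / ((~included).sum(axis=0) + eps)
memorization = acc_in - acc_out

print("most memorized examples:", np.argsort(-memorization)[:10])
```

The same per-model bookkeeping can be reused to estimate the influence of a training example on a test example, by comparing test accuracy between models that did and did not see the training example; this avoids the cost of retraining with each example individually removed.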




Other Videos By Simons Institute for the Theory of Computing


2022-11-10Decision-Aware Learning for Global Health Supply Chains
2022-11-10Supply-Side Equilibria in Recommender Systems
2022-11-10What Really Matters for Fairness in Machine Learning: Delayed Impact and Other Desiderata
2022-11-10Predictive Modeling in Healthcare – Special Considerations
2022-11-10Bringing Order to Chaos: Navigating the Disagreement Problem in Explainable ML
2022-11-09Pipeline Interventions
2022-11-09Algorithmic Challenges in Ensuring Fairness at the Time of Decision
2022-11-09Improving Refugee Resettlement
2022-11-09Learning to Predict Arbitrary Quantum Processes
2022-11-09A Kerfuffle: Differential Privacy and the 2020 Census
2022-11-08Chasing the Long Tail: What Neural Networks Memorize and Why
2022-11-08Concurrent Composition Theorems for all Standard Variants of Differential Privacy
2022-11-08Privacy Management: Achieving the Possimpible
2022-11-07Privacy-safe Measurement on the Web: Open Questions From the Privacy Sandbox
2022-10-29Transmission Neural Networks: From Virus Spread Models to Neural Networks
2022-10-29Spatial Spread of Dengue Virus: Appropriate Spatial Scales for Transmission
2022-10-28A Global Comparison of COVID-19 Variant Waves and Relationships with Clinical and...
2022-10-28Diversity and Inequality in Information Diffusion on Social Networks
2022-10-28Learning through the Grapevine and the Impact of the Breadth and Depth of Social Networks
2022-10-28Just a Few Seeds More: The Inflated Value of Network Data for Diffusion...
2022-10-27Bayesian Learning in Social Networks



Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Epidemics and Information Diffusion
Vitaly Feldman