A Unifying Theory of Distance to Calibration

Channel:

Google TechTalks

Subscribers:

349,000

Published on April 7, 2023 11:14:13 PM ● Video Link: https://www.youtube.com/watch?v=8ZhsV43x3so

Duration: 1:02:50

563 views

A Google TechTalk, presented by Jarosław Błasiok, 2023-04-04
ABSTRACT: We study the fundamental question of how to define and measure the distance from calibration for probabilistic predictors. While the notion of “perfect calibration” is well-understood, there is no consensus on how to quantify the distance from perfect calibration. Numerous calibration measures have been proposed in the literature, but it is unclear how they compare to each other, and many popular measures such as Expected Calibration Error (ECE) fail to satisfy basic properties like continuity.

We present a rigorous framework for analyzing calibration measures, inspired by the literature on property testing. We propose a ground-truth notion of distance from calibration: the $\ell_1$ distance to the nearest perfectly calibrated predictor. We define a “consistent calibration measure” as one that is a polynomial factor approximation to the this distance. Applying our framework, we identify three calibration measures that are “consistent” and can be estimated efficiently: smooth calibration, interval calibration, and Laplace kernel calibration. The former two give quadratic approximations to the ground truth distance, which we show is information-theoretically optimal. Our work thus establishes fundamental lower and upper bounds on measuring distance to calibration, and also provides theoretical justification for preferring certain metrics (like Laplace kernel calibration) in practice.

Based on joint work with Parikshit Gopalan, Lunjia Hu and Preetum Nakkiran.

Full text available at https://arxiv.org/abs/2211.16886

Bio: Jarosław Błasiok is a Junior Fellow at Simons Society of Fellows, conducting postdoctoral research at the Theory of Computation group at Columbia University, under mentorship of Professor Alex Andoni. He finished his Ph.D. at the John A. Paulson School of Engineering and Applied Sciences at Harvard University, advised by Professor Jelani Nelson. He received his B.S. and M.Sc. in Computer Science from the University of Warsaw.

He has a broad research interest in Theoretical Computer Science and has worked in design and analysis of streaming algorithms, the theory of error-correcting codes, algorithms related to machine learning, differential privacy and compressed sensing. In his dissertation, he described his research in streaming algorithms and error correction, featuring applications of high dimensional probability in these two areas.

A Google Talk Series on Algorithms, Theory, and Optimization

Other Videos By Google TechTalks

2023-05-30	Improving the Privacy Utility Tradeoff in Differentially Private Machine Learning with Public Data
2023-05-30	Randomized Approach for Tight Privacy Accounting
2023-05-30	Almost Tight Error Bounds on Differentially Private Continual Counting
2023-05-30	EIFFeL: Ensuring Integrity for Federated Learning
2023-05-30	Differentially Private Diffusion Models
2023-05-15	Damian Grimling \| Sentistocks \| Sentimenti \| web3 talks \| March 9th 2023 \| MC: Blake DeBenon
2023-04-21	Branimir Rakic \| CTO & Co-Founder of OriginTrail \| web3 talks \| Feb 27th 2023 \| MC: Alex Ticamera
2023-04-15	A Nearly Tight Analysis of Greedy k-means++
2023-04-15	Introduction to Length-Constrained Expanders and Expander Decompositions
2023-04-07	Improved Feature Importance Computation for Tree Models Based on the Banzhaf Value
2023-04-07	A Unifying Theory of Distance to Calibration
2023-04-07	Dynamic Graph Sketching: To Infinity And Beyond
2023-03-20	Sergey Nazarov \| Co-Founder Chainlink \| web3 talks \| Mar 16 2023 \| MC: Marlon Ruiz
2023-03-09	Evan Shapiro \| CEO Mina Foundation \| web3 talks \| Feb 16th 2023 \| MC: Marlon Ruiz
2023-03-07	Zürich Go Meetup: Zero-effort Type-safe Parsing of JSON and XML
2023-03-07	Zürich Go Meetup: Let’s Build a Game with Go
2023-03-07	Zürich Go Meetup: Run Go programs on your Raspberry Pi with gokrazy!
2023-03-03	Online Covering: Secretaries, Prophets and Universal Maps
2023-03-03	Auto-bidding in Online Advertising: Campaign Management and Fairness
2023-03-03	Tree Learning: Optimal Algorithms and Sample Complexity
2023-03-03	A Fast Algorithm for Adaptive Private Mean Estimation

Channel	Latest
PartyZams / Brian T Cox	6 hours ago
TheDiN23	6 hours ago
DarknessOfNoor	6 hours ago
Lady Nemhesis	7 hours ago
New York City Files	7 hours ago
Yescar Gameplays en Español	7 hours ago
James Bullock	7 hours ago
FF Hades	7 hours ago
jacurex90	7 hours ago
Moobys	7 hours ago
domisumReplay: Ornn	8 hours ago
ANHONI	8 hours ago
Trip Ni Bombi	8 hours ago
Cyber Knight	8 hours ago
Guisterix	8 hours ago
Ay0Tony	8 hours ago
Ratan Suthar (RaTs)	8 hours ago
Da Drunk Gamer	8 hours ago
domisumReplay: Pantheon	8 hours ago
DTM's Channel	8 hours ago
Davemon	8 hours ago
AGiantDog	8 hours ago
Pyrxplxyz	8 hours ago
Jaxres LoL	8 hours ago
Tunnel NY Lite	8 hours ago