A Unifying Theory of Distance to Calibration

Subscribers:
348,000
Published on ● Video Link: https://www.youtube.com/watch?v=8ZhsV43x3so



Duration: 1:02:50
563 views
8


A Google TechTalk, presented by Jarosław Błasiok, 2023-04-04
ABSTRACT: We study the fundamental question of how to define and measure the distance from calibration for probabilistic predictors. While the notion of “perfect calibration” is well-understood, there is no consensus on how to quantify the distance from perfect calibration. Numerous calibration measures have been proposed in the literature, but it is unclear how they compare to each other, and many popular measures such as Expected Calibration Error (ECE) fail to satisfy basic properties like continuity.

We present a rigorous framework for analyzing calibration measures, inspired by the literature on property testing. We propose a ground-truth notion of distance from calibration: the $\ell_1$ distance to the nearest perfectly calibrated predictor. We define a “consistent calibration measure” as one that is a polynomial factor approximation to the this distance. Applying our framework, we identify three calibration measures that are “consistent” and can be estimated efficiently: smooth calibration, interval calibration, and Laplace kernel calibration. The former two give quadratic approximations to the ground truth distance, which we show is information-theoretically optimal. Our work thus establishes fundamental lower and upper bounds on measuring distance to calibration, and also provides theoretical justification for preferring certain metrics (like Laplace kernel calibration) in practice.

Based on joint work with Parikshit Gopalan, Lunjia Hu and Preetum Nakkiran.

Full text available at https://arxiv.org/abs/2211.16886

Bio: Jarosław Błasiok is a Junior Fellow at Simons Society of Fellows, conducting postdoctoral research at the Theory of Computation group at Columbia University, under mentorship of Professor Alex Andoni. He finished his Ph.D. at the John A. Paulson School of Engineering and Applied Sciences at Harvard University, advised by Professor Jelani Nelson. He received his B.S. and M.Sc. in Computer Science from the University of Warsaw.

He has a broad research interest in Theoretical Computer Science and has worked in design and analysis of streaming algorithms, the theory of error-correcting codes, algorithms related to machine learning, differential privacy and compressed sensing. In his dissertation, he described his research in streaming algorithms and error correction, featuring applications of high dimensional probability in these two areas.

A Google Talk Series on Algorithms, Theory, and Optimization




Other Videos By Google TechTalks


2023-05-30Improving the Privacy Utility Tradeoff in Differentially Private Machine Learning with Public Data
2023-05-30Randomized Approach for Tight Privacy Accounting
2023-05-30Almost Tight Error Bounds on Differentially Private Continual Counting
2023-05-30EIFFeL: Ensuring Integrity for Federated Learning
2023-05-30Differentially Private Diffusion Models
2023-05-15Damian Grimling | Sentistocks | Sentimenti | web3 talks | March 9th 2023 | MC: Blake DeBenon
2023-04-21Branimir Rakic | CTO & Co-Founder of OriginTrail | web3 talks | Feb 27th 2023 | MC: Alex Ticamera
2023-04-15A Nearly Tight Analysis of Greedy k-means++
2023-04-15Introduction to Length-Constrained Expanders and Expander Decompositions
2023-04-07Improved Feature Importance Computation for Tree Models Based on the Banzhaf Value
2023-04-07A Unifying Theory of Distance to Calibration
2023-04-07Dynamic Graph Sketching: To Infinity And Beyond
2023-03-20Sergey Nazarov | Co-Founder Chainlink | web3 talks | Mar 16 2023 | MC: Marlon Ruiz
2023-03-09Evan Shapiro | CEO Mina Foundation | web3 talks | Feb 16th 2023 | MC: Marlon Ruiz
2023-03-07Zürich Go Meetup: Zero-effort Type-safe Parsing of JSON and XML
2023-03-07Zürich Go Meetup: Let’s Build a Game with Go
2023-03-07Zürich Go Meetup: Run Go programs on your Raspberry Pi with gokrazy!
2023-03-03Online Covering: Secretaries, Prophets and Universal Maps
2023-03-03Auto-bidding in Online Advertising: Campaign Management and Fairness
2023-03-03Tree Learning: Optimal Algorithms and Sample Complexity
2023-03-03A Fast Algorithm for Adaptive Private Mean Estimation