Data is as Data Does: The Influence of Computation on Inference

Published on ● Video Link: https://www.youtube.com/watch?v=ZVec7pw0bgw



Duration: 1:06:57
6,281 views
214


John Patrick Cunningham (Columbia University)
https://simons.berkeley.edu/talks/john-patrick-cunningham-columbia-university-2024-06-14
AI≡Science: Strengthening the Bond Between the Sciences and Artificial Intelligence

Probabilistic models remain a hugely popular class of techniques in modern machine learning, and their expressiveness has been extended by modern large-scale compute. While exciting, these generalizations almost always come with approximations, and researchers typically ignore the fundamental influence of computational approximations. Thus, results from modern probabilistic methods become as much about the approximation method as they are about the data and the model, undermining both the Bayesian principle and the practical utility of inference in probabilistic models for real applications in science and industry.

To expose this issue and to demonstrate how to do approximate inference correctly in at least one model class, in this talk I will derive a new type of Gaussian Process approximation that provides consistent estimation of the combined posterior arising from both the finite number of data observed *and* the finite amount of computation expended. The most common GP approximations map to an instance in this class, such as methods based on the Cholesky factorization, conjugate gradients, and inducing points. I will show the consequences of ignoring computational uncertainty, and prove that implicitly modeling it improves generalization performance. I will show how to do model selection while considering computation, and I will describe an application to neurobiological data.




Other Videos By Simons Institute for the Theory of Computing


2024-08-04Testing Intersectingness of Uniform Families or how Dana and I intersected
2024-08-04Recent Developments in Testing Bounded-Degree Graphs
2024-08-04Property Testing with Incomplete or Manipulated Inputs
2024-08-01Are there graphs whose shortest path structure requires large edge weights?
2024-08-01Graph Connectivity Using Star Contraction
2024-08-01Agnostic Proper Learning of Monotone Functions: Beyond the Black-Box Correction Barrier
2024-08-01Sparsifying Set Systems for Coverage Problems
2024-07-31Recent Progress on Euclidean Spanners
2024-07-31Space is a latent sequence: A theory of hippocampus and PFC
2024-06-21Robust Equation Discovery and Sparse Sensing with Guarantees
2024-06-21Data is as Data Does: The Influence of Computation on Inference
2024-06-21Some thoughts on ML-based protein engineering
2024-06-21Machine learning models of differential gene expression
2024-06-21Open challenges in AI for molecular design: representation, experimental alignment, and...
2024-06-21Illuminating protein space with generative models
2024-06-21Symmetries in Machine Learning for Materials Science
2024-06-21Large ML potentials for chemistry: generalization, inductive biases, and cancellation of errors
2024-06-21Deep learning and numerical methods intersections for improving molecular and fluid dynamics
2024-06-21ML gradients in Molecular Simulations
2024-06-21Generalizable sampling of conformational ensembles with latent space dynamics
2024-06-21The State of Protein Structure Prediction and Friends



Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
AI≡Science: Strengthening the Bond Between the Sciences and Artificial Intelligence
John Patrick Cunningham