Predicting and optimizing the behavior of large ML models
Video Link: https://www.youtube.com/watch?v=iePMkTFuEW8
Andrew Ilyas (Stanford University)
https://simons.berkeley.edu/talks/andrew-ilyas-stanford-university-2025-04-03
The Future of Language Models and Transformers
In this talk, we study the problem of predicting (and optimizing) the counterfactual behavior of large-scale ML models. We start by focusing on “data counterfactuals,” where the goal is to estimate the effect of modifying a training dataset on the resulting machine learning outputs (and, conversely, to design datasets that induce specific desired behavior). We introduce a method that almost perfectly estimates such counterfactuals, unlocking new possibilities in the design and evaluation of ML models, including state-of-the-art data attribution, selection, and poisoning.
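To make the "data counterfactual" idea concrete, here is a minimal toy sketch (not the speaker's actual method) in the spirit of datamodel-style surrogates: train a "model" on many random subsets of the data, fit a linear function from the subset's inclusion indicators to the model's output, and use that linear surrogate to predict the output under a counterfactual dataset. The toy model here (the sum of included points) is chosen so the output really is linear in the indicators; real models are only approximately so.

```python
# Toy illustration of predicting a data counterfactual with a linear
# surrogate fit on random training subsets.  All names and the toy
# "model" are illustrative assumptions, not the talk's actual method.
import numpy as np

rng = np.random.default_rng(0)
n = 20                                   # number of training examples
data = rng.normal(size=n)                # toy training examples

def train_and_eval(mask):
    """Toy 'model output': sum of the included training points.
    Chosen so the output is exactly linear in the inclusion mask."""
    return data[mask].sum()

# Sample random subsets; record (inclusion indicator, model output) pairs.
m = 500
X = rng.random((m, n)) < 0.5             # random inclusion masks
y = np.array([train_and_eval(x) for x in X])

# Fit the linear surrogate: output ~ w . indicator + b.
A = np.column_stack([X.astype(float), np.ones(m)])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
w, b = coef[:n], coef[-1]

# Predict the counterfactual of dropping example 0 from the full dataset,
# then compare against actually retraining without it.
drop0 = np.ones(n, dtype=bool)
drop0[0] = False
predicted = w @ drop0 + b
actual = train_and_eval(drop0)
print(f"predicted counterfactual: {predicted:.4f}, actual: {actual:.4f}")
```

Because the toy output is exactly linear in the inclusion indicators, the surrogate's counterfactual prediction matches retraining to numerical precision; for real models the interesting question, addressed in the talk, is how well such surrogates hold up.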