Algorithms and Hardness for Attention and Kernel Density Estimation

Published 2024-05-16 ● Video Link: https://www.youtube.com/watch?v=6Dsf1E6ZGP8
Duration: 1:00:37


A Google TechTalk, presented by Josh Alman, 2024-05-16
Google Algorithms Seminar. ABSTRACT: This talk will focus on two closely related computational problems. The first is Attention, the task at the core of Transformers and other Large Language Models. The second is Kernel Density Estimation, a statistical task which has seen applications ranging from machine learning to computational physics.
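To make the two problems concrete, here is a minimal sketch of their straightforward quadratic-time formulations. The function names, the Gaussian-kernel choice, and the omitted 1/sqrt(d) softmax scaling are illustrative simplifications, not details from the talk:

```python
import numpy as np

# A minimal sketch of the two problems, assuming a Gaussian kernel for KDE
# and the standard softmax formulation of Attention.

def attention(Q, K, V):
    """Attention: softmax(Q K^T) V, forming the full n x n matrix -- O(n^2) time."""
    A = np.exp(Q @ K.T)                       # A[i, j] = exp(q_i . k_j)
    return (A / A.sum(axis=1, keepdims=True)) @ V

def gaussian_kde(queries, data, bandwidth=1.0):
    """KDE(q) = (1/n) * sum_i exp(-||q - x_i||^2 / bandwidth) -- O(n*m) time
    for m queries against n data points."""
    sq_dists = ((queries[:, None, :] - data[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dists / bandwidth).mean(axis=1)
```

In both cases the cost comes from an all-pairs interaction between n points, which is exactly the quadratic bottleneck the talk addresses.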

Both of these problems have straightforward quadratic-time algorithms, and I'll discuss a recent line of work investigating when faster algorithms are possible. I'll survey two algorithmic techniques that lead to almost linear-time algorithms in different parameter regimes: the Fast Multipole Method and the polynomial method. I'll then give an overview of our new fine-grained hardness results, which prove that these techniques are essentially optimal in the parameter regimes where they apply, and highlight the situations where improvements may be possible.
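As one illustration of the polynomial-method regime, the sketch below (my own simplification, not code from the talk) replaces exp with a truncated Taylor series. When the entries of Q K^T are bounded, this turns the attention matrix into a low-rank product that can be applied without ever forming the n x n matrix:

```python
import math
from itertools import product
import numpy as np

def poly_features(X, degree):
    """Map each row x to phi(x) with phi(q) . phi(k) = sum_{t<=degree} (q.k)^t / t!,
    i.e. a truncated Taylor expansion of exp(q . k)."""
    n, d = X.shape
    feats = [np.ones((n, 1))]                                  # t = 0 term
    for t in range(1, degree + 1):
        # All ordered degree-t monomials x_{i1} * ... * x_{it}; their pairwise
        # dot products sum to (q . k)^t, and the 1/sqrt(t!) scaling contributes
        # the 1/t! Taylor coefficient.
        cols = [np.prod(X[:, list(idx)], axis=1)
                for idx in product(range(d), repeat=t)]
        feats.append(np.stack(cols, axis=1) / math.sqrt(math.factorial(t)))
    return np.hstack(feats)

def fast_attention(Q, K, V, degree=4):
    """Approximate softmax(Q K^T) V in time linear in n (for fixed d and degree),
    accurate when the entries of Q K^T are small in magnitude."""
    PQ, PK = poly_features(Q, degree), poly_features(K, degree)
    numer = PQ @ (PK.T @ V)              # ~ exp(Q K^T) V via the low-rank factors
    denom = PQ @ PK.sum(axis=0)          # ~ row sums of exp(Q K^T)
    return numer / denom[:, None]
```

The feature dimension here grows like d^degree, so this is only practical for small dimension d, and the truncation is only accurate when the dot products are bounded; that bounded-entry requirement is precisely the kind of parameter regime in which the talk's hardness results show these techniques are essentially optimal.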

Based on joint work with Amol Aggarwal (in CCC 2022), Zhao Song (in NeurIPS 2023), and Yunfeng Guan (to appear in CCC 2024).

About the Speaker: Josh Alman is an Assistant Professor of Computer Science at Columbia University. He works on algorithm design and complexity theory, focusing especially on algebraic tools for speeding up algorithms.