This is a game changer! (AlphaTensor by DeepMind explained)

Channel:

Yannic Kilcher

Subscribers:

298,000

Published on October 7, 2022 10:45:51 PM ● Video Link: https://www.youtube.com/watch?v=3N3Bl5AA5QU

Category:

Let's Play

Duration: 55:07

175,657 views

4,847

#alphatensor #deepmind #ai

Matrix multiplication is the most used mathematical operation in all of science and engineering. Speeding this up has massive consequences. Thus, over the years, this operation has become more and more optimized. A fascinating discovery was made when it was shown that one actually needs less than N^3 multiplication operations to multiply to NxN matrices. DeepMind goes a step further and creates AlphaTensor, a Deep Reinforcement Learning algorithm that plays a single-player game, TensorGame, in order to find even more optimized algorithms for matrix multiplication. And it turns out, there exists a plethora of undiscovered matrix multiplication algorithms, which not only will make everything from computers to smart toasters faster, but also bring new insights into fundamental math and complexity theory.

Sponsor: Assembly AI
Link: https://www.assemblyai.com/?utm_source=youtube&utm_medium=social&utm_campaign=yannic_sentiment

OUTLINE:
0:00 - Intro
1:50 - Sponsor: Assembly AI (link in description)
3:25 - What even is Matrix Multiplication?
6:10 - A very astounding fact
8:45 - Trading multiplications for additions
12:35 - Matrix Multiplication as a Tensor
17:30 - Tensor Decompositions
20:30 - A formal way of finding multiplication algorithms
31:00 - How to formulate this as a game?
39:30 - A brief primer on AlphaZero / MCTS
45:40 - The Results
48:15 - Optimizing for different hardware
52:40 - Expanding fundamental math
53:45 - Summary & Final Comments

Paper: https://www.nature.com/articles/s41586-022-05172-4
Title: Discovering faster matrix multiplication algorithms with reinforcement learning

Abstract:
Improving the efficiency of algorithms for fundamental computations can have a widespread impact, as it can affect the overall speed of a large amount of computations. Matrix multiplication is one such primitive task, occurring in many systems—from neural networks to scientific computing routines. The automatic discovery of algorithms using machine learning offers the prospect of reaching beyond human intuition and outperforming the current best human-designed algorithms. However, automating the algorithm discovery procedure is intricate, as the space of possible algorithms is enormous. Here we report a deep reinforcement learning approach based on AlphaZero1 for discovering efficient and provably correct algorithms for the multiplication of arbitrary matrices. Our agent, AlphaTensor, is trained to play a single-player game where the objective is finding tensor decompositions within a finite factor space. AlphaTensor discovered algorithms that outperform the state-of-the-art complexity for many matrix sizes. Particularly relevant is the case of 4 × 4 matrices in a finite field, where AlphaTensor’s algorithm improves on Strassen’s two-level algorithm for the first time, to our knowledge, since its discovery 50 years ago2. We further showcase the flexibility of AlphaTensor through different use-cases: algorithms with state-of-the-art complexity for structured matrix multiplication and improved practical efficiency by optimizing matrix multiplication for runtime on specific hardware. Our results highlight AlphaTensor’s ability to accelerate the process of algorithmic discovery on a range of problems, and to optimize for different criteria.

Authors: Alhussein Fawzi, Matej Balog, Aja Huang, Thomas Hubert, Bernardino Romera-Paredes, Mohammadamin Barekatain, Alexander Novikov, Francisco J. R. Ruiz, Julian Schrittwieser, Grzegorz Swirszcz, David Silver, Demis Hassabis & Pushmeet Kohli

Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

Other Videos By Yannic Kilcher

2022-12-26	Open Assistant Live Coding (Open-Source ChatGPT Replication)
2022-12-07	ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)
2022-11-27	[ML News] GPT-4 Rumors \| AI Mind Reading \| Neuron Interaction Solved \| AI Theorem Proving
2022-11-25	CICERO: An AI agent that negotiates, persuades, and cooperates with people
2022-11-19	Galactica: A Large Language Model for Science (Drama & Paper Review)
2022-11-13	[ML News] Multiplayer Stable Diffusion \| OpenAI needs more funding \| Text-to-Video models incoming
2022-11-09	The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)
2022-11-04	ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
2022-11-01	Is Stability turning into OpenAI?
2022-10-21	Neural Networks are Decision Trees (w/ Alexander Mattick)
2022-10-07	This is a game changer! (AlphaTensor by DeepMind explained)
2022-10-02	[ML News] OpenAI's Whisper \| Meta Reads Brain Waves \| AI Wins Art Fair, Annoys Humans
2022-09-18	[ML News] Stable Diffusion Takes Over! (Open Source AI Art)
2022-09-17	How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit
2022-09-13	More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)
2022-09-02	The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)
2022-08-26	The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)
2022-08-13	The Man behind Stable Diffusion
2022-08-10	[ML News] AI models that write code (Copilot, CodeWhisperer, Pangu-Coder, etc.)
2022-08-07	[ML News] Text-to-Image models are taking over! (Imagen, DALL-E 2, Midjourney, CogView 2 & more)
2022-07-31	[ML News] This AI completes Wikipedia! Meta AI Sphere \| Google Minerva \| GPT-3 writes a paper

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

deepmind

deep mind

deepmind alphatensor

alpha tensor

deepmind math

google deep mind

google deepmind

matrix multiplication

ai matrix multiplication

matrix multiplication reinforcement learning

alphazero

alpha zero

alphazero math

deep learning tutorial

introduction to deep learning

what is deep learning

alphatensor explained

alpha tensor explained

Channel	Latest
またいち(うし) ~ ソロで遊ぶひと	6 hours ago
Shotgun Chanel	6 hours ago
오늘의 코인뉴스	6 hours ago
jimi fails	6 hours ago
PC Garage	6 hours ago
Forestechoes Ai Music	6 hours ago
strohkoenig	6 hours ago
ZMEY	6 hours ago
Axelrod	6 hours ago
Charles	6 hours ago
Northlight TV	6 hours ago
CarCrashStoriesBeamNG	6 hours ago
Gram.pl	6 hours ago
XanderSweetman	6 hours ago
The Roman Catholic Diocese of Phoenix	6 hours ago
tryclo	7 hours ago
Ptitseb	7 hours ago
Milaya Mila	7 hours ago
TheGamesEntertainer	7 hours ago
Android Let's Play Official	7 hours ago
Gazeta Sporturilor	7 hours ago
Alexi Michaud	7 hours ago
Grzechu 40	7 hours ago
AnaitGames	7 hours ago
けんき	7 hours ago