This is a game changer! (AlphaTensor by DeepMind explained)

Video Link: https://www.youtube.com/watch?v=3N3Bl5AA5QU



Duration: 55:07


#alphatensor #deepmind #ai

Matrix multiplication is one of the most used mathematical operations in all of science and engineering, so speeding it up has massive consequences. Over the years, this operation has therefore become more and more optimized. A fascinating discovery was made when it was shown that one actually needs fewer than N^3 multiplication operations to multiply two NxN matrices. DeepMind goes a step further and creates AlphaTensor, a Deep Reinforcement Learning algorithm that plays a single-player game, TensorGame, in order to find even more optimized algorithms for matrix multiplication. And it turns out there exists a plethora of undiscovered matrix multiplication algorithms, which could not only make everything from computers to smart toasters faster, but also bring new insights into fundamental math and complexity theory.
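
To make the "fewer than N^3 multiplications" claim concrete, here is a minimal sketch of Strassen's classic scheme for 2x2 matrices, which uses 7 scalar multiplications instead of the naive 8. The function names (strassen_2x2, naive_2x2) are illustrative choices, not taken from the video or the paper.

```python
def naive_2x2(A, B):
    """Schoolbook product of two 2x2 matrices: 8 scalar multiplications."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def strassen_2x2(A, B):
    """Strassen's scheme: 7 scalar multiplications, paid for with extra additions."""
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B

    # The seven products. Each one multiplies a sum of A entries
    # by a sum of B entries.
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)

    # Recombine the products using additions and subtractions only.
    return [[m1 + m4 - m5 + m7, m3 + m5],
            [m2 + m4, m1 - m2 + m3 + m6]]


if __name__ == "__main__":
    A = [[1, 2], [3, 4]]
    B = [[5, 6], [7, 8]]
    print(strassen_2x2(A, B) == naive_2x2(A, B))
```

Because the entries can themselves be matrix blocks, the scheme can be applied recursively: a 4x4 product split into 2x2 blocks needs 7 * 7 = 49 multiplications instead of 4^3 = 64.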

Sponsor: Assembly AI
Link: https://www.assemblyai.com/?utm_source=youtube&utm_medium=social&utm_campaign=yannic_sentiment

OUTLINE:
0:00 - Intro
1:50 - Sponsor: Assembly AI (link in description)
3:25 - What even is Matrix Multiplication?
6:10 - A very astounding fact
8:45 - Trading multiplications for additions
12:35 - Matrix Multiplication as a Tensor
17:30 - Tensor Decompositions
20:30 - A formal way of finding multiplication algorithms
31:00 - How to formulate this as a game?
39:30 - A brief primer on AlphaZero / MCTS
45:40 - The Results
48:15 - Optimizing for different hardware
52:40 - Expanding fundamental math
53:45 - Summary & Final Comments

Paper: https://www.nature.com/articles/s41586-022-05172-4
Title: Discovering faster matrix multiplication algorithms with reinforcement learning

Abstract:
Improving the efficiency of algorithms for fundamental computations can have a widespread impact, as it can affect the overall speed of a large amount of computations. Matrix multiplication is one such primitive task, occurring in many systems, from neural networks to scientific computing routines. The automatic discovery of algorithms using machine learning offers the prospect of reaching beyond human intuition and outperforming the current best human-designed algorithms. However, automating the algorithm discovery procedure is intricate, as the space of possible algorithms is enormous. Here we report a deep reinforcement learning approach based on AlphaZero for discovering efficient and provably correct algorithms for the multiplication of arbitrary matrices. Our agent, AlphaTensor, is trained to play a single-player game where the objective is finding tensor decompositions within a finite factor space. AlphaTensor discovered algorithms that outperform the state-of-the-art complexity for many matrix sizes. Particularly relevant is the case of 4 × 4 matrices in a finite field, where AlphaTensor’s algorithm improves on Strassen’s two-level algorithm for the first time, to our knowledge, since its discovery 50 years ago. We further showcase the flexibility of AlphaTensor through different use-cases: algorithms with state-of-the-art complexity for structured matrix multiplication and improved practical efficiency by optimizing matrix multiplication for runtime on specific hardware. Our results highlight AlphaTensor’s ability to accelerate the process of algorithmic discovery on a range of problems, and to optimize for different criteria.
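
The "game" in the abstract can be illustrated with a small sketch. It is a simplified reading, not DeepMind's implementation. The 2x2 matrix multiplication is written as a 4x4x4 tensor, each move subtracts a rank-one tensor u ⊗ v ⊗ w whose entries come from a small set of integers, and the game is won when the residual tensor is all zeros. The number of moves equals the number of scalar multiplications in the resulting algorithm. The row-major flattening of matrix entries and all names (matmul_tensor, play, STRASSEN_MOVES) are assumptions made for this example. The moves listed encode Strassen's 7-multiplication algorithm.

```python
def matmul_tensor(n):
    """Tensor T with T[i][j][k] = 1 when A-entry i times B-entry j
    contributes to C-entry k (entries flattened row-major, an assumed
    convention for this sketch)."""
    size = n * n
    T = [[[0] * size for _ in range(size)] for _ in range(size)]
    for r in range(n):
        for m in range(n):
            for c in range(n):
                # a[r][m] * b[m][c] is one term of c[r][c]
                T[r * n + m][m * n + c][r * n + c] = 1
    return T


# One (u, v, w) triple per move. Index order: (x11, x12, x21, x22).
# u selects the combination of A entries, v the combination of B entries,
# and w says which C entries receive the product, and with which sign.
STRASSEN_MOVES = [
    ((1, 0, 0, 1), (1, 0, 0, 1), (1, 0, 0, 1)),
    ((0, 0, 1, 1), (1, 0, 0, 0), (0, 0, 1, -1)),
    ((1, 0, 0, 0), (0, 1, 0, -1), (0, 1, 0, 1)),
    ((0, 0, 0, 1), (-1, 0, 1, 0), (1, 0, 1, 0)),
    ((1, 1, 0, 0), (0, 0, 0, 1), (-1, 1, 0, 0)),
    ((-1, 0, 1, 0), (1, 1, 0, 0), (0, 0, 0, 1)),
    ((0, 1, 0, -1), (0, 0, 1, 1), (1, 0, 0, 0)),
]


def play(T, moves):
    """Subtract the rank-one tensor of every move from a copy of T
    and return the residual."""
    residual = [[row[:] for row in plane] for plane in T]
    for u, v, w in moves:
        for i, ui in enumerate(u):
            for j, vj in enumerate(v):
                for k, wk in enumerate(w):
                    residual[i][j][k] -= ui * vj * wk
    return residual


def is_zero(T):
    """True when every entry of the tensor is zero, i.e. the game is solved."""
    return all(x == 0 for plane in T for row in plane for x in row)


if __name__ == "__main__":
    residual = play(matmul_tensor(2), STRASSEN_MOVES)
    print("solved:", is_zero(residual), "in", len(STRASSEN_MOVES), "moves")
```

Finding a shorter sequence of moves that still reaches the zero tensor corresponds to finding an algorithm with fewer multiplications, which is what makes the search space so large: every move picks three vectors at once.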

Authors: Alhussein Fawzi, Matej Balog, Aja Huang, Thomas Hubert, Bernardino Romera-Paredes, Mohammadamin Barekatain, Alexander Novikov, Francisco J. R. Ruiz, Julian Schrittwieser, Grzegorz Swirszcz, David Silver, Demis Hassabis & Pushmeet Kohli

Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n




Other Videos By Yannic Kilcher


2022-12-26 Open Assistant Live Coding (Open-Source ChatGPT Replication)
2022-12-07 ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)
2022-11-27 [ML News] GPT-4 Rumors | AI Mind Reading | Neuron Interaction Solved | AI Theorem Proving
2022-11-25 CICERO: An AI agent that negotiates, persuades, and cooperates with people
2022-11-19 Galactica: A Large Language Model for Science (Drama & Paper Review)
2022-11-13 [ML News] Multiplayer Stable Diffusion | OpenAI needs more funding | Text-to-Video models incoming
2022-11-09 The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)
2022-11-04 ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
2022-11-01 Is Stability turning into OpenAI?
2022-10-21 Neural Networks are Decision Trees (w/ Alexander Mattick)
2022-10-07 This is a game changer! (AlphaTensor by DeepMind explained)
2022-10-02 [ML News] OpenAI's Whisper | Meta Reads Brain Waves | AI Wins Art Fair, Annoys Humans
2022-09-18 [ML News] Stable Diffusion Takes Over! (Open Source AI Art)
2022-09-17 How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit
2022-09-13 More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)
2022-09-02 The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)
2022-08-26 The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)
2022-08-13 The Man behind Stable Diffusion
2022-08-10 [ML News] AI models that write code (Copilot, CodeWhisperer, Pangu-Coder, etc.)
2022-08-07 [ML News] Text-to-Image models are taking over! (Imagen, DALL-E 2, Midjourney, CogView 2 & more)
2022-07-31 [ML News] This AI completes Wikipedia! Meta AI Sphere | Google Minerva | GPT-3 writes a paper



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper
deepmind
deep mind
deepmind alphatensor
alpha tensor
deepmind math
google deep mind
google deepmind
matrix multiplication
ai matrix multiplication
matrix multiplication reinforcement learning
alphazero
alpha zero
alphazero math
deep learning tutorial
introduction to deep learning
what is deep learning
alphatensor explained
alpha tensor explained