Chip Placement with Deep Reinforcement Learning (Paper Explained)

Channel:

Yannic Kilcher

Subscribers:

291,000

Published on May 4, 2020 1:29:02 PM ● Video Link: https://www.youtube.com/watch?v=PDRtyrVskMU

Duration: 27:27

10,213 views

339

The AI Singularity is here! Computers designing new computers! It takes human experts multiple weeks to design new computer chips. What looks like a large game of Tetris is actually a very complex optimization problem. This paper uses Deep Reinforcement Learning to solve this optimization both faster and better than humans.

https://arxiv.org/abs/2004.10746

Abstract:
In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously unseen chip blocks. To achieve these results, we pose placement as a Reinforcement Learning (RL) problem and train an agent to place the nodes of a chip netlist onto a chip canvas. To enable our RL policy to generalize to unseen blocks, we ground representation learning in the supervised task of predicting placement quality. By designing a neural architecture that can accurately predict reward across a wide variety of netlists and their placements, we are able to generate rich feature embeddings of the input netlists. We then use this architecture as the encoder of our policy and value networks to enable transfer learning. Our objective is to minimize PPA (power, performance, and area), and we show that, in under 6 hours, our method can generate placements that are superhuman or comparable on modern accelerator netlists, whereas existing baselines require human experts in the loop and take several weeks.

Authors: Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Jiang, Ebrahim Songhori, Shen Wang, Young-Joon Lee, Eric Johnson, Omkar Pathak, Sungmin Bae, Azade Nazi, Jiwoo Pak, Andy Tong, Kavya Srinivasa, William Hang, Emre Tuncer, Anand Babu, Quoc V. Le, James Laudon, Richard Ho, Roger Carpenter, Jeff Dean

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher

Other Videos By Yannic Kilcher

2020-05-14	[Trash] Automated Inference on Criminality using Face Images
2020-05-13	Faster Neural Network Training with Data Echoing (Paper Explained)
2020-05-12	Group Normalization (Paper Explained)
2020-05-11	Concept Learning with Energy-Based Models (Paper Explained)
2020-05-10	[News] Google’s medical AI was super accurate in a lab. Real life was a different story.
2020-05-09	Big Transfer (BiT): General Visual Representation Learning (Paper Explained)
2020-05-08	Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)
2020-05-07	WHO ARE YOU? 10k Subscribers Special (w/ Channel Analytics)
2020-05-06	Reinforcement Learning with Augmented Data (Paper Explained)
2020-05-05	TAPAS: Weakly Supervised Table Parsing via Pre-training (Paper Explained)
2020-05-04	Chip Placement with Deep Reinforcement Learning (Paper Explained)
2020-05-03	I talk to the new Facebook Blender Chatbot
2020-05-02	Jukebox: A Generative Model for Music (Paper Explained)
2020-05-01	[ML Coding Tips] Separate Computation & Plotting using locals
2020-04-30	The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies (Paper Explained)
2020-04-29	Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask (Paper Explained)
2020-04-28	[Rant] Online Conferences
2020-04-27	Do ImageNet Classifiers Generalize to ImageNet? (Paper Explained)
2020-04-26	[Drama] Schmidhuber: Critique of Honda Prize for Dr. Hinton
2020-04-25	How much memory does Longformer use?
2020-04-24	Supervised Contrastive Learning

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

reinforcement learning

deep reinforcement learning

gans

gan

deconvolution

computer chip

gpu

tpu

fpga

netlist

constrained

google

Channel	Latest
Skyprince777	13 hours ago
Tsubasa Yozora Ch.	13 hours ago
USIX Pro Gaming	14 hours ago
Arcade City	19 hours ago
alanzoka	20 hours ago
AnimeToons	20 hours ago
Flik's Gaming Stuff	21 hours ago
The Mexican Runner	22 hours ago
Beyond the Brick	22 hours ago
Spuffi	23 hours ago
442oons	1 day ago
Nintendo Life	1 day ago
Tamae	1 day ago
IntroGameOver	1 day ago
Dowell	1 day ago
Badaw Gaming	1 day ago
lugeyps3	1 day ago
CarbotAnimations	1 day ago
Pixelorez	1 day ago
Primal Koopa Pictures	1 day ago
BeastBoyShub	1 day ago
816	1 day ago
AoDzTo - อ๊อดโตะ	1 day ago
Chroma	1 day ago
Unnie Cj	1 day ago