Faster Neural Network Training with Data Echoing (Paper Explained)

Subscribers: 284,000
Video Link: https://www.youtube.com/watch?v=bFn2xcGi1TQ
Duration: 39:18
Views: 6,958
Likes: 194

CPUs are often the bottleneck in machine learning pipelines. Data fetching, loading, preprocessing, and augmentation can be so slow that the GPUs sit mostly idle. Data echoing is a technique that re-uses data already in the pipeline to reclaim this idle time and keep the GPUs busy at all times.

https://arxiv.org/abs/1907.05550

Abstract:
In the twilight of Moore's law, GPUs and other specialized hardware accelerators have dramatically sped up neural network training. However, earlier stages of the training pipeline, such as disk I/O and data preprocessing, do not run on accelerators. As accelerators continue to improve, these earlier stages will increasingly become the bottleneck. In this paper, we introduce "data echoing," which reduces the total computation used by earlier pipeline stages and speeds up training whenever computation upstream from accelerators dominates the training time. Data echoing reuses (or "echoes") intermediate outputs from earlier pipeline stages in order to reclaim idle capacity. We investigate the behavior of different data echoing algorithms on various workloads, for various amounts of echoing, and for various batch sizes. We find that in all settings, at least one data echoing algorithm can match the baseline's predictive performance using less upstream computation. We measured a factor of 3.25 decrease in wall-clock time for ResNet-50 on ImageNet when reading training data over a network.
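The core idea from the abstract can be sketched as a small generator that repeats ("echoes") each item coming out of the slow upstream pipeline a fixed number of times, optionally shuffling echoed items in a buffer so the accelerator does not see exact repeats back-to-back. This is a minimal illustration, not the paper's actual implementation; the names `data_echoing`, `echo_factor`, and `shuffle_buffer` are made up for this sketch.

```python
import random

def data_echoing(pipeline, echo_factor=2, shuffle_buffer=0):
    """Yield each upstream item `echo_factor` times so the accelerator
    stays busy while slow I/O / preprocessing catches up.

    Illustrative sketch only; parameter names are assumptions,
    not from the paper's code.
    """
    buffer = []
    for item in pipeline:
        for _ in range(echo_factor):
            if shuffle_buffer:
                # Buffer echoed items and shuffle before emitting,
                # so repeats are not seen consecutively downstream.
                buffer.append(item)
                if len(buffer) >= shuffle_buffer:
                    random.shuffle(buffer)
                    yield from buffer
                    buffer.clear()
            else:
                yield item
    if buffer:
        random.shuffle(buffer)
        yield from buffer

# Each "expensive" upstream example is consumed twice downstream,
# halving the required upstream throughput.
upstream = iter([0, 1, 2])
echoed = list(data_echoing(upstream, echo_factor=2))
```

With `echo_factor=2` and no shuffle buffer, the three upstream items become six downstream batches, which is how echoing trades repeated data for reclaimed accelerator idle time.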

Authors: Dami Choi, Alexandre Passos, Christopher J. Shallue, George E. Dahl

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher




Other Videos By Yannic Kilcher


2020-05-23  [News] The NeurIPS Broader Impact Statement
2020-05-22  When BERT Plays the Lottery, All Tickets Are Winning (Paper Explained)
2020-05-21  [News] OpenAI Model Generates Python Code
2020-05-20  Investigating Human Priors for Playing Video Games (Paper & Demo)
2020-05-19  iMAML: Meta-Learning with Implicit Gradients (Paper Explained)
2020-05-18  [Code] PyTorch sentiment classifier from scratch with Huggingface NLP Library (Full Tutorial)
2020-05-17  Planning to Explore via Self-Supervised World Models (Paper Explained)
2020-05-16  [News] Facebook's Real-Time TTS system runs on CPUs only!
2020-05-15  Weight Standardization (Paper Explained)
2020-05-14  [Trash] Automated Inference on Criminality using Face Images
2020-05-13  Faster Neural Network Training with Data Echoing (Paper Explained)
2020-05-12  Group Normalization (Paper Explained)
2020-05-11  Concept Learning with Energy-Based Models (Paper Explained)
2020-05-10  [News] Google's medical AI was super accurate in a lab. Real life was a different story.
2020-05-09  Big Transfer (BiT): General Visual Representation Learning (Paper Explained)
2020-05-08  Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)
2020-05-07  WHO ARE YOU? 10k Subscribers Special (w/ Channel Analytics)
2020-05-06  Reinforcement Learning with Augmented Data (Paper Explained)
2020-05-05  TAPAS: Weakly Supervised Table Parsing via Pre-training (Paper Explained)
2020-05-04  Chip Placement with Deep Reinforcement Learning (Paper Explained)
2020-05-03  I talk to the new Facebook Blender Chatbot



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper
google
brain
pipeline
bottleneck
speed
gpu
tpu
idle
network
distributed
preprocessing
augmentation