Efficient and Scalable Deep Learning
In deep learning, researchers keep achieving higher performance by using larger models. However, two obstacles block the community from building larger models: (1) training larger models is more time-consuming, which slows down model design exploration, and (2) inference with larger models is also slow, which prevents their deployment in computation-constrained applications. In this talk, I will introduce some of our efforts to remove these obstacles. On the training side, we propose TernGrad to reduce the communication bottleneck and scale up distributed deep learning; on the inference side, we propose structurally sparse neural networks to remove redundant neural components for faster inference. At the end, I will very briefly introduce (1) my recent efforts to accelerate AutoML, and (2) future work that applies my research to overcome scaling issues in Natural Language Processing.
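To give a flavor of the training-side idea, here is a minimal NumPy sketch of TernGrad-style stochastic gradient ternarization, where each worker quantizes its gradient to three levels before communication. The function name and use of NumPy are illustrative assumptions for this sketch, not the talk's actual implementation.

```python
import numpy as np

def ternarize_gradient(grad, rng=None):
    """Stochastically quantize a gradient tensor to the three levels {-s, 0, +s}.

    Sketch of the TernGrad idea: s is the maximum absolute gradient value,
    and each component keeps its sign with probability |g_i| / s, so the
    quantized gradient is an unbiased estimate of the original gradient.
    """
    if rng is None:
        rng = np.random.default_rng()
    s = np.max(np.abs(grad))
    if s == 0:
        return np.zeros_like(grad)
    keep_prob = np.abs(grad) / s              # probability of sending a non-zero
    mask = rng.random(grad.shape) < keep_prob
    return s * np.sign(grad) * mask           # values in {-s, 0, +s}

# Each worker ternarizes its local gradient before sending it to the server,
# so only the scaler s plus a 2-bit code per component needs to be transmitted,
# which cuts the communication cost of distributed training.
```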
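For the inference-side idea, the sketch below illustrates structured sparsity via a group-Lasso penalty over the output filters of a convolutional layer: whole filters are driven to zero and can then be removed, giving a smaller dense layer that runs faster. Again, the function and tensor layout are assumptions made for illustration only.

```python
import numpy as np

def group_lasso_penalty(conv_weight, lam=1e-4):
    """Group-Lasso regularizer over the output filters of a conv layer.

    conv_weight is assumed to have shape (out_channels, in_channels, kH, kW).
    Penalizing the L2 norm of each filter as a group pushes entire filters
    toward zero, so they can be pruned away for faster dense inference.
    """
    # L2 norm per output filter (one group per filter)
    filter_norms = np.sqrt(
        (conv_weight ** 2).reshape(conv_weight.shape[0], -1).sum(axis=1)
    )
    return lam * filter_norms.sum()

# Added to the training loss, this encourages structured (filter-level) sparsity
# rather than unstructured, element-wise sparsity, which is what makes the
# resulting network faster on standard hardware.
```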
Talk slides: https://www.microsoft.com/en-us/research/uploads/prod/2019/11/Efficient-and-Scalable-Deep-Learning-SLIDES.pdf
See more on this talk at Microsoft Research: https://www.microsoft.com/en-us/research/video/efficient-and-scalable-deep-learning/