From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=kPMaEYSywdI



Duration: 1:08:34
4,231 views
155


Deep neural networks have been trained to interpret images and text at increasingly high levels of accuracy. In many cases, these accuracy improvements are the result of developing increasingly large and computationally-intensive neural network models. These models tend to incur high latency during inference, especially when deployed on smartphones and edge-devices. In this talk, we present two lines of work that focus on mitigating the high cost of neural network inference on edge-devices. First, we review the last four years of progress in the computer vision (CV) community towards developing efficient neural networks for edge-devices, ranging from early work such as SqueezeNet, to recent work leveraging neural architecture search. Second, we present SqueezeBERT, a mobile-optimized neural network design for natural language processing (NLP) that draws on ideas from efficient CV network design. SqueezeBERT achieves a 4.3x speedup over BERT-base on a Pixel 3 smartphone. Finally, we believe that SqueezeBERT is just the beginning of several years of fruitful research in the NLP community to develop efficient neural architectures.

See more at https://www.microsoft.com/en-us/research/video/from-squeezenet-to-squeezebert-developing-efficient-deep-neural-networks/




Other Videos By Microsoft Research


2020-09-03Managing Animation Galleries in Microsoft Expressive Pixels (5 of 9)
2020-09-03Creating Animations in Microsoft Expressive Pixels (4 of 9)
2020-09-03Creating an Image in Microsoft Expressive Pixels (3 of 9)
2020-09-03Getting Started with Microsoft Expressive Pixels (2 of 9)
2020-09-03Welcome to Microsoft Expressive Pixels (1 of 9)
2020-09-03Importing Animations in Microsoft Expressive Pixels (9 of 9)
2020-09-03Galactic Bell Star Music Demo
2020-09-03Directional Sources & Listeners in Interactive Sound Propagation using Reciprocal Wave Field Coding
2020-09-01Platform for Situated Intelligence Overview
2020-08-28Programming with Proofs for High-assurance Software
2020-08-14From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
2020-08-12SkinnerDB: Regret Bounded Query Evaluation using RL
2020-08-07From Paper to Product
2020-08-06Can we make better software by using ML and AI techniques? With Chandra Maddila and Chetan Bansal
2020-08-05MineRL Competition 2020
2020-08-04Directions in ML: Algorithmic foundations of neural architecture search
2020-08-03Microsoft Urban Futures Summer Workshop | Policy and Social Impact [Day 3]
2020-08-03Microsoft Urban Futures Summer Workshop | Sensors and Data [Day 2]
2020-08-03Microsoft Urban Futures Summer Workshop | Data Driven Urban Transformation [Day 1]
2020-07-31Managing Tasks Across the Work-Life Boundary: Opportunities, Challenges, and Directions
2020-07-31Phong Surface: Efficient 3D Model Fitting using Lifted Optimization



Tags:
SqueezeNet
SqueezeBERT
mobile-optimized neural network
natural language processing
NLP
computer vision
Forrest Iandola
Sujeeth Bharadwaj
Microsoft Research