Efficient Large-Scale AI Workshop | Session 2: Training and inference efficiency

Subscribers:
343,000
Published on ● Video Link: https://www.youtube.com/watch?v=0VufaWL3Nu4



Duration: 2:15:57
1,719 views
20


This workshop was part of the Microsoft Research Summit 2022: https://www.microsoft.com/en-us/research/event/microsoft-research-summit-2022/

To bring AI to more people, models need to be cheaper to train and run, in terms of both computational and human resources. Thus, we will focus on increasing efficiency across various parts of the training and inference pipeline. 

Learn more about the Efficient Large-Scale AI Workshop: https://www.microsoft.com/en-us/research/event/efficient-large-scale-ai-workshop/

0:00 Efficient Vision Transformer
Song Han, Massachusetts Institute of Technology

38:30 Large Scale MoE Models into Cloud Scale Production with Highly Efficient Inference and Training
Young Jin Kim, Microsoft Translator
Hany Awadalla, Azure AI Cognitive Services

1:43:32 LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Mojan Javaheripi, Microsoft Research Redmond and University of California San Diego




Other Videos By Microsoft Research


2022-11-28SITI 2022 - Reporting from the Ground
2022-11-28SITI 2022 - Introductory talk by Sriram Rajamani
2022-11-28Microsoft Research Live Stream
2022-11-23Causal AI for Decision Making
2022-11-21MSR-IISc AI Seminar Series: Designing AI Systems w/Steerable Long-Term Dynamics - Thorsten Joachims
2022-11-17Research and Opportunities to support SMBs in Africa (lightning talk)
2022-11-16Food Security Workshop | Day 2: FarmVibes.AI Overview & Training
2022-11-16Food Security Workshop | Day 1: Modern R&D
2022-11-04Efficient Large-Scale AI Workshop | Session 1: Skills acquisition and new capabilities
2022-11-04Efficient Large-Scale AI Workshop | Session 3: Aligning models with human intent
2022-11-04Efficient Large-Scale AI Workshop | Session 2: Training and inference efficiency
2022-11-04Challenges in Evolving a Successful Database Product (SQL Server) to a Cloud Service (SQL Azure)
2022-11-03Improving text prediction accuracy using neurophysiology
2022-10-27Research talk: Storing data for millennia
2022-10-27Research talk: Low-latency ​Real-time Insights ​from Space
2022-10-27Research talk: Computing at the speed of light
2022-10-27Panel discussion: From privacy research to policy, regulations and standards
2022-10-27Lightning talks: The identity revolution: Centering trust on people
2022-10-27Research talks: An investigation into user expectations for differential privacy
2022-10-27Lightning talks: AI in healthcare
2022-10-27Lightning talks: AI in life sciences