Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT | AISC

Video Link: https://www.youtube.com/watch?v=aX4Tm1s01wY



Duration: 57:47


For slides and more information on the paper, visit https://ai.science/e/q-bert-hessian-based-ultra-low-precision-quantization-of-bert--t5TyN0LewFq33knWRHWY

Speaker: Amir Gholami; Host: Xi Chen

Motivation:
Q-BERT aims to enable efficient deployment at the edge, with lower inference cost and power consumption. High-accuracy inference at the edge also helps protect user privacy, since users' data would not need to be transmitted to the cloud for inference.




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-12-07 AlphaFold 2, Is Protein Folding Solved? | AISC
2020-12-04 Computer vision to deeply phenotype human diseases across physiological, tissue and molecular scales
2020-12-03 Serina Chang: Understanding the spread of COVID-19 using Social Network Models
2020-12-03 The Importance of Strategy in AI Product Management | AISC
2020-12-01 What is Wrong with Explainable AI? | AISC
2020-11-26 De-identification of patients’ protected health information (PHI) from medical free-text | AISC
2020-11-24 Interpretable Neural Networks for Panel Data Analysis in Economics | AISC
2020-11-23 Meet the MLOps Experts, a fireside chat for newcomers to AI careers
2020-11-23 Nodes, Edges and Properties; Graph Analysis Intro for ML Newcomers
2020-11-20 Operationalizing AI in Business at Scale | AISC
2020-11-19 Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT | AISC
2020-11-18 Augmented Out-of-sample Comparison Method for Time Series Forecasting Techniques | AISC
2020-11-13 Generating Ampicillin-Level Antimicrobial Peptides with Activity-Aware Generative Adversarial Net
2020-11-12 Disparate Interactions: An Algorithms-in-the-loop Analysis of Fairness in Risk Assessment | AISC
2020-11-11 [GATA] Learning Dynamic Belief Graphs to Generalize on Text-Based Games | AISC
2020-11-11 DrWhy.AI - Tools for Explainable Artificial Intelligence | AISC
2020-11-06 Similarity Search for Efficient Active Learning and Search of Rare Concepts | AISC
2020-11-06 Identifying Influential Individuals in Complex Networks: An Overview | AISC
2020-11-05 Da Xu (Walmart Labs): Inductive Representation Learning on Temporal Graphs | AISC
2020-11-04 AI, Democracy, & Disinformation
2020-10-29 Application of Bayesian neural networks for aircraft safety | AISC