Get Started Post-Training Dynamic Quantization | AI Model Optimization with Intel® Neural Compressor

Subscribers: 256,000
Video Link: https://www.youtube.com/watch?v=5xHKe4wWLes



Duration: 4:30
9,523 views


Learn the basics of dynamic quantization. Then see how it’s applied to a GPT2-based headline generation application using Intel Neural Compressor.

Dynamic quantization is the simplest method for quantizing AI models for efficient deployment. It quantizes the weights of a pre-trained model ahead of time and inserts functions into the model that quantize activations during inference. This adds runtime overhead, but it also lets the scale factor adapt dynamically as input ranges change.
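
The idea can be illustrated with a minimal sketch in plain Python. It is not Intel Neural Compressor code and not the method used in the video; the symmetric int8 scheme and the names compute_scale, quantize, and dynamic_dot are assumptions made for illustration only. Weights are quantized once, while the activation scale is recomputed on every call:

```python
from typing import List, Tuple

QMAX = 127  # assumed symmetric int8 range [-127, 127]


def compute_scale(values: List[float]) -> float:
    """Scale that maps the largest absolute value onto QMAX."""
    max_abs = max(abs(v) for v in values)
    return max_abs / QMAX if max_abs > 0 else 1.0


def quantize(values: List[float], scale: float) -> List[int]:
    """Round each value to the nearest integer step and clamp to the int8 range."""
    return [max(-QMAX, min(QMAX, round(v / scale))) for v in values]


# Weights are quantized once, before any inference happens.
WEIGHTS = [0.5, -1.0, 0.25]
W_SCALE = compute_scale(WEIGHTS)
W_Q = quantize(WEIGHTS, W_SCALE)


def dynamic_dot(activations: List[float]) -> Tuple[float, float]:
    """Dot product of quantized weights and dynamically quantized activations.

    Returns the dequantized result and the activation scale that was used.
    """
    # This per-call scale computation is the runtime overhead of the method.
    a_scale = compute_scale(activations)
    a_q = quantize(activations, a_scale)
    # Accumulate in integers, then convert back to floating point once.
    acc = sum(w * a for w, a in zip(W_Q, a_q))
    return acc * W_SCALE * a_scale, a_scale


if __name__ == "__main__":
    for acts in ([1.0, 2.0, 3.0], [10.0, 20.0, 30.0]):
        result, scale = dynamic_dot(acts)
        exact = sum(w * a for w, a in zip(WEIGHTS, acts))
        print(f"inputs={acts} scale={scale:.4f} quantized={result:.3f} exact={exact:.3f}")
```

Running the script with two input ranges shows the activation scale growing with the inputs while the quantized result stays close to the floating-point one, which is the adaptive behavior described above.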

Intel® Neural Compressor works across PyTorch*, TensorFlow*, and ONNX* Runtime. Learn how to implement dynamic quantization using a GPT2-based headline generation AI Reference Kit. The demonstration discusses options to customize the dynamic quantization process and shows the resulting speedup for this application.

Intel® Neural Compressor: https://bit.ly/3Nl6pVj

Intel® Neural Compressor GitHub: https://bit.ly/3NlBgkH

About the AI Model Optimization with Intel® Neural Compressor Series:
Learn how to choose and get started with AI model optimization techniques. Get started with examples using Intel® Neural Compressor, which works within PyTorch*, TensorFlow*, and ONNX* Runtime.

About Intel Software:
Intel® Developer Zone is committed to empowering and assisting software developers in creating applications for Intel hardware and software products. The Intel Software YouTube channel is an excellent resource for those seeking to enhance their knowledge. Our channel provides the latest news, helpful tips, and engaging product demos from Intel and our numerous industry partners. Our videos cover various topics; you can explore them further by following the links.

Connect with Intel Software:
INTEL SOFTWARE WEBSITE: https://intel.ly/2KeP1hD
INTEL SOFTWARE on FACEBOOK: http://bit.ly/2z8MPFF
INTEL SOFTWARE on TWITTER: http://bit.ly/2zahGSn
INTEL SOFTWARE GITHUB: http://bit.ly/2zaih6z
INTEL DEVELOPER ZONE LINKEDIN: http://bit.ly/2z979qs
INTEL DEVELOPER ZONE INSTAGRAM: http://bit.ly/2z9Xsby
INTEL GAME DEV TWITCH: http://bit.ly/2BkNshu

#intelsoftware #ai #oneapi

Other Videos By Intel Software


2023-07-13 Hugging Face + OpenVINO | Intel Software
2023-07-12 Start Post-Training Static Quantization | AI Model Optimization with Intel® Neural Compressor
2023-07-12 Hugging Face + OpenVINO™ | Intel Software
2023-07-06 PyTorch Update | Summer 2023 | oneAPI Dev News | Intel Software
2023-07-05 Crowd Simulation on Multiple Devices Using SYCL | Intel Software
2023-06-30 Empowering Students and Educators with Intel's oneAPI Educator Program for Heterogeneous Computing
2023-06-30 Generative AI with OpenVINO | OpenVINO DevCon | Intel Software
2023-06-29 June 2023 | oneAPI Dev News
2023-06-29 PyTorch Update | Summer 2023 | oneAPI Dev News | Intel Software
2023-06-28 SYCL Custom Device Selection | Intel Software
2023-06-28 Get Started Post-Training Dynamic Quantization | AI Model Optimization with Intel® Neural Compressor
2023-06-27 June 2023 | IDZ News | Intel Software
2023-06-22 June 2023 | oneAPI Dev News | Intel Software
2023-06-21 Intel® Graphics Performance Analyzers 2023.2 Release | Intel Software
2023-06-21 How to Choose AI Model Quantization Techniques | AI Model Optimization with Intel® Neural Compressor
2023-06-16 Revolutionizing Programming Education in China with Intel® oneAPI | Intel Software
2023-06-15 Exploring High-Performance Computing at USC with oneAPI | Intel Software
2023-06-14 What is AI Model Optimization | AI Model Optimization with Intel® Neural Compressor | Intel Software
2023-06-12 Screenshot Layer | Intel Graphics Performance Analyzers Framework Quick Tips | Intel Software
2023-06-06 Portable Performance Across CPUs and GPUs | Intel Software



Tags:
Intel Developer Zone
IDZ
Intel Software
Software Developer
Developer Tools
Software Tools
Developer
Intel
AI model optimization
deep learning
model compression