Automatically Quantize LLMs with AutoRound | Intel Software

Channel: Intel Software
Subscribers: 256,000
Published: May 20, 2025 2:00:30 PM
Video link: https://www.youtube.com/watch?v=LszyOPcajEQ
Views: 10,935

If you want to deploy faster, smaller language models without experimenting to find the right quantization settings for your deployment requirements, AutoRound makes it easy. You specify your model, a small amount of training data, the number of bits to quantize to, and whether to prioritize accuracy or speed; AutoRound then automatically tunes the weight rounding and clipping ranges. AutoRound supports CPUs, GPUs, and AI accelerators from multiple vendors. Learn how to get started with this coding LLM example.
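
To make "tuning the weight rounding and clipping ranges" concrete, here is a minimal sketch in plain Python. It is a toy illustration, not AutoRound's algorithm or API: it swaps in a simple grid search over a clipping ratio, where the real tool does its own, more sophisticated tuning. The function names, the sample weights, and the candidate ratios are all hypothetical and chosen only for illustration.

```python
# Toy illustration only: NOT AutoRound's algorithm or API.
# It shows why the clipping range matters when rounding weights to a few bits.

def quantize_dequantize(weights, bits, clip_ratio=1.0):
    """Symmetric quantization: scale, round, clip, then map back to floats."""
    qmax = 2 ** (bits - 1) - 1        # e.g. 7 for 4-bit signed integers
    qmin = -qmax - 1                  # e.g. -8 for 4-bit signed integers
    max_abs = max(abs(w) for w in weights) * clip_ratio
    if max_abs == 0:
        return [0.0 for _ in weights]
    scale = max_abs / qmax
    result = []
    for w in weights:
        q = round(w / scale)          # round to nearest (exact ties go to even)
        q = max(qmin, min(qmax, q))   # clip to the representable integer range
        result.append(q * scale)
    return result


def mean_squared_error(original, approximated):
    """Average squared difference between two equal-length lists."""
    return sum((a - b) ** 2 for a, b in zip(original, approximated)) / len(original)


def pick_clip_ratio(weights, bits, candidates):
    """Grid search (an assumption for this sketch): keep the lowest-error ratio."""
    return min(
        candidates,
        key=lambda r: mean_squared_error(weights, quantize_dequantize(weights, bits, r)),
    )


# Hypothetical weights: mostly small values plus one large outlier.
weights = [0.02, -0.05, 0.11, -0.08, 0.03, -0.01, 0.07, 0.9]
candidates = [1.0, 0.9, 0.8, 0.7, 0.6, 0.5]
bits = 4

best_ratio = pick_clip_ratio(weights, bits, candidates)
baseline_error = mean_squared_error(weights, quantize_dequantize(weights, bits, 1.0))
tuned_error = mean_squared_error(weights, quantize_dequantize(weights, bits, best_ratio))

print("chosen clip ratio:", best_ratio)
print("error without clipping:", baseline_error)
print("error with chosen ratio:", tuned_error)
```

Because the unclipped ratio 1.0 is among the candidates, the chosen ratio can never do worse than plain round-to-nearest on this error measure. For the tool's actual interface, refer to the AutoRound GitHub repo listed under Resources.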

Resources:
Learn more about AutoRound: https://huggingface.co/blog/autoround
AutoRound GitHub repo: https://github.com/intel/auto-round
Intel AI software resources: https://developer.intel.com/ai

About Intel Software:
Intel® Developer Zone is committed to empowering and assisting software developers in creating applications for Intel hardware and software products. The Intel Software YouTube channel is an excellent resource for those seeking to enhance their knowledge. Our channel provides the latest news, helpful tips, and engaging product demos from Intel and our numerous industry partners. Our videos cover various topics; you can explore them further by following the links.

Connect with Intel Software:
INTEL SOFTWARE WEBSITE: https://intel.ly/2KeP1hDD
INTEL SOFTWARE on FACEBOOK: http://bit.ly/2z8MPFFF
INTEL SOFTWARE on TWITTER: http://bit.ly/2zahGSnn
INTEL SOFTWARE GITHUB: http://bit.ly/2zaih6zz
INTEL DEVELOPER ZONE LINKEDIN: http://bit.ly/2z979qss
INTEL DEVELOPER ZONE INSTAGRAM: http://bit.ly/2z9Xsbyy
INTEL GAME DEV TWITCH: http://bit.ly/2BkNshuu

#intelsoftware