Dion: The distributed orthonormal update revolution is here

Video Link: https://www.youtube.com/watch?v=EtLdJ_vx6YE





Kwangjun Ahn, Senior Researcher at Microsoft Research AI Frontiers, introduces Dion, a next-generation optimizer in the style of Muon that orthonormalizes only the top-r subspace via amortized power iteration. Dion retains Muon's fast convergence while significantly reducing compute and communication, scaling efficiently with FSDP/TP for very large models.
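To make "orthonormalizes only the top-r subspace via amortized power iteration" concrete, here is a minimal single-device sketch in NumPy. It is an illustration under stated assumptions, not the API of the microsoft/dion package: the function names, the error-feedback momentum rule, and the hyperparameters `lr` and `mu` are hypothetical choices made for this example, and no FSDP/TP sharding is shown. The idea is that one power-iteration step per optimizer step, warm-started from the previous step's right factor `Q`, stands in for a full orthonormalization of the update matrix.

```python
import numpy as np

def low_rank_orthonormal_step(B, Q):
    """One power-iteration step on B (m x n), warm-started from Q (n x r).

    Returns P (m x r) with orthonormal columns and R (n x r).
    """
    P, _ = np.linalg.qr(B @ Q)   # reduced QR: orthonormal basis for span(B Q)
    R = B.T @ P                  # project B onto that basis
    return P, R

def dion_like_update(X, G, M, Q, lr=0.01, mu=0.95):
    """Hypothetical Dion-style update for a weight matrix X with gradient G.

    M is a momentum buffer (m x n); Q is the right factor carried across
    steps (n x r). Both the names and the exact rule are assumptions.
    """
    B = M + G                                  # accumulate gradient into buffer
    P, R = low_rank_orthonormal_step(B, Q)
    # Error feedback (assumed): remove only part of the captured rank-r
    # component, so what the approximation missed stays in the buffer.
    M_new = B - (1.0 - mu) * (P @ R.T)
    # Normalize columns of R so the next step starts from a well-scaled Q.
    norms = np.linalg.norm(R, axis=0, keepdims=True)
    Q_new = R / np.maximum(norms, 1e-8)
    # Rank-r update built from the orthonormal left factor.
    X_new = X - lr * (P @ Q_new.T)
    return X_new, M_new, Q_new

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    m, n, r = 8, 6, 2
    X = rng.standard_normal((m, n))
    M = np.zeros((m, n))
    Q = rng.standard_normal((n, r))
    for _ in range(3):
        G = rng.standard_normal((m, n))  # stand-in for a real gradient
        X, M, Q = dion_like_update(X, G, M, Q)
    print(X.shape, M.shape, Q.shape)
```

Because `Q` is reused from step to step, the cost of finding the dominant subspace is spread across optimizer steps rather than paid in full each time, and only the m x r and n x r factors would need to be exchanged in a distributed setting.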

Dion paper: https://arxiv.org/abs/2504.05295
Dion optimizer: https://github.com/microsoft/dion

This session aired on September 24, 2025, at Microsoft Research Forum, Season 2 Episode 1.

Register for the series to learn about future episodes: https://aka.ms/registerresearchforumYTs2e1
Continue watching this episode: https://aka.ms/researchforumYTs2e1
Explore all previous episodes: https://aka.ms/researchforumYTplaylist



