Pushing boundaries of complex reasoning in small language models

Channel:

Subscribers:

351,000

Published on September 24, 2025 3:46:56 PM ● Video Link: https://www.youtube.com/watch?v=BCn_Z90pWVM

Duration: 0:00

255 views

Mojan Javaheripi, Member of Technical Staff at Microsoft Research AI Frontiers, presents Phi-4-Reasoning and Phi-4-Reasoning-Plus—two 14B models designed to advance complex reasoning in small-scale language models. By introducing a dedicated “thinking block” and applying supervised fine-tuning and reinforcement learning on carefully curated STEM datasets, these models achieve major improvements in problem-solving capabilities.

Phi-4-Reasoning: https://huggingface.co/microsoft/Phi-4-reasoning
Phi-4-Reasoning-Plus: https://huggingface.co/microsoft/Phi-4-reasoning-plus
Phi-4 Reasoning paper (PDF): https://aka.ms/phi4reasoningPDF
Azure AI Foundry Model Catalog: https://aka.ms/AIFoundryModelCatalog
Microsoft/phi-4-gguf on Hugging Face: https://huggingface.co/microsoft/phi-4-gguf

This session aired on September 24, 2025, at Microsoft Research Forum, Season 2 Episode 1.

Register for the series to learn about future episodes: https://aka.ms/registerresearchforumYTs2e1
Continue watching this episode: https://aka.ms/researchforumYTs2e1
Explore all previous episodes: https://aka.ms/researchforumYTplaylist

Other Videos By Microsoft Research

2025-09-24	Understanding How Users Prepare for and React to Smartphone Theft
2025-09-24	When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs
2025-09-24	A Formal Analysis of Apple’s iMessage PQ3 Protocol
2025-09-24	Email Spoofing with SMTP Smuggling: How the Shared Email Infrastructures Magnify this Vulnerability
2025-09-24	A Framework for Abusability Analysis: The Case of Passkeys in Interpersonal Threat Models
2025-09-24	‘Hey mum, I dropped my phone down the toilet’: Investigating Hi Mum and Dad SMS Scams in the UK
2025-09-24	Dehumanizing machines: Making sense of AI systems that seem human
2025-09-24	Scalable emulation of protein equilibrium ensembles with BioEmu
2025-09-24	Disrupting the AI infrastructure with MicroLEDs
2025-09-24	Dion: The distributed orthonormal update revolution is here
2025-09-24	Pushing boundaries of complex reasoning in small language models
2025-09-22	zk-promises: Anonymous Moderation, Reputation, & Blocking from Anonymous Credentials with Callbacks
2025-09-22	More is Less: Extra Features in Contactless Payments Break Security
2025-09-18	Sub-Population Identification of Multi-morbidity in Sub-Saharan African Populations
2025-09-03	Echoes in GenAI generations
2025-08-27	Six Years of Rowhammer: Breakthroughs and Future Directions
2025-08-25	Sub-Population Identification of Multi-morbidity in Sub-Saharan African Populations
2025-08-19	MindJourney: Test-Time Scaling with World Models for Spatial Reasoning
2025-08-11	Medical Bayesian Kiosk (2010)
2025-08-07	Reimagining healthcare delivery and public health with AI
2025-08-05	VeriTrail: Detect hallucination and trace provenance in AI workflows

Channel	Latest
Hutts 2	9 hours ago
Kamar Rama	9 hours ago
USIX Pro Gaming	9 hours ago
AnimeToons	10 hours ago
AngryJoeShow	11 hours ago
Skyprince777	12 hours ago
Nintendo of America	13 hours ago
YaBoyRoshi	13 hours ago
Anton Petrov	14 hours ago
PopCross Studios	14 hours ago
alanzoka	15 hours ago
Aaronitmar	15 hours ago
IGN	16 hours ago
Kage848	16 hours ago
Flik's Gaming Stuff	16 hours ago
CHAQN2	16 hours ago
JoBlo Animated Videos	16 hours ago
Chroma	17 hours ago
Goodblue77	17 hours ago
ZGadgetReview	17 hours ago
Skurry	17 hours ago
Pecel Boy	17 hours ago
woclips	18 hours ago
DENZ TVLOG	18 hours ago
JMGames	18 hours ago