VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows
Dasha Metropolitansky, Research Data Scientist at Microsoft Research Special Projects, introduces VeriTrail, a new method for closed-domain hallucination detection (flagging generated content that is not grounded in the provided source text) in multi-step AI workflows. Unlike prior methods, VeriTrail provides traceability: it identifies the stage where hallucinated content was likely introduced, and it establishes the provenance of faithful content by tracing a path back to the source text. VeriTrail also outperforms baseline methods at detecting hallucinations. This combination of traceability and strong detection performance makes VeriTrail a powerful tool for auditing the integrity of content generated by language models.
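To make the traceability idea concrete, here is a toy sketch of backward tracing through a linear workflow (source text, then one or more intermediate outputs, then a final output). This is an illustration only, not VeriTrail's implementation: the function names are hypothetical, and `is_supported` is a naive word-overlap stand-in for the model-based verification the paper describes.

```python
def is_supported(claim: str, text: str, threshold: float = 0.8) -> bool:
    """Toy stand-in for a model-based verdict: treat the claim as supported
    if most of its words appear in the candidate text."""
    words = [w.lower().strip(".,") for w in claim.split()]
    hits = sum(1 for w in words if w in text.lower())
    return hits / max(len(words), 1) >= threshold


def trace_claim(claim: str, stages: list[str]) -> dict:
    """Trace a claim from the final output back toward the source.

    stages[0] is the source text; stages[-1] is the final output the claim
    was extracted from. Walking backward, a faithful claim yields a full
    provenance path ending at the source; an unsupported claim yields the
    stage where the content was likely introduced.
    """
    path = [len(stages) - 1]  # the claim appears in the final output
    for i in range(len(stages) - 2, -1, -1):
        if is_supported(claim, stages[i]):
            path.append(i)
        else:
            # Supported downstream of stage i but not by stage i itself:
            # the content was likely introduced at stage i + 1.
            return {"verdict": "hallucinated", "introduced_at": i + 1,
                    "path": path}
    return {"verdict": "faithful", "path": path}


stages = [
    "The cat sat on the mat.",                   # source text
    "A cat sat on a mat.",                       # intermediate output
    "The cat sat on the mat and sang opera.",    # final output
]
print(trace_claim("cat sat on mat", stages))     # faithful, path to source
print(trace_claim("the cat sang opera", stages)) # introduced at stage 2
```

A real system would replace `is_supported` with language-model verification of extracted claims (as in the Claimify work linked below) and would generalize the linear chain to an arbitrary workflow graph.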
Microsoft Research Special Projects VeriTrail paper: https://www.microsoft.com/en-us/research/publication/veritrail-closed-domain-hallucination-detection-with-traceability/
VeriTrail blog post: https://www.microsoft.com/en-us/research/blog/veritrail-detecting-hallucination-and-tracing-provenance-in-multi-step-ai-workflows/
Claimify video: https://www.microsoft.com/en-us/research/video/claimify-extracting-high-quality-claims-from-language-model-outputs/
Claimify paper: https://www.microsoft.com/en-us/research/publication/towards-effective-extraction-and-evaluation-of-factual-claims/
Claimify blog post: https://www.microsoft.com/en-us/research/blog/claimify-extracting-high-quality-claims-from-language-model-outputs/