Accelerating Multilingual RAG Systems

Channel: Microsoft Research

Subscribers: 351,000

Published on January 2, 2025 6:38:50 PM ● Video Link: https://www.youtube.com/watch?v=usvu6Sk1ynk

829 views

As Retrieval-Augmented Generation (RAG) systems gain prominence for grounding large language models (LLMs) in external knowledge, sound evaluation frameworks are critical to accelerating progress across diverse languages. This talk introduces a comprehensive multilingual RAG evaluation pipeline comprising three components: retrieval, relevance assessment, and generation. Each component is backed by a dedicated resource: MIRACL, a multilingual retrieval dataset with high-quality relevance judgments annotated by native speakers; NoMIRACL, a benchmark for assessing relevance in multilingual RAG, designed to measure LLM robustness against retrieval errors; and MIRAGE-Bench, an arena-based multilingual RAG evaluation framework that integrates both heuristic metrics and surrogate judge models for multilingual generation evaluation. Together, these resources provide a foundation for advancing multilingual information access and enhancing the robustness of RAG systems. The talk highlights key findings from each component, along with open challenges and future work for multilingual RAG research.
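
To make the retrieval component concrete, here is a minimal Python sketch of how a retriever's ranked output might be scored against human relevance judgments. The abstract does not say which metrics the talk uses. Recall@k is assumed here only as a common choice, and the function names and toy data are hypothetical, not taken from MIRACL.

```python
from typing import Dict, List, Set


def recall_at_k(ranked_ids: List[str], relevant_ids: Set[str], k: int) -> float:
    """Fraction of the judged-relevant passages that appear in the top-k results."""
    if not relevant_ids:
        return 0.0
    # Use a set so a passage that appears twice in the ranking is counted once.
    hits = len(set(ranked_ids[:k]) & relevant_ids)
    return hits / len(relevant_ids)


def mean_recall_at_k(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]], k: int
) -> float:
    """Average Recall@k over all queries that have at least one relevant passage."""
    scores = [
        recall_at_k(runs.get(query_id, []), relevant, k)
        for query_id, relevant in qrels.items()
        if relevant
    ]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical toy data: relevance judgments (qrels) and retriever output (runs).
qrels = {"q1": {"d1", "d3"}, "q2": {"d7"}}
runs = {"q1": ["d1", "d2", "d3"], "q2": ["d5", "d6", "d7"]}

print(mean_recall_at_k(runs, qrels, k=2))  # q1 finds 1 of 2, q2 finds 0 of 1
print(mean_recall_at_k(runs, qrels, k=3))  # both queries find every relevant passage
```

In a multilingual setting, a score like this would typically be computed per language and then compared across languages.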

Speaker: Nandan Thakur, University of Waterloo, Canada

