Accelerating Multilingual RAG Systems

Video Link: https://www.youtube.com/watch?v=usvu6Sk1ynk



As Retrieval-Augmented Generation (RAG) systems gain prominence for grounding large language models (LLMs) in external knowledge, building evaluation frameworks is critical to accelerating progress across diverse languages. This talk introduces a comprehensive multilingual RAG evaluation pipeline comprising three key components: retrieval, relevance assessment, and generation. Each component is covered by one resource: MIRACL, a multilingual retrieval dataset with high-quality relevance judgments annotated by native speakers; NoMIRACL, a benchmark for assessing relevance in multilingual RAG, designed to measure LLM robustness against retrieval errors; and MIRAGE-Bench, an arena-based multilingual RAG evaluation framework that integrates both heuristic metrics and surrogate judge models for multilingual generation evaluation. Together, these resources provide a foundation for advancing multilingual information access and enhancing the robustness of RAG systems. The talk highlights key findings for each component, along with open challenges and future work for multilingual RAG research.
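
To make the three stages concrete, here is a minimal Python sketch of one toy measurement per stage: retrieval, relevance assessment, and generation. It is an illustration only, not the evaluation code or official metric definitions of MIRACL, NoMIRACL, or MIRAGE-Bench. The function names (`recall_at_k`, `false_answer_rate`, `arena_win_rates`), the toy inputs, and the choice of a plain win rate for the arena stage are all assumptions made for this example.

```python
from collections import defaultdict


def recall_at_k(ranked_ids, relevant_ids, k):
    """Retrieval stage: fraction of relevant passages found in the top-k list."""
    if not relevant_ids:
        return 0.0
    hits = len(set(ranked_ids[:k]) & set(relevant_ids))
    return hits / len(relevant_ids)


def false_answer_rate(predictions):
    """Relevance stage: share of queries with no relevant passage where the
    model still produced an answer (a hypothetical robustness measure).

    `predictions` is a list of (has_relevant_passage, model_answered) booleans.
    """
    no_answer_cases = [answered for has_rel, answered in predictions if not has_rel]
    if not no_answer_cases:
        return 0.0
    return sum(no_answer_cases) / len(no_answer_cases)


def arena_win_rates(pairwise_results):
    """Generation stage: win rate per system from (winner, loser) pairs,
    such as pairwise preferences produced by a judge model (assumed input)."""
    wins = defaultdict(int)
    games = defaultdict(int)
    for winner, loser in pairwise_results:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return {name: wins[name] / games[name] for name in games}


if __name__ == "__main__":
    # Toy data: document IDs, booleans, and system names are made up.
    print(recall_at_k(["d3", "d1", "d7"], {"d1", "d9"}, k=2))
    print(false_answer_rate([(True, True), (False, True), (False, False)]))
    print(arena_win_rates([("A", "B"), ("A", "C"), ("B", "C")]))
```

In practice each stage would be scored per language and then aggregated, and an arena-style evaluation would typically fit a ranking model over many pairwise comparisons rather than report a raw win rate; the sketch keeps only the simplest form of each idea.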

Speaker: Nandan Thakur, University of Waterloo, Canada




Other Videos By Microsoft Research


2025-03-03 LLMs vs. Torch 1.5: Why Your Code Assistant Can't Keep Up
2025-02-25 Using LLMs for safe low-level programming | Microsoft Research Forum
2025-02-25 AutoGen v0.4: Reimagining the foundation of agentic AI for scale and more | Microsoft Research Forum
2025-02-25 Belief state transformers | Microsoft Research Forum
2025-02-25 Magma: A foundation model for multimodal AI Agents | Microsoft Research Forum
2025-02-25 Chimera: Accurate synthesis prediction by ensembling models with... | Microsoft Research Forum
2025-02-25 AI for Precision Health: Learning the language of nature and patients | Microsoft Research Forum
2025-02-25 Keynote: Multimodal Generative AI for Precision Health | Microsoft Research Forum
2025-02-21 WHAM Demonstrator tutorial
2025-02-07 Attestations over TLS 1.3 and ZKP
2025-01-02 Accelerating Multilingual RAG Systems
2024-12-30 Pronouns in the Workplace: Learning Inclusive Software Design from Real-World Experiences
2024-12-20 Culturally Aware Machines: Why and when are they useful?
2024-12-18 Embodied AI Workshop at CVPR 2024
2024-12-10 GASP: Gaussian Avatars with Synthetic Priors
2024-12-09 A Closer Look at Falcon
2024-12-09 Quantum Lattice Enumeration in Limited Depth, Fernando Virdia
2024-12-09 Enhancing Security of Bluetooth Secure Connections via Deferrable Authentication
2024-12-09 Improving the Security of United States Elections with Robust Optimization
2024-11-18 Introducing BiomedParse, a groundbreaking foundation model for biomedical image analysis
2024-11-11 Low latency carbon budget 2023