MARI Grand Seminar - Large Language Models and Low Resource Languages

Channel:

Subscribers:

351,000

Published on May 1, 2023 2:04:26 PM ● Video Link: https://www.youtube.com/watch?v=X7c0T7uwtkM

Duration: 1:47:05

1,327 views

Watch our two-hour grand seminar on Large Language Models and Low Resource Languages. The event included a keynote by Dr. Monojit Choudhury titled “Predicting, Explaining and Optimizing Performance of LLMs across Languages,” where he discussed whether massively multilingual language models (MMLM) can be leveraged to predict the accuracy of cross-lingual zero-shot and few-shot transfer for a task on target languages with little or no test data. He also gave an overview of Project LITMUS – Linguistically Informed Training and Testing of Multilingual Systems, which involved building several ML models for performance prediction and discuss the what was learnt about the factors that influence cross-lingual transfer.

The talk was followed by a panel discussion with experts from academia and research; including Dr. Monojit Chowdhury, Dr. Edward Ombui, Dr. Sunayana Sitaram, Dr. David Adelani, and moderated by Maxamed Axmed.

Keynote Abstract:

Predicting, Explaining and Optimizing Performance of LLMs across Languages

Given a massively multilingual language models (MMLM), can we predict the accuracy of cross-lingual zero-shot and few-shot transfer for a task on target languages with little or no test data? This seemingly impossible task, if solved, can have several potential benefits. First, we could estimate the performance of a model even in languages where a test set is not available, and/or building one is difficult. Second, one can predict training data configurations that would give certain desired performance across a set of languages, and accordingly strategize data collection plans; this in turn can lead to linguistically fair MMLM-based models. Third, as a byproduct, we would know which factors influence cross-lingual transfer. In this talk, I will give an overview of Project LITMUS – Linguistically Informed Training and Testing of Multilingual Systems, where we build several ML models for performance prediction; besides their applications, I will discuss what we learn about the factors that influence cross-lingual transfer.

Learn more about MARI: https://www.microsoft.com/en-us/research/group/microsoft-africa-research-institute-mari/

Other Videos By Microsoft Research

2023-08-09	Keypoint Detection for Measuring Body Size of Giraffes: Enhancing Accuracy and Precision
2023-08-04	Scalable and Efficient AI: From Supercomputers to Smartphones
2023-07-18	AI for Precision Health
2023-07-07	Multilingual Evaluation of Generative AI (MEGA)
2023-07-07	The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation...
2023-07-07	Privacy-Preserving Domain Adaptation of Semantic Parsers
2023-05-30	Microsoft’s Holoportation™ Communications Technology: Facilitating 3D Telemedicine
2023-05-05	Human-Centered AI: Ensuring Human Control While Increasing Automation
2023-05-03	Escapement: A Tool for Interactive Prototyping with Video via Sensor-Mediated Abstraction of Time
2023-05-03	AdHocProx: Sensing Mobile, Ad-Hoc Collaborative Device Formations using Dual Ultra-Wideband Radios
2023-05-01	MARI Grand Seminar - Large Language Models and Low Resource Languages
2023-04-27	Innovating through uncertainty: Getting super curious and combining disparate elements
2023-04-13	WiDS Career Panel: Gabriela de Queiroz, Juliet Hougland (Netflix), and Samantha Sifleet
2023-03-24	Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing
2023-03-23	Foundation models and the next era of AI
2023-02-24	Behind the label: Glimpses of data labelling labours for AI
2023-02-17	Art of doing disruptive research
2023-02-17	Fighting the Global Social Media Infodemic: from Fake News to Harmful Content
2023-02-15	Responsible AI Tracker Tour
2023-02-14	Automating Commonsense Reasoning
2023-02-13	Reinforcement Learning (RL) Open Source Fest 2022 Final Project Presentations

Channel	Latest
domisumReplay: Renekton	6 hours ago
Mehmet Uzun	6 hours ago
domisumReplay: Syndra	6 hours ago
domisumReplay: Mordekaiser	6 hours ago
Shhoto	7 hours ago
DismArchus	7 hours ago
Zanginary	7 hours ago
Baba Behwish	7 hours ago
LegitKorea	7 hours ago
domisumReplay: Aatrox	7 hours ago
CamXPetra	7 hours ago
youRINK 🎶	7 hours ago
domisumReplay: Akali	7 hours ago
domisumReplay: Sett	7 hours ago
domisumReplay: Kayle	7 hours ago
iTownGamePlay Terror&Diversión	7 hours ago
David Voices	7 hours ago
Nickich	8 hours ago
Regiz	8 hours ago
PUBG MOBILE Esports MEA	8 hours ago
League of SUPPORT - LOL Replays	8 hours ago
Happy Animes Recaps	8 hours ago
HeroxHeroTV	8 hours ago
SiIvaGunner	8 hours ago
Oh Shiitake Mushrooms	8 hours ago