Behind the label: Glimpses of data labelling labours for AI

Video Link: https://www.youtube.com/watch?v=YdH7DSG96ZM



Duration: 51:29


ChatGPT is the latest AI system to make the headlines for its remarkable computational capabilities. Lesser known and rarely acknowledged are the human labours involved in training and supporting these celebrated AI systems. Thousands of workers, particularly in Global South regions, create training datasets, validate model outcomes and mimic computational responses to sustain AI’s research, development and use. Yet little is known about what their work entails. What do data labellers do when they label data for AI?

Drawing on findings from an ethnographic study of data labelling in India, this talk offers insights into the everyday work practices of data labellers and into the organisational hierarchies, norms, and values that are caught in global flows of resources, rhetoric, and relations of power. We trace these practices, norms and frictions to better understand their influence on everyday annotation work, and to answer an important question: why should we, AI researchers and practitioners, concern ourselves with these seemingly distant realities?

Learn more about MARI: https://www.microsoft.com/en-us/research/group/microsoft-africa-research-institute-mari/




Other Videos By Microsoft Research


2023-07-07 Privacy-Preserving Domain Adaptation of Semantic Parsers
2023-05-30 Microsoft’s Holoportation™ Communications Technology: Facilitating 3D Telemedicine
2023-05-05 Human-Centered AI: Ensuring Human Control While Increasing Automation
2023-05-03 Escapement: A Tool for Interactive Prototyping with Video via Sensor-Mediated Abstraction of Time
2023-05-03 AdHocProx: Sensing Mobile, Ad-Hoc Collaborative Device Formations using Dual Ultra-Wideband Radios
2023-05-01 MARI Grand Seminar - Large Language Models and Low Resource Languages
2023-04-27 Innovating through uncertainty: Getting super curious and combining disparate elements
2023-04-13 WiDS Career Panel: Gabriela de Queiroz, Juliet Hougland (Netflix), and Samantha Sifleet
2023-03-24 Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing
2023-03-23 Foundation models and the next era of AI
2023-02-24 Behind the label: Glimpses of data labelling labours for AI
2023-02-17 Art of doing disruptive research
2023-02-17 Fighting the Global Social Media Infodemic: from Fake News to Harmful Content
2023-02-15 Responsible AI Tracker Tour
2023-02-14 Automating Commonsense Reasoning
2023-02-13 Reinforcement Learning (RL) Open Source Fest 2022 Final Project Presentations
2023-02-13 Disaggregated model evaluation and comparison
2023-02-13 Neural Interfaces - Towards a new generation of human-computer interface
2023-02-13 Galea: The Bridge Between Mixed Reality and Neurotechnology
2023-02-10 Current and Future Application of BCIs
2023-02-01 Seeing AI app - Creating a Route