AI Testing and Evaluation: Learnings from cybersecurity

Channel:

Subscribers:

351,000

Published on July 15, 2025 7:33:43 PM ● Video Link: https://www.youtube.com/watch?v=XVkvLpo0R1A

Duration: 0:00

433 views

Drawing on his previous work as the UK’s cybersecurity chief, Professor Ciaran Martin explores differentiated standards and public-private partnerships in cybersecurity, and Microsoft’s Tori Westerhoff examines the insights through an AI red-teaming lens.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-learnings-from-cybersecurity/
Listen to AI Testing and Evaluation: Learnings from Science and Industry series: https://www.microsoft.com/en-us/research/story/ai-testing-and-evaluation-learnings-from-science-and-industry/

Other Videos By Microsoft Research

2025-08-05	VeriTrail: Detect hallucination and trace provenance in AI workflows
2025-07-31	Computational models for brain science
2025-07-30	VoluMe: Authentic 3D Video Calls from Live Gaussian Splat Prediction
2025-07-28	How I became a StoryTeller (and how YOU can too)
2025-07-28	Make some noise: Teaching the language of audio to an LLM using sound tokens
2025-07-28	Building Better Language Models Through Global Understanding
2025-07-24	Navigating medical education in the era of generative AI
2025-07-22	DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
2025-07-21	AI Testing and Evaluation: Reflections
2025-07-20	Intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact Models
2025-07-15	AI Testing and Evaluation: Learnings from cybersecurity
2025-07-10	Scalable emulation of protein equilibrium ensembles with BioEmu
2025-07-10	How AI will accelerate biomedical research and discovery
2025-07-09	Introducing Microsoft AI Economy Institute
2025-07-07	AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices
2025-07-03	Against Softmaxing Culture: Understanding Relational Practices in Expert and Ordinary Forms of Work
2025-06-30	AI Testing and Evaluation: Learnings from genome editing
2025-06-23	AI Testing and Evaluation: Learnings from Science and Industry
2025-06-18	Precio: Private Aggregate Measurement via Oblivious Shuffling
2025-06-18	Sandi: A System for Accountability
2025-06-18	DFT for drug and material discovery