AI Testing and Evaluation: Reflections
Video Link: https://www.youtube.com/watch?v=7q3BN24qgxg
In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.
Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/
Listen to the AI Testing and Evaluation: Learnings from Science and Industry series: https://www.microsoft.com/en-us/research/story/ai-testing-and-evaluation-learnings-from-science-and-industry/