'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=OcemDXpRymc



Duration: 1:16:01
390 views
10


Performing well on standardized exams has been a longstanding challenge for AI. Even in 2016, the best AI system achieved less than 60% on an 8th Grade science exam challenge. Recently, AI2's Aristo system achieved surprising success on the Grade 8 New York Regents Science Exams, scoring over 90% on the exam's non-diagram, multiple choice (NDMC) questions. How was it able to do this, and what mistakes does it still make? In this talk, I will overview Aristo and the impact of its various components, in particular, its new language model (LM) solvers. I will also present several analyses of what is going on inside Aristo, in particular probing how much the LM solvers go beyond simple pattern matching, and what kinds of errors still occur. Finally, I will speculate on the larger quest towards knowledgeable machines that can reason, explain, and interact, and what additional capabilities are needed to reach this broader goal.

Talk slides: https://www.microsoft.com/en-us/research/uploads/prod/2020/01/An-Overview-of-the-Aristo-Project.pdf

See more on this and other talks at Microsoft Research: https://www.microsoft.com/en-us/research/video/f-to-a-on-the-n-y-regents-science-exams-an-overview-of-the-aristo-project/




Other Videos By Microsoft Research


2020-02-19Democratizing data, thinking backwards and setting North Star goals with Dr. Donald Kossmann
2020-02-19Behind the scenes on Team Explorer’s practice run at Microsoft for the DARPA SubT Urban Challenge
2020-02-12Microsoft Scheduler and dawn of Intelligent PDAs with Dr. Pamela Bhattacharya | Podcast
2020-02-05Responsible AI with Dr. Saleema Amershi | Podcast
2020-02-03Perspectives on Cross-Validation
2020-01-30Data Science Summer School 2019 - Replicating "An Empirical Analysis of Racial Differences in Po..."
2020-01-29Going deep on deep learning with Dr. Jianfeng Gao | Podcast
2020-01-22Innovating in India with Dr. Sriram Rajamani [Podcast]
2020-01-17Underestimating the challenge of cognitive disabilities (and digital literacy)
2020-01-17Understanding Knowledge Distillation in Neural Sequence Generation
2020-01-17'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
2020-01-07Private AI Bootcamp Keynote – Sreekanth Kannepalli
2020-01-07Introduction to CKKS (Approximate Homomorphic Encryption)
2020-01-07Private AI Bootcamp Competition: Team 3
2020-01-07Conversations Based on Search Engine Result Pages
2020-01-07The Ethical Algorithm
2020-01-07Efficient Forward Architecture Search
2020-01-03Fireside Chat with Anca Dragan
2020-01-02Precision Health Engineering - Designing Health-Centered Mundane Technology to Increase Adherence
2019-12-30Checkpointing the Un-checkpointable: the Split-Process Approach for MPI and Formal Verification
2019-12-25Private AI Bootcamp Brainstorming Session + Q&A



Tags:
standardized exams for AI
Aristo Project
language model solvers
knowledgeable machines
Peter Clark
Microsoft Research