Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | AISC
Subscribers:
22,300
Published on ● Video Link: https://www.youtube.com/watch?v=A0od6RosVSA
Speaker(s): Marco Tulio Ribeiro
Facilitator(s): Royal Sequiera
Find the recording, slides, and more info at https://ai.science/e/check-list-beyond-accuracy-behavioral-testing-of-nlp-models-with-check-list--lva9YvNDiwob0DFAE26o
Motivation / Abstract
- The paper proposes CheckList, a novel behavioural testing methodology
- CheckList provides you tools that will you build software engineering like test cases at scale!
- Using CheckList, the paper identifies critical failures in both commercial
and state-of-the-art models
- This paper won the overall Best Paper Award at ACL'20
------
#AISC hosts 3-5 live sessions like this on various AI research, engineering, and product topics every week! Visit https://ai.science for more details