Final intern talk: Improving Frechet Audio Distance for Generative Music Evaluation

Published on 2023-09-22
Video Link: https://www.youtube.com/watch?v=7Z4bIQHvW5w
Duration: 41:10
Speaker: Azalea Gui
Host: Hannes Gamper

As generative music models become more powerful and popular, there is a growing need for robust objective metrics of music quality that correlate with human perception. The Frechet Audio Distance (FAD) is a commonly used metric for this purpose. However, its performance may be hampered by several issues: sample size bias, limitations of the underlying audio embeddings, and the use of low-quality reference sets. We propose reducing sample size bias by extrapolating unbiased scores as the sample size approaches infinity. A comparison of various audio embeddings reveals that some are better suited than others for deriving FAD scores that capture aspects of musical or acoustic quality. Finally, our experiments underscore the importance of choosing a diverse and high-quality reference dataset for FAD calculation. Listening test results indicate that unbiased FAD scores calculated using suitable embeddings and reference music improve correlation with human ratings of musical and acoustic quality.
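The extrapolation step mentioned above can be illustrated with a minimal sketch. The abstract does not state the functional form used, so this example assumes the score is approximately linear in 1/N (N being the number of evaluated samples): FAD is computed at several sample sizes, a line is fitted against 1/N, and the intercept is taken as the estimate for N approaching infinity. The function name `extrapolate_score` and its arguments are hypothetical and not taken from the talk.

```python
from statistics import fmean


def extrapolate_score(sample_sizes, scores):
    """Least-squares fit of score = slope * (1 / N) + intercept.

    Assumption (not stated in the abstract): the bias shrinks linearly
    in 1/N, so the intercept estimates the score as N -> infinity.
    Returns (intercept, slope).
    """
    if len(sample_sizes) != len(scores) or len(sample_sizes) < 2:
        raise ValueError("need at least two (sample size, score) pairs")

    # Regress on x = 1/N so that N -> infinity corresponds to x = 0.
    xs = [1.0 / n for n in sample_sizes]
    x_mean = fmean(xs)
    y_mean = fmean(scores)

    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("sample sizes must not all be identical")
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, scores))

    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope
```

In use, each score would come from an FAD computation on a random subset of N generated clips against a fixed reference set, and the returned intercept would serve as the bias-reduced score. The slope indicates how strongly the metric depends on sample size under this linear assumption.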



