Research intern talk: Unified speech enhancement approach for speech degradation & noise suppression

Video Link: https://www.youtube.com/watch?v=_ggfv6eMIJs



Duration: 1:08:45
359 views


Speaker: Khandokar Md. Nayem
Host: Sebastian Braun

Speech enhancement approaches generally focus on removing additive noise and reverberation, which adversely affect overall speech quality and intelligibility. Another group of signal degradations, such as clipping, bandwidth limitation, and codec degradation, can occur due to poor recording hardware, network transmission, and other pre-processing. These degradations substantially impair intelligibility and speech quality. In this work, we deploy a convolutional recurrent network to remove these speech degradations in conjunction with the noise suppression task, and we propose cascaded and end-to-end approaches. We compare complex mask and direct spectrum estimation approaches for this task using a small, real-time capable DNN. Overall, we propose a cascaded processing approach that addresses the distortion types differently and enables task-tailored modular processing.
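
To make the terms in the abstract concrete, here is a minimal toy sketch in plain Python. It is not the network or the pipeline from the talk. The function names (`apply_complex_mask`, `hard_clip`, `cascade`) and all numbers are hypothetical, and an oracle mask computed from known clean values stands in for what a trained model would predict. In the complex-mask approach the enhanced spectrum is the product of a mask and the noisy spectrum, whereas in direct spectrum estimation the model output would itself be taken as the enhanced spectrum.

```python
# Toy illustration only: hand-written complex numbers stand in for STFT bins,
# and an oracle mask stands in for a network prediction.

def apply_complex_mask(noisy_bins, mask):
    """Complex-mask approach: enhanced bin = mask * noisy bin (complex product)."""
    return [m * x for m, x in zip(mask, noisy_bins)]

def hard_clip(samples, limit):
    """Simulate a clipping degradation by limiting samples to [-limit, limit]."""
    return [max(-limit, min(limit, s)) for s in samples]

def cascade(signal, stages):
    """Cascaded processing: run task-specific stages one after another."""
    for stage in stages:
        signal = stage(signal)
    return signal

# Hypothetical clean and noise spectra (three frequency bins).
clean = [1 + 2j, -0.5 + 0.25j, 3 - 1j]
noise = [0.5 - 0.5j, 0.25 + 0.5j, -1 + 0.5j]
noisy = [s + n for s, n in zip(clean, noise)]

# Oracle mask S / X; a real system would have to estimate this from `noisy`.
ideal_mask = [s / x for s, x in zip(clean, noisy)]
enhanced = apply_complex_mask(noisy, ideal_mask)

# Time-domain clipping example: samples beyond +/-0.8 are flattened.
clipped = hard_clip([0.2, 0.9, -1.4, 0.5], limit=0.8)

# Two placeholder mask stages chained in a cascade (assumed stage order).
two_stage = cascade(noisy, [
    lambda x: apply_complex_mask(x, [0.5] * len(x)),
    lambda x: apply_complex_mask(x, [2.0] * len(x)),
])
```

With the oracle mask, `enhanced` recovers `clean` up to floating-point error. An end-to-end variant would replace the list of stages with a single stage that handles all distortion types at once.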

Learn more: https://www.microsoft.com/en-us/research/video/research-intern-talk-unified-speech-enhancement-approach-for-speech-degradations-noise-suppression/



