MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges

Video Link: https://www.youtube.com/watch?v=Yo0Mr4Fq17I



Duration: 56:14

Speaker(s): Eloi Moliner
Host: Hannes Gamper

Speech reverberation control involves the manipulation of acoustic characteristics in speech recordings, including tasks like speech dereverberation or reverberation time reduction. Diffusion implicit bridges are a recently proposed domain translation technique based on diffusion models and entropy-regularized optimal transport. They enable a bijective mapping between samples from different distributions by bridging through a prior Gaussian distribution. Diffusion bridges have the advantage of not requiring paired data samples for training and are optimized with a simple and stable Euclidean objective. This study applies diffusion implicit bridges to unsupervised speech reverberation control. We identify how a naive implementation of this method results in numerous undesired artifacts, such as speaker identity changes or babbling, and attribute them to the curvature in the sampling trajectories. To mitigate these issues, we propose training the model with a chunk-based optimal transport coupling between speech and noise samples, which significantly straightens the learned trajectories and improves the semantic consistency of the speech content. We study the performance of different configurations of the model through a comprehensive objective evaluation. To demonstrate the versatility of the method, we additionally conduct experiments on other tasks such as speech declipping or guitar distortion removal.
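
As a rough illustration of the optimal transport coupling idea mentioned in the abstract, the Python sketch below pairs a small batch of "speech chunks" with Gaussian noise samples so that the total squared Euclidean distance is minimal, instead of pairing them at random. The abstract does not say how the chunk-based coupling is implemented in the talk, so everything here is an assumption made for illustration: the function names, the random vectors standing in for speech chunks, and the brute-force search over permutations.

```python
import itertools
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def ot_coupling(chunks, noises):
    """Pair chunks[i] with noises[perm[i]] so the total squared distance is minimal.

    For two equal-size batches with uniform weights, an exact optimal
    transport plan is a permutation. This hypothetical helper finds it by
    brute force, which is only viable for very small batches.
    """
    n = len(chunks)
    best_perm, best_cost = None, float("inf")
    for perm in itertools.permutations(range(n)):
        cost = sum(squared_distance(chunks[i], noises[perm[i]]) for i in range(n))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return list(best_perm), best_cost


rng = random.Random(0)
dim, batch = 8, 5
# Random vectors stand in for fixed-length speech chunks (assumption).
chunks = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(batch)]
# Samples from the Gaussian prior.
noises = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(batch)]

perm, ot_cost = ot_coupling(chunks, noises)
# Cost of the default independent pairing (chunk i with noise i).
independent_cost = sum(squared_distance(chunks[i], noises[i]) for i in range(batch))

print("coupling:", perm)
print("OT cost is no larger than independent pairing:", ot_cost <= independent_cost)
```

Training on pairs that are already close to each other is one plausible reading of why such a coupling would straighten the learned trajectories. For realistic batch sizes, a polynomial-time assignment solver such as SciPy's `scipy.optimize.linear_sum_assignment` would replace the brute-force search.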

See more at https://www.microsoft.com/en-us/research/video/msr-talk-unsupervised-speech-reverberation-control-with-diffusion-implicit-bridges/




Other Videos By Microsoft Research


2024-07-12 Advances in Natural Language Generation for Indian Languages
2024-06-06 Making Sentence Embeddings Robust to User-Generated Content
2024-06-06 Keynote: Building Globally Equitable AI
2024-06-04 AutoGen Update: Complex Tasks and Agents
2024-06-04 MatterGen: A Generative Model for Materials Design
2024-06-04 Driving Industry Evolution: Exploring the Impact of Generative AI on Sector Transformation
2024-06-04 Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
2024-06-04 Panel Discussion: Generative AI for Global Impact: Challenges and Opportunities
2024-06-04 Keynote: Building Globally Equitable AI
2024-05-14 Join us for Research Forum on June 4
2024-05-14 MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges
2024-05-10 Unlocking Real world solutions with AI – Chris Bishop
2024-05-10 How will AI transform precision medicine? – Ava Amini
2024-05-03 AI Case Studies for Natural Science Research with Bonnie Kruft
2024-04-29 AI For All: Embracing Equity for All
2024-04-24 TrustRate: A Decentralized Platform for Hijack-Resistant Anonymous Reviews
2024-04-22 Women in Data Science Fireside Chat with Ilda Ladeira, Karin Kimbrough and Lisa Cohen
2024-04-09 Combining Machine Learning and Bayesian networks for Decision Support in Arrythmia Diagnosis
2024-03-20 Strategic Subset Selection in Satellite Imagery: Machine Vision Insights
2024-03-11 Generative AI and Plural Governance: Mitigating Challenges and Surfacing Opportunities
2024-03-11 GigaPath: Foundation Model for Digital Pathology