Multi-microphone Dereverberation and Intelligibility Estimation in Speech Processing

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=Lb2XJbhzz58



Duration: 1:26:28
638 views
9


When speech signals are captured by one or more microphones in realistic acoustic environments, they will be contaminated by noise due to surrounding sound sources and by reverberation due to reflections off walls and other surfaces. Noise and reverberation can have detrimental effects on the perceptual experience of a listener and, in more severe cases, they can cause intelligibility loss. Many signal processing applications, such as, speech codecs and speech recognizers deteriorate rapidly in performance as noise and reverberation levels increase. Consequently, the challenging problems of noise reduction and dereverberation have received a great deal of attention in research, especially, with the advent of mobile telephony and voice over IP. Multi-microphone speech dereverberation forms the topic of the first part of this talk. Two alternative methods will be introduced. The first method is based on the source-filter model of speech production while the second approaches the problem through blind identification and inversion of the room impulse responses. Simulation results will be presented to demonstrate the methods and to facilitate a comparison between them in terms of dereverberation performance. In the second part, the talk will focus on subject-based and automatic estimation of intelligibility in noisy and processed speech. In particular, the Bayesian Adaptive Speech Intelligibility Estimation (BASIE) method will be presented. BASIE is a tool for rapid subject-based estimation of a given speech reception threshold (SRT) and the slope at that threshold of multiple psychometric functions for speech intelligibility in noise. The core of BASIE is an adaptive Bayesian procedure, which adjusts the signal-to-noise ratio at each subsequent stimulus such that the expected variance of the threshold and slope estimates are minimised. Furthermore, strategies for using BASIE to evaluate the effects of speech processing algorithms on intelligibility and two illustrative examples for different noise reduction methods with supporting listening experiments will be given.




Other Videos By Microsoft Research


2016-08-16Abelian Sandpiles and the Harmonic Model
2016-08-16Full-rank Gaussian Modeling of Convolutive Audio Mixtures Applied to Source Separation
2016-08-16MADDER and Self-Tuning Data Analytics on Hadoop with Starfish
2016-08-16Approximation Algorithms for Correlated Knapsacks and Non-Martingale Bandits
2016-08-16ChatArt: Interactive Pictographic Chat
2016-08-16Nonnegative k-sums, fractional covers, and probability of small deviations
2016-08-16Injective Tensor Norms: Hardness and Reductions
2016-08-16Monitoring Untrusted Modern Applications with Collective Record and Replay
2016-08-16Practical Boogie (on the example of VCC)
2016-08-16Coherent Depth in Stereo Vision
2016-08-16Multi-microphone Dereverberation and Intelligibility Estimation in Speech Processing
2016-08-16From Personalized Retinal Image Mapping to Large Scale Parallel Image Processing
2016-08-16Coding4Fun XAPfest!
2016-08-16Your Abstractions are Worth Powerless! Non-Volatile Storage and Computation on Embedded Devices
2016-08-16Near Optimal Online Algorithms and Fast Approximation Algorithms for Resource Allocation Problems
2016-08-16Interpreting the Community: Information Practices and/for Deviance
2016-08-16Pretty Good Democracy for a variety of voting schemes
2016-08-16Learning Valuation Functions
2016-08-16Applying Semantic Analyses to Content-based Recommendation and Document Clustering
2016-08-16Fusing Mobile, Sensor, and Social Computing in the Cloud To Enable Context-Aware Applications
2016-08-16The Past, Present, and Future of Video Telephony



Tags:
microsoft research