Some Recent Advances in Gaussian Mixture Modeling for Speech Recognition

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=h7CQm7oRQGY



Duration: 1:15:04
1,297 views
6


State-of-the-art Hidden Markov Model (HMM) based speech recognition systems typically use Gaussian Mixture Models (GMMs) to model the acoustic features associated with each HMM state. Due to computational, storage and robust estimation considerations the covariance matrices of the Gaussians in these GMMs are typically diagonal. In this talk I will describe several new techniques to model the acoustic features associated with an HMM state better - subspace constrained GMMs (SCGMMs), non-linear volume-preserving acoustic feature space transformations etc. Even with better models, one has to deal with mismatches between the training and test conditions. This problem can be addressed by adapting either the acoustic features or the acoustic models to reduce the mismatch. In this talk I will present several approaches to adaptation - FMAPLR (a variant of FMLLR that works well with very little adaptation data), adaptation of the front-end parameters, adaptation of SCGMMs, etc. While the ideas presented are explored and evaluated in the context of speech recognition, the talk should appeal to anyone with an interest in statistical modeling.




Other Videos By Microsoft Research


2016-09-06Rock 'n Roll : Earthquake & Disaster Preparedness
2016-09-06Understanding Customers: Shaping Our Future through Understanding Social Change
2016-09-06Fast Database and Data Streaming Operations using Graphics Processors
2016-09-06Hyperparameter and Kernel Learning for Graph Based Semi-Supervised Classification
2016-09-06Multi-Engine Machine Translation Guided by Explicit Word Matching
2016-09-06Using Compression Models to Filter Spam; Exploiting Structural Information for Categorization
2016-09-06The Man Who Knew Too Much: Alan Turing and the Invention of the Computer [1/4]
2016-09-06Estimation of intrinsic dimensionality using high-rate vector quantization
2016-09-06Abducted: How People Come to Believe They Were Kidnapped by Aliens [1/11]
2016-09-06Spontaneous Speech: Challenges and Opportunities for Parsing
2016-09-06Some Recent Advances in Gaussian Mixture Modeling for Speech Recognition
2016-09-06How to Survive a Robot Uprising: Tips to Defend Yourself Against The Coming Rebellion
2016-09-06Body for Life for Women
2016-09-06A Low-level Approach to Reuse for Programming-Language Infrastructure
2016-09-06Sensor Networks Workshop 05 - Short Talks (See Abstract)
2016-09-06Sensor Networks Workshop 05 - Keynote
2016-09-06The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture
2016-09-06Zero Configuration Networking with Bonjour [1/2]
2016-09-06Correlation decay in statistical physics and applications to counting problems
2016-09-06Spook: Science Tackles the Afterlife
2016-09-06Garbage-First Garbage Collection (and a Related Compiler Optimizaton) [1/2]



Tags:
microsoft research