Multi-rate neural networks for efficient acoustic modeling

Subscribers: 343,000
Published on: 2016-06-22
Video Link: https://www.youtube.com/watch?v=9uA0577ffNE
Duration: 1:27:42
Views: 87

In sequence recognition, the problem of long-span dependencies in input sequences is typically tackled with recurrent neural network architectures, while robustness to sequential distortions is achieved by training on data representative of a variety of these distortions. However, both of these solutions substantially increase training time, making low computational complexity during training critical for acoustic modeling. This talk proposes the use of multi-rate neural network architectures to satisfy this requirement of computational efficiency. In these architectures the network is partitioned into groups of units operating at different sampling rates; because certain groups are evaluated only once every few time steps, the computational cost is reduced. The talk will focus on the multi-rate feed-forward convolutional architecture. It will present results on several large vocabulary continuous speech recognition (LVCSR) tasks, with training data ranging from 3 to 1800 hours, to show the effectiveness of this architecture in efficiently learning wide temporal dependencies in both small and large data scenarios. Further, it will discuss the use of this architecture for robust acoustic modeling in far-field environments; this model was shown to provide state-of-the-art results in the ASpIRE far-field recognition challenge. The talk will also discuss some preliminary results for multi-rate recurrent neural network based acoustic models.
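As a rough illustration of the multi-rate idea described above, below is a minimal NumPy sketch of a feed-forward network in which a higher layer is evaluated only once every few input frames. All layer sizes, splice offsets, and sampling rates in the sketch are illustrative assumptions rather than details taken from the talk; it is meant only to show why skipping evaluations at intermediate frames lowers the computational cost.

import numpy as np

def splice(frames, t, offsets):
    # Concatenate the frames at t + offset for each offset, clamping indices at the sequence edges.
    T = frames.shape[0]
    return np.concatenate([frames[min(max(t + o, 0), T - 1)] for o in offsets])

def multirate_forward(x, layers):
    # x: (T, feat_dim) array of input frames.
    # layers: list of dicts with weights 'W', bias 'b', splice 'offsets', and sampling 'rate'.
    # A layer's affine transform is computed only at frames where t % rate == 0;
    # the remaining frames reuse the most recent output, which is where the saving comes from.
    h = x
    for layer in layers:
        W, b = layer['W'], layer['b']
        offsets, rate = layer['offsets'], layer['rate']
        T = h.shape[0]
        out = np.zeros((T, W.shape[0]))
        last = None
        for t in range(T):
            if t % rate == 0 or last is None:
                last = np.maximum(0.0, W @ splice(h, t, offsets) + b)  # ReLU unit
            out[t] = last
        h = out
    return h

# Toy usage: 100 frames of 40-dimensional features; the second layer runs at 1/3 of the input frame rate.
rng = np.random.default_rng(0)
x = rng.standard_normal((100, 40))
layers = [
    {'W': 0.1 * rng.standard_normal((64, 40 * 3)), 'b': np.zeros(64), 'offsets': (-1, 0, 1), 'rate': 1},
    {'W': 0.1 * rng.standard_normal((64, 64 * 2)), 'b': np.zeros(64), 'offsets': (-3, 3), 'rate': 3},
]
print(multirate_forward(x, layers).shape)  # (100, 64)

In this sketch the second layer performs its matrix-vector products at only about a third of the frames; real multi-rate systems typically skip the intermediate frames entirely rather than holding the last value, but the source of the computational saving is the same.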




Other Videos By Microsoft Research


2016-06-22 Oral Session: Solving Random Quadratic Systems of Equations-Nearly as Easy as Solving Linear Systems
2016-06-22 Towards Understandable Neural Networks for High Level AI Tasks - Part 7
2016-06-22 The Once and Future Internet
2016-06-22 The First Order World of Galton-Watson Trees
2016-06-22 Verasco, a formally verified C static analyzer
2016-06-22 Symposium: Deep Learning - Max Jaderberg
2016-06-22 Satisfiability of Ordering CSPs Above Average Is Fixed-Parameter Tractable
2016-06-22 Symposium: Deep Learning - Harri Valpola
2016-06-22 Machine Learning as Creative Tool for Designing Real-Time Expressive Interactions
2016-06-22 Symposium: Deep Learning - Sergey Ioffe
2016-06-22 Multi-rate neural networks for efficient acoustic modeling
2016-06-22 Robust Spectral Inference for Joint Stochastic Matrix Factorization and Topic Modeling
2016-06-22 Computational Limits in Statistical Inference: Hidden Cliques and Sum of Squares
2016-06-22 Extreme Classification: A New Paradigm for Ranking & Recommendation
2016-06-22 A Lasserre-Based (1+epsilon)-Approximation for Makespan Scheduling with Precedence Constraints
2016-06-22 Oral Session: Learning Theory and Algorithms for Forecasting Non-stationary Time Series
2016-06-22 Recent Developments in Combinatorial Optimization
2016-06-22 Invited Talks: Computational Principles for Deep Neuronal Architectures
2016-06-22 Coalescence in Branching Trees and Branching Random Walks
2016-06-22 Oral Session: Randomized Block Krylov Methods
2016-06-22 Oral Session: Fast Convergence of Regularized Learning in Games



Tags:
microsoft research
acoustic modeling