Multi-rate neural networks for efficient acoustic modeling

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=go4B_cNCv-Y



Duration: 1:27:42
466 views
2


In sequence recognition, the problem of long-span dependency in input sequences is typically tackled using recurrent neural network architectures, and robustness to sequential distortions is achieved using training data representative of a variety of these distortions. However, both these solutions substantially increase the training time. Thus low computation complexity during training is critical for acoustic modeling. This talk proposes the use of multi-rate neural network architectures to satisfy the design requirement of computational efficiency. In these architectures the network is partitioned into groups of units, operating at various sampling rates. As the network evaluates certain groups only once every few time steps, the computational cost is reduced. This talk will focus on the multi-rate feed-forward convolutional architecture. It will present results on several large vocabulary continuous speech recognition (LVCSR) tasks with training data ranging from 3 to 1800 hours to show the effectiveness of this architecture in efficiently learning wider temporal dependencies in both small and large data scenarios. Further it will discuss the use of this architecture for robust acoustic modeling in far-field environments. This model was shown to provide state-of-art results in the ASpIRE far-field recognition challenge. This talk will also discuss some preliminary results of multi-rate recurrent neural network based acoustic models.




Other Videos By Microsoft Research


2016-06-13Towards Understandable Neural Networks for High Level AI Tasks - Part 3
2016-06-13Artist in Residence (formerly Studio99) Presents: Michael Gough and "Drawing as Literacy."
2016-06-13Towards Cross-fertilization Between Propositional Satisfiability and Data Mining
2016-06-13Making Objects Count: A Shape Analysis Framework for Proving Polynomial Time Termination
2016-06-13Human factors of software updates
2016-06-13Machine-Checked Correctness and Complexity of a Union-Find Implementation
2016-06-13Applications of 3-Dimensional Spherical Transforms to Acoustics and Personalization of Head-related
2016-06-13Network Protocols: Myths, Missteps, and Mysteries
2016-06-13Optimal and Adaptive Online Learning
2016-06-13Speaker Diarization: Optimal Clustering and Learning Speaker Embeddings
2016-06-13Multi-rate neural networks for efficient acoustic modeling
2016-06-13Unsupervised Latent Faults Detection in Data Centers
2016-06-13System and Toolchain Support for Reliable Intermittent Computing
2016-06-13Gates Foundation Presents: Crucial Areas of Fintech Innovation for the Bottom of the Pyramid
2016-06-13Social Computing Symposium 2016: Harassment, Threats, Trolling Online, Diversity in Gaming is Vital
2016-06-13Bringing Harmony Through AI and Economics
2016-06-13Approximating Integer Programming Problems by Partial Resampling
2016-06-13A Lasserre-Based (1+epsilon)-Approximation for Makespan Scheduling with Precedence Constraints
2016-06-13Towards Understandable Neural Networks for High Level AI Tasks - Part 7
2016-06-13Verasco, a formally verified C static analyzer
2016-06-13Future Microprocessors Driven by Dataflow Principles



Tags:
microsoft research
neural networks
computer systems and networking
hardware and devices
sequence recognition
acoustic modeling