Not All Frames Are Created Equal: Temporal Sparsity for Robust and Efficient ASR

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=D89u5p1lsek



Duration: 1:31:32
37 views
0


Traditional frame-based speech recognition technologies build sequence models of temporally dense vector time series representations that account for the entirety of the speech signal. However, under non-stationary distortion, the burden of accounting for everything can propagate errors beyond the corrupted frames. I will advocate an alternative strategy where the speech signal is instead (i) transformed into a sparse set of temporal point patterns of the most salient acoustic events and (ii) decoded using explicit models of the temporal statistics of these patterns. Formalized under a point process model framework, the proposed sparse methods exhibit sufficiency for clean speech recognition, provide a new avenue to improve noise robustness, and hold potential for significantly increased computational efficiency over their frame-based counterparts.




Other Videos By Microsoft Research


2016-08-172010 Microsoft Research eScience Workshop - Session MA4 - Health & Wellbeing I
2016-08-172010 Microsoft Research eScience Workshop - Session MA1 - Senses Across Scales
2016-08-172010 Microsoft Research eScience Workshop - Session MA6 - Complex Biological Systems in Action
2016-08-17Rank Bounds for Design Matrices
2016-08-17Price of Anarchy in Adword Auctions
2016-08-172010 Microsoft Research eScience Workshop - Jim Gray eScience Award Presentation
2016-08-17Hera-JVM: A Runtime System for Heterogeneous Multi-Core Architectures
2016-08-17UCSD Distributed Cognition and Human-Computer Interaction Lab Research
2016-08-17Hardware Trojans in Wireless Cryptographic Integrated Circuits
2016-08-17Stable, Fair Societies As The Natural Product of Local Exchange Networks
2016-08-17Not All Frames Are Created Equal: Temporal Sparsity for Robust and Efficient ASR
2016-08-17California Fault Lines: Understanding the Causes and Impact of Network Failures
2016-08-17e-Heritage Projects in Italy, Cambodia, and Japan: Lesson learned
2016-08-17Self-Stabilizing Autonomic Recoverers
2016-08-17Embedding spanning trees in random graphs
2016-08-17Real-time estimation of distributed parameters systems: application to traffic monitoring
2016-08-17A Practical-Time Related-Key Attack on the KASUMI Cryptosystem Used in GSM and 3G Telephony
2016-08-17Spectral graph sparsification Part 2: [An O(mlog^2 n) algorithm for solving SDD systems]
2016-08-17MSR New England Social Media Research
2016-08-17Exploiting Sparsity in Unsupervised Classification
2016-08-17Computational Social Science in Medicine



Tags:
microsoft research