Recent Efforts Towards Efficient And Scalable Neural Waveform Coding

Subscribers:
351,000
Published on ● Video Link: https://www.youtube.com/watch?v=ybEwJKTaY0k



Duration: 50:36
1,226 views
34


Acoustic signal compression techniques, converting the floating-point waveform into the bitstream representation, serve a cornerstone in the current data storage and telecommunication infrastructure. The rise of data-driven approaches for acoustic coding systems brings in not only potentials but also challenges, among which the model complexity is a major concern: on the one hand, this general-purpose computational paradigm features the performance superiority; on the other hand, most codecs are deployed on low power devices which barely afford the overwhelming computational overhead. In this talk, I will introduce several of our recent efforts towards a better trade-off between performance and efficiency for neural speech/audio coding. I will present on cascaded cross-module residual learning to conduct multistage quantization in deep learning techniques; in addition, a collaborative quantization scheme will be talked about to simultaneously binarize linear predictive coefficients and the corresponding residuals. If time permits, a novel perceptually salient objective function with a psychoacoustical calibration will also be discussed.

Learn more about this and other talks at Microsoft Research: https://www.microsoft.com/en-us/research/video/recent-efforts-towards-efficient-and-scalable-neural-waveform-coding/




Other Videos By Microsoft Research


2020-12-09Evidence based CS education
2020-12-09Physical computing for computer science education
2020-12-09Accessible CS Education Fall Workshop: Microsoft Chief Accessibility Officer Jenny Lay-Flurrie
2020-12-09Students with disabilities in the U.S.
2020-12-09Welcome & Introduction to Microsoft's Accessible Computer Science Education Fall Workshop
2020-12-08De-Identifying Healthcare Data for Research
2020-12-05Task-Oriented Dialogue as Dataflow Synthesis
2020-12-03The opportunities with AI and machine learning
2020-12-02Demonstration of Lumiere (1995)
2020-12-02Demonstration of Priorities & Notification Platform (2001)
2020-12-01Recent Efforts Towards Efficient And Scalable Neural Waveform Coding
2020-12-01Geometry-constrained Beamforming Network for end-to-end Farfield Sound Source Separation
2020-11-24Directions in ML: Automating Dataset Comparison and Manipulation with Optimal Transport
2020-11-13Audio-based Toxic Language Detection
2020-11-05CDO roundtable: Generating business value through data quality
2020-11-04Unlocking IoT Data for Research in Healthcare
2020-11-03MSR Twitter Local Events
2020-11-02Spotlight on advancements in AI, HCI, Computing, VR, Systems Networking & more at Microsoft Research
2020-10-30Distinct population of sudden unexpected infant death based on age
2020-10-28Enabling interaction between mixed reality and robots via cloud-based localization
2020-10-26Directions in ML: AutoML & Interpretability: Powering the machine learning revolution in healthcare



Tags:
Neural Waveform Coding
Acoustic signal compression
floating-point waveform
bitstream representation
data storage
telecommunication infrastructure
acoustic coding systems
Microsoft Research
neural speech/audio coding
cascaded cross-module residual learning
multistage quantization