Recent Efforts Towards Efficient And Scalable Neural Waveform Coding
Acoustic signal compression techniques, which convert floating-point waveforms into bitstream representations, serve as a cornerstone of current data storage and telecommunication infrastructure. The rise of data-driven approaches to acoustic coding brings not only opportunities but also challenges, among which model complexity is a major concern: on the one hand, this general-purpose computational paradigm offers superior performance; on the other hand, most codecs are deployed on low-power devices that can barely afford the overwhelming computational overhead. In this talk, I will introduce several of our recent efforts toward a better trade-off between performance and efficiency in neural speech/audio coding. I will present cascaded cross-module residual learning, which conducts multistage quantization with deep learning techniques; in addition, I will discuss a collaborative quantization scheme that simultaneously binarizes linear predictive coefficients and the corresponding residuals. If time permits, a novel perceptually salient objective function with psychoacoustic calibration will also be discussed.
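The multistage idea behind cascaded residual learning can be illustrated with a plain, non-neural residual quantizer: each stage codes only the error left by the previous stages, so decoding more stages yields a progressively finer reconstruction. The sketch below is a minimal illustration using hypothetical uniform scalar quantizers with hand-picked step sizes; it is not the neural codec described in the talk.

```python
import numpy as np

def uniform_quantize(x, step):
    """Uniform scalar quantizer: snap each sample to the nearest multiple of `step`."""
    return np.round(x / step) * step

def multistage_encode(signal, steps):
    """Quantize in successive stages, coarse to fine: each stage codes the
    residual left behind by the previous stages."""
    residual = np.asarray(signal, dtype=float).copy()
    stages = []
    for step in steps:
        q = uniform_quantize(residual, step)
        stages.append(q)
        residual = residual - q  # pass what this stage missed on to the next
    return stages

def multistage_decode(stages):
    """Reconstruction is the sum of all stage outputs; dropping later stages
    still decodes, just at lower fidelity -- the scalability property."""
    return np.sum(stages, axis=0)

rng = np.random.default_rng(0)
x = rng.standard_normal(16)
stages = multistage_encode(x, steps=[0.5, 0.1, 0.02])

err_full = np.max(np.abs(x - multistage_decode(stages)))   # all three stages
err_coarse = np.max(np.abs(x - stages[0]))                 # first stage only
print(err_full, err_coarse)
```

Each uniform stage with step size `s` leaves a residual of at most `s/2` per sample, so the full three-stage decode here is within 0.01 of the input, while the single-stage decode can be off by up to 0.25. In the talk's neural setting, learned autoencoder modules play the role of these quantizers, each trained on the residual of its predecessors.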
Learn more about this and other talks at Microsoft Research: https://www.microsoft.com/en-us/research/video/recent-efforts-towards-efficient-and-scalable-neural-waveform-coding/