Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition

Channel:

Subscribers:

351,000

Published on April 29, 2021 2:08:45 AM ● Video Link: https://www.youtube.com/watch?v=VGu13TNhezo

Duration: 1:37:59

1,882 views

In this talk we will discuss the general architecture of speech enhancement pipelines for the needs of hands-free telecommunication and distant speech recognition. The talk will discuss both classical approaches using statistical signal processing and deep learning using neural networks. It will be illustrated with real-life examples from the speech enhancement audio pipelines in Kinect, HoloLens, and Teams.

PRESENTERS: Dr. Ivan J. Tashev and Dr. Sebastian Braun from Audio and Acoustics Research Group in Microsoft Research – Redmond, WA, USA.

Other Videos By Microsoft Research

2021-05-20	Failures of imagination: Discovering and measuring harms in language technologies
2021-05-13	Cities Unlocked – Introducing 3D Sound for Greater Mobility and Independence
2021-05-13	The Journey to Microsoft Soundscape
2021-05-13	Microsoft Soundscape - Lighting up the World with Sound
2021-05-12	Platform for Situated Intelligence Workshop \| Day 1
2021-05-12	Platform for Situated Intelligence Workshop \| Day 2
2021-05-03	Knowledge Distillation as Semiparametric Inference
2021-05-03	Better design, implementation, and testing of async systems with Coyote
2021-05-03	Research @Microsoft Research India: interdisciplinary and impactful with Dr. Sriram Rajamani
2021-04-29	Virtual Lake Nona Impact Forum “Health Innovation in the New Reality”
2021-04-28	Sound Capture and Speech Enhancement for Communication and Distant Speech Recognition
2021-04-27	Virtual Lake Nona Impact Forum “Health Innovation in the New Reality”
2021-04-26	FastNeRF: High-Fidelity Neural Rendering at 200FPS [Condensed]
2021-04-21	Research for Industries (RFI) Lecture Series: Warren Powell
2021-04-21	Research for Industries (RFI) Lecture Series: Andreas Haeberlen
2021-04-13	Discovering hidden connections in art with deep, interpretable visual analogies
2021-04-13	ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed
2021-04-13	Interactive sound simulation: Rendering immersive soundscapes in games and virtual reality
2021-04-13	A prototype implementation of 4G packet gateway Microsoft Catapult FPGA platform
2021-04-12	Self-Tuning Networks: Amortizing the Hypergradient Computation for Hyperparameter Optimization
2021-04-06	Ultra-dense data storage and extreme parallelism with electronic-molecular systems

Channel	Latest
Mehmet Uzun	6 hours ago
domisumReplay: Syndra	6 hours ago
domisumReplay: Mordekaiser	6 hours ago
Shhoto	6 hours ago
DismArchus	6 hours ago
Baba Behwish	6 hours ago
domisumReplay: Aatrox	6 hours ago
domisumReplay: Akali	7 hours ago
domisumReplay: Sett	7 hours ago
domisumReplay: Kayle	7 hours ago
iTownGamePlay Terror&Diversión	7 hours ago
Nickich	7 hours ago
League of SUPPORT - LOL Replays	7 hours ago
Happy Animes Recaps	7 hours ago
SiIvaGunner	7 hours ago
Oh Shiitake Mushrooms	7 hours ago
domisumReplay: Nasus	7 hours ago
domisumReplay: Ahri	8 hours ago
HeroVoltsy	8 hours ago
JustSaySteven	8 hours ago
WildGamerSK	8 hours ago
Fz Frost	8 hours ago
RobtheMod	8 hours ago
domisumReplay: Camille	8 hours ago
Tivvv3 TivvyCat!	8 hours ago