DIABLo: a Deep Individual-Agnostic Binaural Localizer

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=q5UxYqWN0Uc



Duration: 1:05:54
1,239 views
32


In this project, we have developed and studied a deep neural network-based individual-agnostic general-purpose binaural localizer (BL) for sound sources located at arbitrary directions on the $4\pi$ sphere. Unlike binaural localization models trained with an HRIR catalog associated with a specific head and ear shape, an individual-agnostic model aims for the generalization over the individuality of HRIRs, and does not assume a-priori knowledge about the HRIRs which the sound wave is filtered through at recording time. The proposed model was evaluated via localization tests using public binaural room impulse responses (BRIRs) and binaural recording datasets and was found to deliver more robust and accurate localization in noisy and reverberant conditions and unknown recording-time HRIRs compared to BLs trained on a single subject's HRIR catalog. The proposed model is also designed to support multiple or moving sources, and demonstrations for these scenarios are provided.




Other Videos By Microsoft Research


2022-01-04Inauguration Ceremony of MSR Asia Theory Center Opening Speech from Zhi-Ming Ma
2022-01-04Plenary Talk: Theory in an industry lab
2022-01-04Plenary Talk: From mathematics to data science and back
2022-01-04Talk: Causal inference, observational studies, and the 2021 Nobel Prize in Economics
2022-01-04AI for Programming Education
2022-01-04Talk: Recent results on stochastic 3D Navier-Stokes equations
2022-01-04Talk: Theoretical Aspects of Gradient Methods in Deep Learning
2022-01-04Recap video of 2021 MSR Asia Theory Workshop (Long version)
2022-01-04SysSieve: Extracting Actionable Insights from Unstructured Text
2022-01-03Fourier Feature Networks and Neural Volume Rendering
2021-12-20DIABLo: a Deep Individual-Agnostic Binaural Localizer
2021-12-16Our Genomes, Our Selves?
2021-12-01A law of robustness and the importance of overparametrization in deep learning
2021-11-29Research for Industries (RFI) Lecture Series: Steven K. Frey
2021-11-24Research for Industries (RFI) Lecture Series: Matthew Realff & Christopher Jones
2021-11-23On Race and Technoculture
2021-11-08Acrylic, metal & a means of preparation: Imagining & living Black life beyond the surveillance state
2021-11-01FastNeRF: High-Fidelity Neural Rendering at 200FPS [Extended]
2021-10-29Full-Body Motion from a Single Head-Mounted Device: Generating SMPL Poses from Partial Observations
2021-10-29Litmus Predictor
2021-10-25Interview and Q&A with Jenny Sabin, Creator of the Ada Installation in Microsoft Building 99