Audio Cameras for Audio-Visual Scene Analysis

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=WUGF0sgnnMk



Duration: 1:18:32
133 views
0


Computer vision has been able to use images to reason about scenes. We develop a new device, the audio camera, that displays images of the auditory energy arriving at a particular point, much in the same way that a visual camera displays this information for light. This device is based on the spherical microphone array, and describe how we were able to achieve frame-rate operation using graphical processing. Next, we consider the combination of the audio-camera with visual cameras. Based on the observation that audio cameras also produce central projection images, we are able to jointly calibrate audio and visual cameras. This allows applications such as 1) Using the epipolar constraint to guide beamforming in noisy environments 2) Do audio-visual image transfers 3) Image room and concert-hall acoustics 4) Image-based matched filter beamforming and dereverberation The audio camera design, has also been iterated to make the design robust, and easy to deploy. Time permitting, we will also briefly touch on other research topics in our research group, including audio scene rendering; the fast multipole method, and fast kernel algorithms; scientific computing on GPUs.




Other Videos By Microsoft Research


2016-09-07UPCRC Multicore Applications Workshop - Welcome, and Visual Computing - Session # 1
2016-09-07And Then There's This: How Stories Live and Die in Viral Culture
2016-09-07Shaplets, Motifs and Discords: A set of Primitives for Mining Massive Time Series and Image Archives
2016-09-07Modern Computer Arithmetic [1/6]
2016-09-07ISP-Enabled Behavioral Ad Targeting without User Consent (and Beyond)
2016-09-07A Research Program Proposal--Universal Cache Miss Equations for the Memory Hierarchy
2016-09-07Structured Prediction Models in Computer Vision | Efficient Convex Relaxation of Mixture Regression
2016-09-07UPCRC Multicore Applications Workshop - Session # 6 - Human-machine Interaction
2016-09-07Inferring Rankings under Constrained Sensing
2016-09-07UPCRC Multicore Applications Workshop - Session # 5 - Human-machine Interaction
2016-09-07Audio Cameras for Audio-Visual Scene Analysis
2016-09-07Block Switching: Towards a Robust Protocol Stack for Diverse Wireless Networks
2016-09-07A Programming Language for the New Web
2016-09-07The Beauty and the Beast: Vulnerability in Red Hat's Packages
2016-09-07Debian: Anatomy of An Open Source Project
2016-09-07UPCRC Multicore Applications Workshop - Session # 3 - Social Interaction
2016-09-07Supersingular abelian varieties and modular forms
2016-09-07The Jasons: The Secret History of Science's Postwar Elite           
2016-09-07UPCRC Multicore Applications Workshop - Session # 4 - Speech and Audio
2016-09-07Literacy Bridge and the Talking Book Project
2016-09-07Stencil Computation Auto-tuning on Modern Multicore Architectures



Tags:
microsoft research