Look Ma, no markers: holistic performance capture without the hassle

Subscribers:
342,000
Published on ● Video Link: https://www.youtube.com/watch?v=4RkLDW3GmdY



Duration: 0:00
30,089 views
1,349


We tackle the problem of highly-accurate, holistic performance capture for the face, body and hands simultaneously. Motion-capture technologies used in film and game production typically focus only on face, body or hand capture independently, involve complex and expensive hardware and a high degree of manual intervention from skilled operators. While machine-learning-based approaches exist to overcome these problems, they usually only support a single camera, often operate on a single part of the body, do not produce precise world-space results, and rarely generalize outside specific contexts. In this work, we introduce the first technique for marker-free, high-quality reconstruction of the complete human body, including eyes and tongue, without requiring any calibration, manual intervention or custom hardware. Our approach produces stable world-space results from arbitrary camera rigs as well as supporting varied capture environments and clothing. We achieve this through a hybrid approach that leverages machine learning models trained exclusively on synthetic data and powerful parametric models of human shape and motion. We evaluate our method on a number of body, face and hand reconstruction benchmarks and demonstrate state-of-the-art results that generalize on diverse datasets.

See the project page for more details and dataset download instructions: https://aka.ms/synthmocap




Other Videos By Microsoft Research


2024-12-18Embodied AI Workshop at CVPR 2024
2024-12-10GASP: Gaussian Avatars with Synthetic Priors
2024-12-09A Closer Look at Falcon
2024-12-09Quantum Lattice Enumeration in Limited Depth, Fernando Virdia
2024-12-09Enhancing Security of Bluetooth Secure Connections via Deferrable Authentication
2024-12-09Improving the Security of United States Elections with Robust Optimization
2024-11-18Introducing BiomedParse, a groundbreaking foundation model for biomedical image analysis
2024-11-11Low latency carbon budget 2023
2024-10-31Future Directions for XR Interactions with Advanced Sensing Techniques and Haptic Design Frameworks
2024-10-31Estimating mental workload in a simulated flight task using optical f-NIRS signals
2024-10-17Look Ma, no markers: holistic performance capture without the hassle
2024-10-17Hairmony: Fairness-aware hairstyle classification
2024-10-01Data Formulator: Create Rich Visualization with AI iteratively
2024-09-27Pretrainer's Guide to Training Data: Measuring Effects of Age, Domain Coverage, Quality, & Toxicity
2024-09-18AI for Business Transformation: Lessons from Healthcare
2024-09-18AI for Business Transformation: Multimodal Models
2024-09-18AI for Business Transformation: The Business of Data
2024-09-18Ludic Design for Accessibility
2024-09-16At the Foothills of an AI Era in Science | Gilbert S. Omenn Grand Challenges Address
2024-09-03Fostering appropriate reliance on AI
2024-08-27ML for High-Performance Climate and Earth Virtualization Engines