DAViD: Data-efficient and Accurate Vision Models from Synthetic Data

Subscribers:
351,000
Published on ● Video Link: https://www.youtube.com/watch?v=FQtHC5e1dDg



Duration: 0:00
3,752 views
85


The state of the art in human-centric computer vision achieves high accuracy and robustness across a diverse range of tasks. The most effective models in this domain have billions of parameters, thus requiring extremely large datasets, expensive training regimes, and compute-intensive inference. In this paper, we demonstrate that it is possible to train models on much smaller but high-fidelity synthetic datasets, with no loss in accuracy and higher efficiency. Using synthetic training data provides us with excellent levels of detail and perfect labels, while providing strong guarantees for data provenance, usage rights, and user consent. Procedural data synthesis also provides us with explicit control on data diversity, that we can use to address unfairness in the models we train. Extensive quantitative assessment on real input images demonstrates accuracy of our models on three dense prediction tasks: depth estimation, surface normal estimation, and soft foreground segmentation. Our models require only a fraction of the cost of training and inference when compared with foundational models of similar accuracy.

Project page: https://aka.ms/DAViD




Other Videos By Microsoft Research


2025-08-19MindJourney: Test-Time Scaling with World Models for Spatial Reasoning
2025-08-11Medical Bayesian Kiosk (2010)
2025-08-07Reimagining healthcare delivery and public health with AI
2025-08-05VeriTrail: Detect hallucination and trace provenance in AI workflows
2025-07-31Computational models for brain science
2025-07-30VoluMe: Authentic 3D Video Calls from Live Gaussian Splat Prediction
2025-07-28How I became a StoryTeller (and how YOU can too)
2025-07-28Make some noise: Teaching the language of audio to an LLM using sound tokens
2025-07-28Building Better Language Models Through Global Understanding
2025-07-24Navigating medical education in the era of generative AI
2025-07-22DAViD: Data-efficient and Accurate Vision Models from Synthetic Data
2025-07-21AI Testing and Evaluation: Reflections
2025-07-20Intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact Models
2025-07-15AI Testing and Evaluation: Learnings from cybersecurity
2025-07-10Scalable emulation of protein equilibrium ensembles with BioEmu
2025-07-10How AI will accelerate biomedical research and discovery
2025-07-09Introducing Microsoft AI Economy Institute
2025-07-07AI Testing and Evaluation: Learnings from pharmaceuticals and medical devices
2025-07-03Against Softmaxing Culture: Understanding Relational Practices in Expert and Ordinary Forms of Work
2025-06-30AI Testing and Evaluation: Learnings from genome editing
2025-06-23AI Testing and Evaluation: Learnings from Science and Industry