Multi-level Optimization Approaches to Computer Vision

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=QyXOj_hqQUw



Duration: 55:37
4,241 views
77


On a broad level, computer graphics involves representing 3D information in 2D. Computer vision can be thought of as the inverse problem - inferring 3D information from a projected representation. This talk will discuss two deep learning approaches to 3D human pose estimation and single-view object reconstruction that attempt to learn about solution feasibility while incorporating simple computer graphics techniques to ensure consistency with observations. The first approach optimizes a GAN to produce a parameterization of the feasible solution space, then seeks a solution in that space which is maximally consistent with observations. The follow-up approach is based on combining these optimization steps into a single nested optimization problem.

See more at https://www.microsoft.com/en-us/research/video/multi-level-optimization-approaches-to-computer-vision/




Other Videos By Microsoft Research


2020-05-26Explaining Decisions from Vision Models and Correcting them via Human Feedback
2020-05-26Auditing Outsourced Services
2020-05-26MSR Distinguished Lecture Series: First-person Perception and Interaction
2020-05-26Large-scale live video analytics over 5G multi-hop camera networks
2020-05-26Kristin Lauter's TED Talk on Private AI at Congreso Futuro during Panel 11 / SOLVE
2020-05-19How an AI agent can balance a pole using a simulation
2020-05-19How to build Intelligent control systems using new tools from Microsoft and simulations by Mathworks
2020-05-13Diving into Deep InfoMax with Dr. Devon Hjelm | Podcast
2020-05-08An Introduction to Graph Neural Networks: Models and Applications
2020-05-07MSR Cambridge Lecture Series: Photonic-chip-based soliton microcombs
2020-05-07Multi-level Optimization Approaches to Computer Vision
2020-05-05How good is your classifier? Revisiting the role of evaluation metrics in machine learning
2020-05-05Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes
2020-05-05Hypergradient descent and Universal Probabilistic Programming
2020-05-04Learning over sets, subgraphs, and streams: How to accurately incorporate graph context
2020-05-04An Ethical Crisis in Computing?
2020-04-21Presentation on “Beyond the Prototype” by Rushil Khurana
2020-04-20Understanding and Improving Database-backed Applications
2020-04-20Efficient Learning from Diverse Sources of Information
2020-04-08Project Orleans and the distributed database future with Dr. Philip Bernstein | Podcast
2020-04-07Reprogramming the American Dream: A conversation with Kevin Scott and J.D. Vance, with Greg Shaw



Tags:
Computer vision
computer graphics
3D information
deep learning
3D human pose
GAN
Dominic Jack
Microsoft Research Cambridge