Steering and Capturing Human Insight for Large-Scale Learning of Visual Objects

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=EbXSvWMjKoY



Duration: 1:09:58
125 views
0


An important factor determining our success on object recognition and image search problems is how machine learning algorithms solicit and exploit human knowledge. Existing recognition approaches often manage human supervision in haphazard ways, and only allow a narrow, one-way channel of input from the annotator to the system. We propose algorithms that can steer human insight towards where it will have the most impact, and expand the manner in which recognition methods can assimilate that insight. The underlying goal is to use manual effort cost-effectively for robust visual learning. More specifically, I will present an approach to actively seek annotatorsΓÇÖ input when training an object recognition system. Unlike traditional active learning methods, we target not only the example for which a label is most needed, but also the type of label itself (e.g., an image tag vs. full segmentation). Further, since annotations should be fielded by distributed, uncoordinated annotators, we develop cost-sensitive selection algorithms to compute far-sighted predictions of which batches of data ought to be labeled next. Finally, beyond ΓÇ£askingΓÇ¥ annotators the right questions, I will show how we can ΓÇ£listenΓÇ¥ more deeply to what image-taggers unknowingly reveal in their annotations, by learning implied cues about object prominence from lists of ordered keywords. Using these cues, we improve state-of-the-art object detection and image retrieval results on benchmark datasets. This is work done with Sudheendra Vijayanarasimhan and Sung Ju Hwang.




Other Videos By Microsoft Research


2016-08-17Fully Homomorphic Encryption; Bi-Deniable Encryption; We Have The Technology, Now Where Next?
2016-08-17Verifying Safety and Liveness Properties of a Kernelized Web Browser
2016-08-17The successes and challenges of making low-data languages in online automatic translation portals
2016-08-17Optimal Auctions with Budget Constraints
2016-08-17Proof of Aldous' spectral gap conjecture
2016-08-17Why Social Computing Is So Hard
2016-08-17Metastabiity and logarithmic energy barriers for a polymer dynamics
2016-08-17Predicate Encryption; Structured Encryption and Controlled Disclosure; Cloud Cryptography
2016-08-17Approximation Schemes for Optimization
2016-08-17All pairs shortest path in quadratic time with high probability
2016-08-17Steering and Capturing Human Insight for Large-Scale Learning of Visual Objects
2016-08-17Welcome and opening remarks; Point Obfuscation and Friends; Outsourcing Computation
2016-08-17Laser Processing of Materials III
2016-08-17Finding all min st-cuts in planar graphs
2016-08-17GenoZoom: Browsing the genome with Microsoft Biology Foundation, DeepZoom and Silverlight
2016-08-17Robust Face Recognition Using Recognition-by-Parts, Boosting, and Transduction
2016-08-17Laser Processing of Materials II
2016-08-17The double dimer model with quaternions
2016-08-17Robust High-dimensional Principal Component Analysis
2016-08-17FPGAs & Side Communication Channels - Black Hat Risks and White Hat Benefits
2016-08-17Laser Processing of Materials I



Tags:
microsoft research