Steering and Capturing Human Insight for Large-Scale Learning of Visual Objects

Channel:

Subscribers:

344,000

Published on August 17, 2016 9:57:13 PM ● Video Link: https://www.youtube.com/watch?v=EbXSvWMjKoY

Duration: 1:09:58

125 views

An important factor determining our success on object recognition and image search problems is how machine learning algorithms solicit and exploit human knowledge. Existing recognition approaches often manage human supervision in haphazard ways, and only allow a narrow, one-way channel of input from the annotator to the system. We propose algorithms that can steer human insight towards where it will have the most impact, and expand the manner in which recognition methods can assimilate that insight. The underlying goal is to use manual effort cost-effectively for robust visual learning. More specifically, I will present an approach to actively seek annotatorsΓÇÖ input when training an object recognition system. Unlike traditional active learning methods, we target not only the example for which a label is most needed, but also the type of label itself (e.g., an image tag vs. full segmentation). Further, since annotations should be fielded by distributed, uncoordinated annotators, we develop cost-sensitive selection algorithms to compute far-sighted predictions of which batches of data ought to be labeled next. Finally, beyond ΓÇ£askingΓÇ¥ annotators the right questions, I will show how we can ΓÇ£listenΓÇ¥ more deeply to what image-taggers unknowingly reveal in their annotations, by learning implied cues about object prominence from lists of ordered keywords. Using these cues, we improve state-of-the-art object detection and image retrieval results on benchmark datasets. This is work done with Sudheendra Vijayanarasimhan and Sung Ju Hwang.

Other Videos By Microsoft Research

2016-08-17	Fully Homomorphic Encryption; Bi-Deniable Encryption; We Have The Technology, Now Where Next?
2016-08-17	Verifying Safety and Liveness Properties of a Kernelized Web Browser
2016-08-17	The successes and challenges of making low-data languages in online automatic translation portals
2016-08-17	Optimal Auctions with Budget Constraints
2016-08-17	Proof of Aldous' spectral gap conjecture
2016-08-17	Why Social Computing Is So Hard
2016-08-17	Metastabiity and logarithmic energy barriers for a polymer dynamics
2016-08-17	Predicate Encryption; Structured Encryption and Controlled Disclosure; Cloud Cryptography
2016-08-17	Approximation Schemes for Optimization
2016-08-17	All pairs shortest path in quadratic time with high probability
2016-08-17	Steering and Capturing Human Insight for Large-Scale Learning of Visual Objects
2016-08-17	Welcome and opening remarks; Point Obfuscation and Friends; Outsourcing Computation
2016-08-17	Laser Processing of Materials III
2016-08-17	Finding all min st-cuts in planar graphs
2016-08-17	GenoZoom: Browsing the genome with Microsoft Biology Foundation, DeepZoom and Silverlight
2016-08-17	Robust Face Recognition Using Recognition-by-Parts, Boosting, and Transduction
2016-08-17	Laser Processing of Materials II
2016-08-17	The double dimer model with quaternions
2016-08-17	Robust High-dimensional Principal Component Analysis
2016-08-17	FPGAs & Side Communication Channels - Black Hat Risks and White Hat Benefits
2016-08-17	Laser Processing of Materials I

Tags:

microsoft research

Channel	Latest
Fer Tijerina	6 hours ago
They will Kill You	6 hours ago
Master Of Chaos	6 hours ago
Canal do Ed	6 hours ago
PolarisZenKai’s Amiibo Fights!	6 hours ago
Fenix	6 hours ago
Mr. Reach	7 hours ago
Agustin51	7 hours ago
foggedftw2	7 hours ago
LeagueSpotlight	7 hours ago
MORTIS	7 hours ago
DannyGaminGnC	7 hours ago
Call of Duty League	7 hours ago
Black Wolf	7 hours ago
The Lovelies FAN CHANNEL	7 hours ago
遥の趣味ブッフェ	7 hours ago
Games4Fun TV	7 hours ago
shisp	7 hours ago
PUBG MOBILE Türkiye	7 hours ago
Brinnmations	7 hours ago
EmKay	7 hours ago
Dante Dark	7 hours ago
JORPA _17	7 hours ago
V3T35	7 hours ago
The Khaorrupted	7 hours ago