The LabelMe dataset and its applications to scene and object recognition

Channel:

Google TechTalks

Subscribers:

349,000

Published on August 16, 2008 10:31:18 AM ● Video Link: https://www.youtube.com/watch?v=OPPFBaKc2eU

Duration: 1:04:11

12,350 views

Google Tech Talks
August 15, 2008

ABSTRACT

We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. We used the "Tom Sawyer fence painting" approach, and developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection.

We have applied this dataset to scene and object recognition. Current object recognition systems can only recognize a limited number of object categories; scaling up to many categories is the next challenge. We seek to build a system to recognize and localize many different object categories in complex scenes. We achieve this through a simple approach: by matching the input image, in an appropriate representation, to images in a large training set of labeled images. Due to regularities in object identities across similar scenes, the retrieved matches provide hypotheses for object identities and locations. We build a probabilistic model to transfer the labels from the retrieval set to the input image. We demonstrate the effectiveness of this approach and study algorithm component contributions using held-out test sets from the LabelMe database.

Joint work with Antonio Torralba, Byran Russell, Kevin Murphy, Rob
Fergus and Ce Liu.

Speaker: Bill Freeman, MIT and Adobe Systems
Bill Freeman is a professor of Electrical Engineering and Computer Science at MIT, and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL). He studies computer vision, computer graphics, and machine learning, addressing how to represent, manipulate, and understand images. Before joining MIT, he worked for 9 years at Mitsubishi Electric Research Labs, for 6 years at the Polaroid Corporation, and for 1 year as a Foreign Expert at the Taiyuan University of Technology, Shanxi, China. Part time, he works
at Adobe's Creative Technologies Lab. Hobbies include flying cameras in kites.

Other Videos By Google TechTalks

2008-08-28	Nerdfighters: Insider View from a YouTube Persona
2008-08-26	Larry Wall Speaks at Google
2008-08-26	Bill Guschwan: A History of Video Game Development
2008-08-26	Adium: Multi-protocol Chat for the Mac
2008-08-22	Are Internet users at risk?
2008-08-22	Balance, Stress, and Optimal Health
2008-08-22	The Business Case for Protecting the Climate
2008-08-22	Digital Design: Beyond Trial and Error
2008-08-22	A Startup University to Train Public Servants: the US Public Service Academy
2008-08-21	Where the Hell is Matt?
2008-08-16	The LabelMe dataset and its applications to scene and object recognition
2008-08-15	A New Approach to Design of Massively Parallel Systems
2008-08-15	The Borgmann Project: Listing all the Words in English
2008-08-14	gPXE: Modern FOSS Network Booting
2008-08-14	The World Market for Coal: What's going on the C of "RE less than C"?
2008-08-14	News from the Caucasus
2008-08-13	Supporting Casual Data-Centric Interactions on the Web
2008-08-13	When Will We Discover the Extraterrestrials?
2008-08-13	Love and Authentication -- Addressing the problem of password reset
2008-08-12	The Xbox 360 Security System and its Weaknesses
2008-08-12	Opportunities for Open Source Biotechnology in Underdeveloped Countries

Tags:

google

techtalks

techtalk

engedu

talk

talks

googletechtalks

education

Channel	Latest
Ganyu and Sparkle	6 hours ago
Awkward	6 hours ago
Hunkpain Gaming	6 hours ago
Android4L	6 hours ago
Pe Toys	6 hours ago
Bolt Spider	6 hours ago
Team Chaubey	6 hours ago
PUBG: BATTLEGROUNDS VIETNAM	6 hours ago
PRINCESS OF LIGHT	6 hours ago
NAKA研	6 hours ago
The Upcoming	6 hours ago
David Makadi	6 hours ago
Senhor Linguica	6 hours ago
NinjaMaker2222	6 hours ago
Above RARE	7 hours ago
Dippy Sauce	7 hours ago
Alfale97	7 hours ago
Mobile Legends: Bang Bang Cambodia	7 hours ago
Rimas 100	7 hours ago
Osborne	7 hours ago
Amo Ahmad	7 hours ago
Choosey	7 hours ago
Gamemagaz	7 hours ago
さおうさん	7 hours ago
Gold WebTv	7 hours ago