What Do Those Images Have In Common?

Google Tech Talks
March 25, 2008
Duration: 59:53
Video Link: https://www.youtube.com/watch?v=YoOULXBa2wg

ABSTRACT

This talk is about discovering and modeling previously unspecified, recurring themes in a given set of arbitrary images. Given a set of images, each containing frequent occurrences of objects from multiple categories, the goal is to learn a compact model of the categories as well as their relationships, for the purpose of later recognizing and segmenting any occurrences in new images. Categories are not defined by the user, and whether and where instances of any category appear in a specific image is not known. This problem is challenging, since it involves the following open questions. What is an object category? What image properties should be used, and how should they be combined, to discover category occurrences? What is an efficient multicategory representation?

We will examine a methodology, developed during my postdoctoral work at UIUC. Each image is represented by a segmentation tree whose nodes correspond to image regions, segmented at all natural scales present, and whose edges capture the region embedding. The presence of any categories in the image set is then reflected in the frequent occurrence of similar subtrees within the segmentation trees. Our methodology is designed to: (1) match image trees to find similar subtrees; (2) discover categories by clustering similar subtrees, and use the properties of each cluster to learn the model of the associated category; and (3) learn the grammar of the discovered categories, which compactly captures their recursive definitions in terms of other simpler (sub)categories and their relationships (e.g., containment, co-occurrence, and sharing of simple categories by more complex ones). When a new image is encountered, its segmentation tree is matched against the learned grammar to simultaneously recognize and segment all occurrences of the learned categories. This matching also provides a semantic explanation of object recognition in terms of the identified parts along with their spatial relationships.
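The three stages above can be sketched in miniature. The snippet below is a toy illustration only, not the talk's actual method: `SegTreeNode`, `subtree_similarity`, and `cluster_subtrees` are hypothetical names, the similarity measure is a crude greedy recursion rather than an optimal tree matching, and the clustering is simple single-link grouping standing in for stage (2).

```python
# Toy sketch of segmentation-tree matching and subtree clustering.
# All names and the similarity measure are illustrative assumptions,
# not the algorithm presented in the talk.

from dataclasses import dataclass, field

@dataclass
class SegTreeNode:
    """A region in a segmentation tree: a feature vector for the region,
    plus child nodes for the subregions embedded in it."""
    features: tuple                       # e.g. (mean intensity, area)
    children: list = field(default_factory=list)

def subtree_similarity(a: SegTreeNode, b: SegTreeNode) -> float:
    """Crude recursive similarity in [0, 1]: compare node features, then
    greedily pair children in order. A real system would instead solve
    an optimal subtree-matching problem."""
    dist = sum((x - y) ** 2 for x, y in zip(a.features, b.features)) ** 0.5
    node_sim = 1.0 / (1.0 + dist)
    child_sims = [subtree_similarity(x, y)
                  for x, y in zip(a.children, b.children)]
    if not child_sims:
        return node_sim
    return 0.5 * node_sim + 0.5 * sum(child_sims) / len(child_sims)

def cluster_subtrees(subtrees, threshold=0.8):
    """Stage (2) stand-in: greedy single-link clustering of subtrees.
    Each resulting cluster plays the role of a discovered category."""
    clusters = []
    for tree in subtrees:
        for cluster in clusters:
            if any(subtree_similarity(tree, member) >= threshold
                   for member in cluster):
                cluster.append(tree)
                break
        else:
            clusters.append([tree])
    return clusters

# Two near-identical subtrees should land in one cluster (one "category");
# a dissimilar one should form its own.
t1 = SegTreeNode((0.90, 1.0), [SegTreeNode((0.20, 0.1))])
t2 = SegTreeNode((0.88, 1.0), [SegTreeNode((0.22, 0.1))])
t3 = SegTreeNode((0.10, 5.0))
clusters = cluster_subtrees([t1, t2, t3])
```

Stage (3), learning a grammar over the clusters, is omitted here; the point is only that once images are trees, category discovery reduces to finding and grouping recurring subtrees.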

The aforementioned methodology can also be used for identifying recurring image themes of a more general kind. One example is extracting the stochastically repeating, elementary parts of image texture (e.g., waterlilies on a water surface, people in a crowd).

This talk will be taped by the engEDU Tech Talks Team.

Speaker: Sinisa Todorovic
Sinisa Todorovic received the joint B.S./M.S. degree with honors in electrical engineering from the University of Belgrade, Serbia, in 1994. From 1994 until 2001, he worked in the communications industry. He received the M.S. and Ph.D. degrees in electrical and computer engineering from the University of Florida, Gainesville, in 2002 and 2005, respectively. Since 2005, he has held the position of Postdoctoral Research Associate in the Beckman Institute at the University of Illinois Urbana-Champaign, where he collaborates with Prof. Narendra Ahuja. Sinisa's main research interests concern computer vision and machine learning, with a current focus on unsupervised extraction and representation of visual themes recurring in images. He is the recipient of the 2004 Jack Neubauer Best Paper Award for a publication in IEEE Trans. Vehicular Technology, and of the Outstanding Reviewer Award at the Int. Conf. on Computer Vision (ICCV) 2007. He serves as Associate Editor of Advances in Multimedia.






