DocEng 2011: Interoperable Metadata Semantics

Subscribers:
348,000
Published on ● Video Link: https://www.youtube.com/watch?v=dPriVwU73BQ



Duration: 12:57
593 views
2


The 11th ACM Symposium on Document Engineering
Mountain View, California, USA
September 19-22, 2011

Interoperable Metadata Semantics with Meta-Metadata: A Use Case Integrating Search Engines

Yin Qu, Andruid Kerne, Andrew M. Webb, Aaron Herstein
Presented by Yin Qu.

ABSTRACT

A use case involving integrating results from search engines illustrates how the meta-metadata language facilitates interoperable metadata semantics. Formal semantics can be hard to obtain directly. For example, search engines may only present results through web pages; even if they do provide web services, they don't provide them according to a mutually interoperable standard. We show how to use the open source meta-metadata language to define a common base class for search results, and how to extend the base class to create polymorphic variants that include engine-specific fields. We develop wrappers to extract data from HTML search results from engines including Google, Bing, Delicious, and Slashdot. We write a short meta-search program for integrating the search results, reranking them, and providing formatted HTML output. This provides an extensible formal and functional semantics for search. Meta-metadata also directly enables representing the same integrated search results as XML or JSON. This research can profoundly transform the derivation and representation of interoperable metadata semantics from a multitude of heterogeneous wild web sources.




Other Videos By Google TechTalks


2011-10-042011 Frontiers of Engineering: Accelerating Green Building Market Transformation with IT
2011-10-042011 Frontiers of Engineering: Where Are the Emerging Frontiers in Research and Innovation?
2011-10-04DocEng 2011: Reflowable Documents Composed from Pre-rendered Atomic Components
2011-10-04DocEng 2011: Paginate Dynamic and Web Content
2011-10-04DocEng 2011: Document Visual Similarity Measure For Document Search
2011-10-04DocEng 2011: A Versatile Model for Web Page Representation
2011-10-03DocEng 2011: Expressing Conditions in Tailored Brochures for Public Administration
2011-10-03DocEng 2011: Citation Pattern Matching Algorithms for Citation-based Plagiarism Detection
2011-10-03DocEng 2011: A Study of the Interaction of Paper Substrates on Printed Forensic Imaging
2011-09-302011 Frontiers of Engineering: Automatic Text Understanding of Content and Text Quality
2011-09-29DocEng 2011: Interoperable Metadata Semantics
2011-09-292011 Frontiers of Engineering: Large Scale Visual Semantic Extraction
2011-09-292011 Frontiers of Engineering: Advancing Natural Language Understanding
2011-09-282011 Frontiers of Engineering: Research at Google Lightning Talks
2011-09-28DocEng 2011: An Efficient Language-Independent Method to Extract Content from News Webpages
2011-09-28DocEng 2011: Dynamic Assistance to Adding Dimensions to Multi-structured Documents
2011-09-28DocEng 2011: Component-based Hypervideo Model
2011-09-282011 U.S. Frontiers of Engineering: Welcome and Opening Remarks
2011-09-282011 U.S. Frontiers of Engineering: Overview of Additive Manufacturing
2011-09-27DocEng 2011: Timesheets - When SMIL Meets HTML5 and CSS3
2011-09-27DocEng 2011: A Generic Calculus of XML Editing Deltas



Tags:
google tech talk
doceng 2011
document engineering