Semi-unsupervised learning of taxonomic and non-taxonomic relationships from the web

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=FzJ3_f3a67o



Duration: 1:01:08
43 views
1


Due to the size of the World Wide Web, it is necessary to develop tools for automatic or semi-automatic analyses of web data, such as finding patterns and implicit information in the web, a task usually known as Web Mining. In particular, web content mining consists of automatically mining data from textual web documents that can be represented with machine-readable semantic formalisms. While more traditional approaches to Information Extraction from text, such as those applied to the Message Understanding Conferences during the nineties, relied on small collections of documents with many semantic annotations, the characteristics of the web (its size, redundancy and the lack of semantic annotations in most texts) favor efficient algorithms able to learn from unannotated data. Furthermore, new types of web content such as web forums, blogs and wikis, are also a source of textual information that contain an underlying structure from which specialist systems can benefit. This talk will describe an ongoing project for automatically acquiring ontological knowledge (both taxonomic and non-taxonomic relationships) from the web in a partially unsupervised way. The proposed approach combines distributional semantics techniques with rote extractors. A particular focus will be set on an automatic addition of semantic tags to the Wikipedia with the aim of transforming it, with small effort, into a Semantic Wikipedia.




Other Videos By Microsoft Research


2016-09-06Guanxi (The Art of Relationships) : Microsoft, China, and Bill Gates's Plan to Win the Road Ahead
2016-09-06Increasing Concurrency using EDGE Architectures
2016-09-06Decision Procedures for Recursive Data Structures with Integer Arithmetic
2016-09-06Supporting Construction, Analysis, and Understanding of Software Models.
2016-09-06Program Verification via Three-Valued Logic Analysis
2016-09-06Efficient Data Dissemination in Bandwidth-Asymmetric P2P Networks
2016-09-06Tractable Learning of Structured Prediction Models
2016-09-06Future Hype: The Myths of Technology Change
2016-09-06Improving Packet Delivery Efficiency Using Multi-Radio Diversity in Wireless LANs
2016-09-06Algorithmic Foundations of P2P and Wireless Networks
2016-09-06Semi-unsupervised learning of taxonomic and non-taxonomic relationships from the web
2016-09-06The Weather Makers: How Man is Changing the Climate and What it Means for Life on Earth
2016-09-06Touched with Light: Scanned beams display or capture information at video rates
2016-09-06Internet Background Radiation
2016-09-06Understanding and Improving Wireless Networks
2016-09-06SAFECode: A Platform for Developing Reliable Software in Unsafe Languages
2016-09-06Enabling Internet Malware Investigation and Defense Using Virtualization
2016-09-06Cohomology in Grothendieck Topologies and Lower Bounds in Boolean Complexity
2016-09-06Approximate inference techniques for optimal design in self-assembly and automated programming
2016-09-06Machine Learning Methods for Structured and Collective Classification
2016-09-06Communication Technology: Interruption and Overload



Tags:
microsoft research