Information Extraction Crossing Language, Robustness and Domain Barriers

Video Link: https://www.youtube.com/watch?v=IloalRDpDjo



Category: Guide
Duration: 1:00:21


Modern communication technologies have made massive amounts of real-time news in several languages readily available. This has created a need for news-monitoring systems that allow users to monitor multilingual news media in near real time and search over stored content. One example of such a system is the Translingual Automatic Language Exploration System, codenamed TALES. In this talk I will briefly describe the architecture of TALES and focus on its information extraction component. Information extraction is a crucial step toward understanding a text, as it identifies the important conceptual objects in a discourse and the relations between them. I will address the portability of the approach to different languages and show a method of propagating information from resource-rich languages into low-resource ones. In contrast to other approaches that focus on clean text, I will also show the robustness of our technique to less well-formed input. For example, information extraction in a multilingual broadcast processing system has to deal with inaccurate automatic transcription and translation. The resulting presence of non-target-language text yields many false alarms, which raises the research problem of making information extraction robust to such noisy input. If time permits, I will also discuss the application and adaptation of these techniques to the health-care domain.




Other Videos By Microsoft Research


2016-07-27 Quantum Computation for Quantum Chemistry: Status, Challenges and Prospects - Session 5
2016-07-27 Markov Type and the Multi-scale Geometry of Metric Spaces - How Well Can Martingales Aim?
2016-07-27 Content Everywhere: The Challenges of a Mobile, Wireless and Social Viewership
2016-07-27 Computer Aided Translation
2016-07-27 Inference and Learning with Random Maximum A-Posteriori Perturbations
2016-07-27 9.5 Theses on the Power and Efficacy of Gamification
2016-07-27 Innovation in Open Networks and the MIT Media Lab
2016-07-27 From C/C++11 to Power and ARM: What is Shared-Memory Concurrency Anyway?
2016-07-27 Semantic Awareness for Automatic Image Interpretation
2016-07-27 Overview of "Big Data" Research at TU Berlin
2016-07-27 Information Extraction Crossing Language, Robustness and Domain Barriers
2016-07-27 HMM-based Speech Synthesis: Fundamentals and Its Recent Advances
2016-07-27 Embedded Systems and Kinetic Art: A Natural Collaboration
2016-07-27 Proactive Health Management Using In-Home Sensing and Recognition Technology
2016-07-27 Grand Challenges of Human-Robot Interaction in Space
2016-07-27 Posted Prices Exchange for Display Advertising Contracts
2016-07-27 Tensor Decompositions for Learning Hidden Variable Models
2016-07-27 BodyTrack: Open Source Tools for Health Empowerment through Self-Tracking
2016-07-27 Detecting Fake Reviews
2016-07-27 Sublinear Optimization
2016-07-27 A novel paradigm for nonlinear speech processing through local singularity analysis



Tags:
microsoft research