UW/Microsoft 19th Quarterly Symposium in Computational Linguistics

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=i1bjGKKcxB0



Duration: 1:25:20
201 views
1


Announcing the nineteenth Symposium in Computational Linguistics sponsored by the UW Departments of Linguistics, Electrical Engineering, and Computer Science, Microsoft Research, and UW alumni at Microsoft. Come take advantage of this opportunity to connect with the computational linguistics community at Microsoft and the University of Washington. This is a regular opportunity for computational linguists at the University of Washington and at Microsoft to discuss topics in the field and to connect in a friendly informal atmosphere. This time the symposium consists of three talks from UW summer interns (and their MS mentors), followed by an informal reception. Hitting the Right Paraphrases in Good Time Stanley Kok (CSE) and Chris Brockett (MSR-NLP) We present a random-walk-based approach to extracting paraphrases from bilingual parallel corpora. The corpora are represented as a graph in which a node corresponds to a phrase, and an edge exists between two nodes their corresponding phrases are aligned. We sample random walks to compute the average number of steps it takes to reach a ranking of paraphrases with better ones being closer to the phrase of interest. This approach allows feature nodes that represent domain knowledge to be easily incorporated into the graph, and incorporates techniques to prevent the graph from growing too large for efficiency. Current state-of-the-art approaches, by contrast, require the graph to be bipartite, are limited to finding paraphrases that are of length two away from a phrase, and do not generally permit easy incorporation of domain knowledge into the graph. Manual evaluation of generated output shows that this approach outperforms state-of-the-art. Toward the Twuring Test: Conversation Modeling using Twitter Alan Ritter (CSE) and Colin Cherry (MSR-NLP) The growing popularity of social media has had an interesting side-effect for language researchers: services such as Twitter have resulted in people having instant-messenger-style conversations using a public medium, where anyone can observe. This creates a unique opportunity to collect, study, and model large-scale conversation data. We present a method for mining conversations from Twitter's public feed. The resulting conversation corpus, which will be made publicly available, has more than 1.3 million conversations, 75 thousand of which have more than 5 turns, providing a rich resource for the study of both Twitter and internet chat. Furthermore, we present several methods that attempt to model the flow of conversation by discovering latent classes over Tweets. We show that a repurposed content model (Barzilay and Lee 2004) can discover meaningful dialogue acts, such as question and comment, which indicate not only the role a Tweet plays in its conversation, but also the sorts of Tweets that are likely to follow. This model is improved and extended by employing a Bayesian sampling-based approach, allowing us to model a conversation's topic, and to introduce sparse priors during learning. Joint Inference for Knowledge Extraction from Biomedical Literature Hoifung Poon (CSE) and Lucy Vanderwende (MSR-NLP) Automatically extracting knowledge from online repositories (e.g., PubMed) holds the promise of dramatically speeding up biomedical research and drug design, and represents an outstanding example for the great vision of knowledge extraction from the Web. After initially focusing on entity recognition and binary interaction for protein, the community has recently shifted their attention towards the more ambitious goal of recognizing complex, nested event structures, which are ubiquitous in the literature. However, the state-of-the-art systems still adopt a pipeline architecture and fail to leverage the relational structures among candidate entities for mutual disambiguation. In this paper, we present the first joint approach for bioevent extraction that obtains state-of-the-art results. Our system is based on Markov logic and jointly predicts events and their arguments. We evaluated it using the BioNLP-09 Shared Task and compared it to the participating systems. Experimental results demonstrate the advantage of our approach.




Other Videos By Microsoft Research


2016-09-07Growth and Phase Transitions in Isolated and Interacting Networks
2016-09-07Woodland Park brings the Zoo to you! - Giving Campaign Presentation
2016-09-072009 eScience: Provenir ontology: Towards a Framework for eScience Provenance Management
2016-09-072009 eScience: Facilitating Next Generation Data Intensive Science using Semantic Technologies
2016-09-07Replica Symmetry and Combinatorial Optimization
2016-09-072009 eScience: Web Service Extension of Computational Biology Application Suite
2016-09-073D Geographical Environments & Geospatial Data Exploration using Flight Simulators and Geodatabases
2016-09-072009 eScience: Towards complete functional and structural imaging of cortical circuits
2016-09-072009 eScience: Data Intensive Scalable Computing: Applying Google-Style Computing to eScience
2016-09-072009 eScience: Tools and Techniques for Computational Biology
2016-09-07UW/Microsoft 19th Quarterly Symposium in Computational Linguistics
2016-09-072009 eScience: Author identity and social networking at arXiv
2016-09-072009 eScience: Simple Provenance in Scientific Databases
2016-09-07The Laws of Disruption: Harnessing the New Laws of Disruption that Govern Life and Business
2016-09-07Generating Intelligent Multimedia Presentations from Semantic Mashups using OAI-ORE and SMIL
2016-09-072009 eScience: Enhancing BLAST Comprehension with SilverMap
2016-09-07Comparing Mechanisms by their Vulnerability to Manipulation
2016-09-072009 eScience: Scalable Solution for a Comprehensive Appraisals of Contemporary Documents
2016-09-072009 eScience: Comment by Sketch: A picture says a million words
2016-09-07The History of Uranium
2016-09-072009 eScience: Software Support for Hybrid Computing



Tags:
microsoft research