Hierarchical Phrase-Based Translation with Suffix Arrays.

Video Link: https://www.youtube.com/watch?v=_LgQlMiMJY0



Duration: 1:14:14


A major engineering challenge in statistical machine translation systems is the efficient representation of extremely large translation rulesets. In phrase-based models, this problem can be addressed by storing the training data in memory and using a suffix array as an index to look up and extract rules on the fly. Hierarchical phrase-based translation introduces the added wrinkle of source phrases with gaps. The lookup algorithms used for contiguous phrases no longer apply, and the best approximate pattern-matching algorithms are much too slow, taking several minutes per sentence. I describe new lookup algorithms for hierarchical phrase-based translation that reduce the empirical computation time by nearly two orders of magnitude, making on-the-fly lookup feasible for source phrases with gaps. I will also discuss some novel applications of these algorithms.
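To make the lookup problem concrete, here is a minimal Python sketch of the contiguous case described above: a suffix array built over an in-memory tokenized corpus, queried by binary search. It also includes a naive baseline for a phrase with one gap. The function names, the toy corpus, and the `max_gap` parameter are assumptions made for illustration. The gapped lookup shown is a brute-force pairing of occurrence lists, not the faster algorithms presented in the talk.

```python
from typing import List, Sequence, Tuple


def build_suffix_array(tokens: Sequence[str]) -> List[int]:
    """Return suffix start positions sorted lexicographically by suffix.

    Naive construction that compares whole suffixes; fine for a toy corpus,
    far too slow for real training data.
    """
    toks = list(tokens)
    return sorted(range(len(toks)), key=lambda i: toks[i:])


def find_phrase(tokens: Sequence[str], sa: List[int],
                phrase: Sequence[str]) -> List[int]:
    """Return sorted corpus positions where the contiguous phrase occurs."""
    toks, pat = list(tokens), list(phrase)
    m = len(pat)

    # Lower bound: first suffix whose length-m prefix is >= the phrase.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if toks[sa[mid]:sa[mid] + m] < pat:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-m prefix is > the phrase.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if toks[sa[mid]:sa[mid] + m] <= pat:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


def find_gapped(tokens: Sequence[str], sa: List[int],
                left: Sequence[str], right: Sequence[str],
                max_gap: int) -> List[Tuple[int, int]]:
    """Naive baseline for the pattern `left X right`, X spanning 1..max_gap tokens.

    Pairs every occurrence of `left` with every occurrence of `right`,
    which is quadratic in the number of occurrences.
    """
    left_pos = find_phrase(tokens, sa, left)
    right_pos = find_phrase(tokens, sa, right)
    matches = []
    for i in left_pos:
        gap_start = i + len(left)
        for j in right_pos:
            if 1 <= j - gap_start <= max_gap:
                matches.append((i, j))
    return matches


corpus = "the cat sat on the mat and the cat ran".split()
sa = build_suffix_array(corpus)
contiguous_hits = find_phrase(corpus, sa, ["the", "cat"])
gapped_hits = find_gapped(corpus, sa, ["the"], ["on"], max_gap=2)
```

The contiguous lookup costs a logarithmic number of comparisons in the corpus size, which is why the suffix array works well as an index. The gapped baseline pairs every occurrence of one side with every occurrence of the other, so frequent words such as "the" make it expensive. This is only an illustration of why gaps complicate lookup.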







Tags:
microsoft research