The successes and challenges of making low-data languages in online automatic translation portals

Video Link: https://www.youtube.com/watch?v=tgxnq3tlcgY



Duration: 1:28:49


Most of the development and deployment of machine translation (MT) technologies over the past several decades has targeted international languages. Only a few projects for low-data/low-density/low-resource/sparse-data/less-prevalent/less commonly taught/minority languages have led to successful prototypes and products. A number of technical, logistical, social, educational, and other factors influence the potential success of implementing systems for such languages. This talk will cover many of the lessons learned from previous projects and some of the pitfalls to avoid. It will also demonstrate how the recent effort to make Haitian Creole available for Haiti disaster relief achieved a certain level of success in record time because it could build upon previous work. Yet there were also obstacles that have been problematic and remain a concern for this language and for other less-prevalent languages. Lastly, the discussion will mention some ways to enable proactive, forward-thinking projects that use bootstrapping methods to reduce the risks of working in a primarily reactive mode. This will be an interactive dialogue with the audience, allowing for questions throughout the session, followed by additional question-and-answer time.







Tags:
microsoft research