Learning to Map Natural Language to General Purpose Source Code

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=ccab3whv2N8



Duration: 1:24:54
625 views
13


Models that map natural language (NL) to source code in general purpose languages such as Java, Python, and SQL find utility amongst two main audiences viz. developers who can manipulate the generated code, and non-expert users who directly see the output of execution. Developing these models is challenging because of contextual dependencies of the target code, the lack of alignment between NL and code tokens, syntactic and semantic requirements of the target code, and the prohibitively expensive cost of annotating training data. The first part of my talk will focus on contextual code generation from NL for developers. I will present ways to obtain inexpensive training datasets from large online code repositories, followed by methods to incorporate contextual awareness in syntax-guided neural models to improve performance on the task. Next, I will describe techniques to extract and use programmatic idioms to significantly speed up training and scale our models. The second part of my talk will focus on building NL interfaces for querying databases using SQL, for non-expert users. I will describe methods to build deep learning models that improve in performance over time by leveraging user feedback and annotations obtained from crowd programmers. I will conclude by presenting ideas for extending our NL to code models, as well as several complementary tasks relating to interactivity and explainability.

Learn more about this and other talks at Microsoft Research: https://www.microsoft.com/en-us/research/video/learning-to-map-natural-language-to-general-purpose-source-code/




Other Videos By Microsoft Research


2019-10-24Microsoft PhD Summit 2019: Nikunj Raghuvanshi [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Martin Roetteler [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Asta Roseway [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Mike Sinclair [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Siddharth Suri [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Irene Zhang [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Bailu Ding [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Nathalie Riche [Short Talk]
2019-10-24Microsoft PhD Summit 2019: Ken Hinckley [Short Talk]
2019-10-24'The Global AI Supercomputer' by Donald Kossmann at Microsoft PhD Summit 2019 [Keynote]
2019-10-23Learning to Map Natural Language to General Purpose Source Code
2019-10-23Hand and User Detection with Multiple Users on Large Displays
2019-10-23Machine teaching, LUIS and the democratization of custom AI with Dr. Riham Mansour [Podcast]
2019-10-21Generalization in Reinforcement Learning with Selective Noise Injection
2019-10-21DreamWalker: Substituting Real-World Walking Experiences with a Virtual Reality
2019-10-21Learning Structured Models for Safe Robot Control
2019-10-21RDMA: Provably More Powerful Communication
2019-10-21Mise-Unseen: Using Eye Tracking to Hide Virtual Reality Scene Changes in Plain Sight
2019-10-21CapstanCrunch: A Haptic VR Controller with User-supplied Force Feedback
2019-10-18Social Computing for Social Good in Low-Resource Environments
2019-10-16News from the front in the post-quantum crypto wars with Dr. Craig Costello [Podcast]



Tags:
natural language
NL interfaces
contextual code generation
microsoft research
general purpose source code
MSR
Srini Iyer