Domain-specific language model pretraining for biomedical natural language processing

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=axQTxj4ExAM



Duration: 1:07:46
2,829 views
71


Pretraining large neural language models, such as BERT, has led to impressive gains on many natural language processing (NLP) tasks. However, most pretraining efforts focus on general-domain corpora, such as in newswire and web text. Biomedical text is very different from general-domain text, yet biomedical NLP has been relatively underexplored. A prevailing assumption is that even domain-specific pretraining can benefit by starting from general-domain language models.

In this webinar, Microsoft researchers Hoifung Poon, Senior Director of Biomedical NLP, and Jianfeng Gao, Distinguished Scientist, will challenge this assumption by showing that for domains with abundant unlabeled text, such as biomedicine, pretraining language models from scratch results in substantial gains over continual pretraining of general-domain language models.

You will begin with understanding how biomedical text differs from general-domain text and how biomedical NLP poses substantial challenges that are not present in mainstream NLP. You will also learn about the two paradigms for domain-specific language model pretraining and see how pretraining from scratch significantly outperforms mixed-domain pretraining in a wide range of biomedical NLP tasks. Finally, find out about our comprehensive benchmark and leaderboard created specifically for biomedical NLP, called BLURB, and see how our biomedical language model, PubMedBERT, sets a new state of the art.

Together, you'll explore:

โ–  How biomedical NLP differs from mainstream NLP
โ–  A shift in approach to pretraining language models for specialized domains
โ–  BLURB: a comprehensive benchmark and leaderboard for biomedical NLP
โ–  PubMedBERT: the state-of-the-art biomedical language model pretrained from scratch on biomedical text

๐—ฅ๐—ฒ๐˜€๐—ผ๐˜‚๐—ฟ๐—ฐ๐—ฒ ๐—น๐—ถ๐˜€๐˜:

โ–  BioMed NLP Group - https://www.microsoft.com/en-us/research/group/biomedical-nlp-group
โ–  Hanover (Project page): https://www.microsoft.com/en-us/research/project/project-hanover
โ–  Deep Learning (Group Page): https://www.microsoft.com/en-us/research/group/deep-learning-group
โ–  BLURB (GitHub): https://microsoft.github.io/BLURB
โ–  PubMedBERT (GitHub): https://microsoft.github.io/BLURB/models.html
โ–  Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing (Paper): https://www.microsoft.com/en-us/research/publication/domain-specific-language-model-pretraining-for-biomedical-natural-language-processing
โ–  Hoifung Poon (profile page): https://www.microsoft.com/en-us/research/people/hoifung
โ–  Jianfeng Gao (profile page): https://www.microsoft.com/en-us/research/people/jfgao

*This on-demand webinar features a previously recorded Q&A session and open captioning.

This webinar originally aired on October 15, 2020

Explore more Microsoft Research webinars: https://aka.ms/msrwebinars




Other Videos By Microsoft Research


2021-04-26FastNeRF: High-Fidelity Neural Rendering at 200FPS [Condensed]
2021-04-21Research for Industries (RFI) Lecture Series: Warren Powell
2021-04-21Research for Industries (RFI) Lecture Series: Andreas Haeberlen
2021-04-13Discovering hidden connections in art with deep, interpretable visual analogies
2021-04-13ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed
2021-04-13Interactive sound simulation: Rendering immersive soundscapes in games and virtual reality
2021-04-13A prototype implementation of 4G packet gateway Microsoft Catapult FPGA platform
2021-04-12Self-Tuning Networks: Amortizing the Hypergradient Computation for Hyperparameter Optimization
2021-04-06Ultra-dense data storage and extreme parallelism with electronic-molecular systems
2021-04-06Harmonizing the declarative and imperative in database systems
2021-04-06Domain-specific language model pretraining for biomedical natural language processing
2021-03-30Platform Biography: A framework for analyzing the structures and dynamics of social media
2021-03-30Building multimodal, integrative AI systems with Platform for Situated Intelligence
2021-03-29From player to creator: Designing video games on gaming handhelds with Microsoft TileCode webinar
2021-03-29Camera-based non-contact health sensing
2021-03-29Foundations of causal inference and its impacts on machine learning webinar
2021-03-29Avatars: Finding a sense of self and others in the virtual world
2021-03-25In pursuit of responsible AI: Bringing principles to practice
2021-03-25Fairness-related harms in AI systems: Examples, assessment, and mitigation
2021-03-25Enhancing mobile work and productivity with virtual reality
2021-03-23Mixed reality and robotics: Unlocking more intuitive human-machine collaboration