Using unsupervised machine learning to uncover hidden scientific knowledge | AISC

Published on ● Video Link: https://www.youtube.com/watch?v=ej5Q17UN2Uo



Duration: 59:34
740 views
22


Speaker(s): Vahe Tshitoyan
Facilitator(s): Shahrzad Hosseini

Find the recording, slides, and more info at https://ai.science/e/using-unsupervised-machine-learning-to-uncover-hidden-scientific-knowledge--vpGhUEmJdpROAt5R7T4Q

Motivation / Abstract
The overwhelming majority of scientific knowledge is published as text, which is difficult to analyse by either traditional statistical analysis or modern machine learning methods. By contrast, the main source of machine-interpretable data for the materials research community has come from structured property databases1,2, which encompass only a small fraction of the knowledge present in the research literature. Beyond property values, publications contain valuable knowledge regarding the connections and relationships between data items as interpreted by the authors. To improve the identification and use of this knowledge, several studies have focused on the retrieval of information from scientific literature using supervised natural language processing3,4,5,6,7,8,9,10, which requires large hand-labelled datasets for training. Here we show that materials science knowledge present in the published literature can be efficiently encoded as information-dense word embeddings11,12,13 (vector representations of words) without human labelling or supervision. Without any explicit insertion of chemical knowledge, these embeddings capture complex materials science concepts such as the underlying structure of the periodic table and structure–property relationships in materials. Furthermore, we demonstrate that an unsupervised method can recommend materials for functional applications several years before their discovery. This suggests that latent knowledge regarding future discoveries is to a large extent embedded in past publications. Our findings highlight the possibility of extracting knowledge and relationships from the massive body of scientific literature in a collective manner, and point towards a generalized approach to the mining of scientific literature.




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-05-28Unifying machine learning and quantum chemistry with a deep neural network | AISC
2020-05-27Model Selection for Optimal Prediction in Statistical Learning - Part 2 / 2 | AISC
2020-05-26Representation Learning of Histopathology Images using Graph Neural Networks | AISC
2020-05-26BillionX acceleration using AI Emulators | AISC
2020-05-22Machine Learning Methods for High Throughput Virtual Screening with a focus on Organic Photovoltaics
2020-05-21Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer
2020-05-20Reinforcement Learning for Batch-to-Batch Bioprocess Optimisation | AISC
2020-05-20Leaf Doctor: Plant Disease Detection Using Image Classification | Deep Learning Workshop Capstone
2020-05-20News ScanNER: Entity Tagging in News Headlines | Deep Learning Workshop Capstone
2020-05-19New methods for identifying latent manifold structure from neural data | ASIC
2020-05-19Using unsupervised machine learning to uncover hidden scientific knowledge | AISC
2020-05-15FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
2020-05-14A Literature Review on ML in Climate Science | AISC
2020-05-13[cnvrg.io] Operating System for Machine Learning | AISC
2020-05-12Tobias Pfaff (DeepMind): Learning to Simulate Complex Physics with Graph Networks
2020-05-07Multi Type Mean Field Reinforcement Learning | AISC
2020-05-07Proving the Lottery Ticket Hypothesis: Pruning is All You Need | AISC Livestream with the Author
2020-05-06Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthe
2020-05-06A Literature Review on Deep Learning in Finance | AISC
2020-05-02COVID19 and AI: Ethics and Data Rights Panel | AISC & NYAI
2020-04-30A Literature Review on Interpretability for Machine Learning | AISC