Similarity Search for Efficient Active Learning and Search of Rare Concepts | AISC

Video Link: https://www.youtube.com/watch?v=vRVyOEK2JUU



Duration: 44:13
450 views


For slides and more information on the paper, visit https://ai.science/e/similarity-search-for-efficient-active-learning-and-search-of-rare-concepts--2020-11-06

Speaker: Cody Coleman; Host: Omar Nada

Motivation:
Many active learning and search approaches are intractable in industrial settings with billions of unlabeled examples. Existing approaches, such as uncertainty sampling or information density, search globally for the optimal examples to label, so their cost scales linearly or even quadratically with the size of the unlabeled data. In practice, however, data is often heavily skewed: only a small fraction of the collected data is relevant to a given learning task. For example, when identifying rare classes, detecting malicious content, or debugging model performance, the ratio of positive to negative examples can be 1 to 1,000 or more. In this work, we exploit this skew in large training datasets to reduce the number of unlabeled examples considered in each selection round by looking only at the nearest neighbors of the labeled examples.
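The neighbor-restricted selection described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `candidate_pool` and `select_most_uncertain`, the parameter `k`, and the random embeddings and probabilities are all hypothetical, and the brute-force NumPy distance computation is only a stand-in for the similarity-search index one would need at real scale. The point is that each round scores only the neighbors of the labeled set rather than the whole unlabeled pool.

```python
import numpy as np


def candidate_pool(embeddings, labeled_idx, k):
    """Return unlabeled indices that are among the k nearest neighbors
    of at least one labeled example (hypothetical helper)."""
    labeled_idx = np.asarray(labeled_idx)
    # Brute-force squared Euclidean distances from each labeled point to
    # every point. Assumption: a real system would query a similarity-search
    # index here instead of materializing this matrix.
    diffs = embeddings[labeled_idx][:, None, :] - embeddings[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    # Push already-labeled points to the end of every neighbor ranking.
    dists[:, labeled_idx] = np.inf
    neighbors = np.argsort(dists, axis=1)[:, :k]
    # Union of all neighbor lists, minus anything already labeled.
    return np.setdiff1d(neighbors, labeled_idx)


def select_most_uncertain(probs, candidates, batch_size):
    """Uncertainty sampling restricted to the candidate pool: pick the
    candidates whose predicted positive probability is closest to 0.5."""
    distance_from_boundary = np.abs(probs[candidates] - 0.5)
    order = np.argsort(distance_from_boundary, kind="stable")
    return candidates[order[:batch_size]]


# Toy usage with made-up data (embeddings and model scores are random).
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(1000, 16))
labeled = [0, 1, 2]
pool = candidate_pool(embeddings, labeled, k=10)
probs = rng.uniform(size=1000)  # stand-in for a model's predicted probabilities
picked = select_most_uncertain(probs, pool, batch_size=5)
print(len(pool), picked)
```

In this toy run the selection step scores at most 30 candidates (3 labeled examples times 10 neighbors) instead of all 1,000 points, which is the kind of reduction the description refers to.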




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-11-24 Interpretable Neural Networks for Panel Data Analysis in Economics | AISC
2020-11-23 Meet the MLOps Experts, a fireside chat for newcomers to AI careers
2020-11-23 Nodes, Edges and Properties; Graph Analysis Intro for ML Newcomers
2020-11-20 Operationalizing AI in Business at Scale | AISC
2020-11-19 Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT | AISC
2020-11-18 Augmented Out-of-sample Comparison Method for Time Series Forecasting Techniques | AISC
2020-11-13 Generating Ampicillin-Level Antimicrobial Peptides with Activity-Aware Generative Adversarial Net
2020-11-12 Disparate Interactions: An Algorithms-in-the-loop Analysis of Fairness in Risk Assessment | AISC
2020-11-11 [GATA] Learning Dynamic Belief Graphs to Generalize on Text-Based Games | AISC
2020-11-11 DrWhy.AI - Tools for Explainable Artificial Intelligence | AISC
2020-11-06 Similarity Search for Efficient Active Learning and Search of Rare Concepts | AISC
2020-11-06 Identifying Influential Individuals in Complex Networks: An Overview | AISC
2020-11-05 Da Xu (Walmart Labs): Inductive Representation Learning on Temporal Graphs | AISC
2020-11-04 AI, Democracy, & Disinformation
2020-10-29 Application of Bayesian neural networks for aircraft safety | AISC
2020-10-28 AI and ML toward Telcom future | AISC
2020-10-27 Investing in Emerging Technology & The Nuts & Bolts of How to Raise Money for your Startup | AISC
2020-10-23 The People, Politics, & Histories Behind Machine Learning Datasets | AISC
2020-10-23 Detecting and Correcting Unfairness in Machine Learning | AISC
2020-10-23 Highly Recommended: A Fireside Chat with AISC's Resident Experts on Recommender Systems
2020-10-23 A Fireside Chat with AISC NLP experts