Similarity Search for Efficient Active Learning and Search of Rare Concepts | AISC
For slides and more information on the paper, visit https://ai.science/e/similarity-search-for-efficient-active-learning-and-search-of-rare-concepts--2020-11-06
Speaker: Cody Coleman; Host: Omar Nada
Motivation:
Many active learning and search approaches are intractable in industrial settings with billions of unlabeled examples. Existing approaches, such as uncertainty sampling or information density, search globally for the optimal examples to label, scaling linearly or even quadratically with the size of the unlabeled data. In practice, however, data is often heavily skewed: only a small fraction of collected data will be relevant for a given learning task. For example, when identifying rare classes, detecting malicious content, or debugging model performance, the ratio of positive to negative examples can be 1 to 1,000 or more. In this work, we exploit this skew in large training datasets to reduce the number of unlabeled examples considered in each selection round by looking only at the nearest neighbors of the labeled examples (see the sketch below).
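To make the selection loop concrete, here is a minimal sketch (not the authors' code) of one selection round in this style: the candidate pool is restricted to the nearest neighbors of the labeled examples, and uncertainty sampling is run over that pool only. It assumes precomputed embeddings, a binary classifier exposed as a predict_proba function, and scikit-learn's NearestNeighbors standing in for a production similarity-search index such as FAISS; all names and parameters are illustrative.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def seals_round(embeddings, labeled_idx, predict_proba, k=100, budget=10):
    """One selection round restricted to neighbors of labeled examples.

    embeddings    : (N, d) array of fixed feature vectors for the pool
    labeled_idx   : indices of examples that are already labeled
    predict_proba : maps an (m, d) array to (m,) P(positive) scores
    k             : neighbors to retrieve per labeled example
    budget        : number of examples to select this round
    """
    # Index the full pool once; each round only issues cheap k-NN queries
    # for the (small) labeled set rather than scoring all N examples.
    nn = NearestNeighbors(n_neighbors=k).fit(embeddings)
    _, nbrs = nn.kneighbors(embeddings[labeled_idx])

    # Candidate pool = union of neighbors, minus already-labeled points.
    candidates = np.setdiff1d(np.unique(nbrs), labeled_idx)

    # Uncertainty sampling over the candidate pool only: pick the points
    # whose predicted probability is closest to the decision boundary.
    probs = predict_proba(embeddings[candidates])
    uncertainty = np.abs(probs - 0.5)
    return candidates[np.argsort(uncertainty)[:budget]]
```

In a real deployment the index would be built once up front and queried incrementally for newly labeled points, so per-round cost depends on the labeled set size and k rather than on the billions of unlabeled examples.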