Adversarial Examples Are Not Bugs, They Are Features

Channel: Yannic Kilcher

Subscribers: 291,000

Published on May 14, 2019 1:45:57 PM

Video Link: https://www.youtube.com/watch?v=hMO6rbMAPew

Duration: 40:21

10,432 views


Abstract:
Adversarial examples have attracted significant attention in machine learning, but the reasons for their existence and pervasiveness remain unclear. We demonstrate that adversarial examples can be directly attributed to the presence of non-robust features: features derived from patterns in the data distribution that are highly predictive, yet brittle and incomprehensible to humans. After capturing these features within a theoretical framework, we establish their widespread existence in standard datasets. Finally, we present a simple setting where we can rigorously tie the phenomena we observe in practice to a misalignment between the (human-specified) notion of robustness and the inherent geometry of the data.

Authors: Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran, Aleksander Madry

https://arxiv.org/abs/1905.02175
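
Illustration (not from the paper or the video): the abstract's notion of a feature that is "highly predictive, yet brittle" can be sketched with a two-feature toy problem. Everything in this sketch is an assumption chosen for illustration: the Gaussian data, the feature scales, the helper names (make_data, fit_linear, linf_attack), and the perturbation budget EPS. One feature has a large scale but is only moderately predictive (the "robust" one); the other has a tiny scale but is almost perfectly predictive (the "non-robust" one). A linear classifier fit to both leans on the tiny feature, so a small L-infinity perturbation flips most of its predictions, while a classifier restricted to the large-scale feature is less accurate but barely affected.

```python
import numpy as np

def make_data(n, rng):
    """Toy data: labels in {-1, +1} and two hypothetical features."""
    y = rng.choice([-1.0, 1.0], size=n)
    # "Robust" feature: large scale, moderately predictive.
    robust = 2.0 * y + rng.normal(0.0, 2.0, size=n)
    # "Non-robust" feature: tiny scale, almost perfectly predictive.
    non_robust = 0.05 * y + rng.normal(0.0, 0.02, size=n)
    return np.column_stack([robust, non_robust]), y

def fit_linear(X, y):
    """Linear discriminant: w = cov^{-1} (mu_pos - mu_neg), midpoint bias."""
    mu_pos = X[y > 0].mean(axis=0)
    mu_neg = X[y < 0].mean(axis=0)
    centered = np.vstack([X[y > 0] - mu_pos, X[y < 0] - mu_neg])
    cov = centered.T @ centered / len(X)
    w = np.linalg.solve(cov, mu_pos - mu_neg)
    b = -w @ (mu_pos + mu_neg) / 2.0
    return w, b

def accuracy(w, b, X, y):
    return float(np.mean(np.sign(X @ w + b) == y))

def linf_attack(w, X, y, eps):
    """Worst-case L-infinity perturbation against a linear classifier:
    move every coordinate by eps against the true label along sign(w)."""
    return X - eps * y[:, None] * np.sign(w)[None, :]

rng = np.random.default_rng(0)
X, y = make_data(20000, rng)
EPS = 0.1  # assumed budget: larger than the non-robust feature's scale

# Standard classifier: free to use both features.
w_std, b_std = fit_linear(X, y)
# Hand-built "robust" classifier: uses only the large-scale feature.
w_rob, b_rob = np.array([1.0, 0.0]), 0.0

acc_std_clean = accuracy(w_std, b_std, X, y)
acc_std_adv = accuracy(w_std, b_std, linf_attack(w_std, X, y, EPS), y)
acc_rob_clean = accuracy(w_rob, b_rob, X, y)
acc_rob_adv = accuracy(w_rob, b_rob, linf_attack(w_rob, X, y, EPS), y)

print(f"standard: clean={acc_std_clean:.3f}  adversarial={acc_std_adv:.3f}")
print(f"robust:   clean={acc_rob_clean:.3f}  adversarial={acc_rob_adv:.3f}")
```

In this toy setup the standard classifier should score near-perfect clean accuracy and collapse under attack, while the robust-only classifier gives up some clean accuracy and loses very little. The closed-form attack is exact only because the model is linear; for deep networks an iterative method such as projected gradient descent (PGD, listed in the tags) is typically used instead.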

Other Videos By Yannic Kilcher

2019-08-13	Gauge Equivariant Convolutional Networks and the Icosahedral CNN
2019-08-12	Processing Megapixel Images with Deep Attention-Sampling Models
2019-08-09	Manifold Mixup: Better Representations by Interpolating Hidden States
2019-08-08	Learning World Graphs to Accelerate Hierarchical Reinforcement Learning
2019-08-05	Reconciling modern machine learning and the bias-variance trade-off
2019-07-05	Conversation about Population-Based Methods (Re-upload)
2019-07-03	XLNet: Generalized Autoregressive Pretraining for Language Understanding
2019-06-13	Talking to companies at ICML19
2019-06-12	Population-Based Search and Open-Ended Algorithms
2019-06-10	I'm at ICML19 :)
2019-05-14	Adversarial Examples Are Not Bugs, They Are Features
2019-05-10	Reinforcement Learning, Fast and Slow
2019-05-09	S.H.E. - Search. Human. Equalizer.
2019-05-06	Blockwise Parallel Decoding for Deep Autoregressive Models
2019-04-27	Discriminating Systems - Gender, Race, and Power in AI
2019-02-19	The Odds are Odd: A Statistical Test for Detecting Adversarial Examples
2019-02-18	Neural Ordinary Differential Equations
2019-02-18	GPT-2: Language Models are Unsupervised Multitask Learners
2019-02-02	Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2019-01-30	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019-01-09	What’s in a name? The need to nip NIPS

Tags:

machine learning

deep learning

adversarial examples

adversarial samples

pgd

projected gradient descent

vulnerability

security

artificial intelligence

MIT

geometry

classifier

deep neural network

attack

convolutional neural networks

research

robust features

robust classifier

robust network

neural network
