Group Normalization (Paper Explained)

Subscribers: 291,000
Video Link: https://www.youtube.com/watch?v=l_3zj6HeWUE
Duration: 29:06
Views: 24,730
Likes: 953


The dirty little secret of Batch Normalization is its intrinsic dependence on the training batch size. Group Normalization attempts to achieve the benefits of normalization without batch statistics and, most importantly, without sacrificing performance compared to Batch Normalization.

https://arxiv.org/abs/1803.08494

Abstract:
Batch Normalization (BN) is a milestone technique in the development of deep learning, enabling various networks to train. However, normalizing along the batch dimension introduces problems --- BN's error increases rapidly when the batch size becomes smaller, caused by inaccurate batch statistics estimation. This limits BN's usage for training larger models and transferring features to computer vision tasks including detection, segmentation, and video, which require small batches constrained by memory consumption. In this paper, we present Group Normalization (GN) as a simple alternative to BN. GN divides the channels into groups and computes within each group the mean and variance for normalization. GN's computation is independent of batch sizes, and its accuracy is stable in a wide range of batch sizes. On ResNet-50 trained in ImageNet, GN has 10.6% lower error than its BN counterpart when using a batch size of 2; when using typical batch sizes, GN is comparably good with BN and outperforms other normalization variants. Moreover, GN can be naturally transferred from pre-training to fine-tuning. GN can outperform its BN-based counterparts for object detection and segmentation in COCO, and for video classification in Kinetics, showing that GN can effectively replace the powerful BN in a variety of tasks. GN can be easily implemented by a few lines of code in modern libraries.
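
The core operation described in the abstract can be sketched in a few lines, as the paper notes. The following is a minimal NumPy illustration (not the paper's reference code): it splits the C channels of an NCHW tensor into groups and normalizes each (sample, group) slice by its own mean and variance, so no statistics cross the batch dimension. The per-channel learnable scale and shift (gamma, beta) that the full method applies afterwards are omitted here for brevity.

```python
import numpy as np

def group_norm(x, num_groups, eps=1e-5):
    """Group Normalization over an NCHW tensor (NumPy sketch).

    Each sample's channels are split into `num_groups` groups; every
    (sample, group) slice is normalized by its own mean and variance,
    independently of the batch size.
    """
    n, c, h, w = x.shape
    assert c % num_groups == 0, "channels must divide evenly into groups"
    # Reshape so each group forms one axis to reduce over.
    xg = x.reshape(n, num_groups, c // num_groups, h, w)
    mean = xg.mean(axis=(2, 3, 4), keepdims=True)
    var = xg.var(axis=(2, 3, 4), keepdims=True)
    xg = (xg - mean) / np.sqrt(var + eps)
    return xg.reshape(n, c, h, w)

# Works the same for batch size 2 as for batch size 256.
x = np.random.randn(2, 8, 4, 4)
y = group_norm(x, num_groups=4)
```

With `num_groups=1` this reduces to Layer Normalization, and with `num_groups=c` to Instance Normalization, which is why the paper treats both as special cases of GN.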

Authors: Yuxin Wu, Kaiming He

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher




Other Videos By Yannic Kilcher


2020-05-22 When BERT Plays the Lottery, All Tickets Are Winning (Paper Explained)
2020-05-21 [News] OpenAI Model Generates Python Code
2020-05-20 Investigating Human Priors for Playing Video Games (Paper & Demo)
2020-05-19 iMAML: Meta-Learning with Implicit Gradients (Paper Explained)
2020-05-18 [Code] PyTorch sentiment classifier from scratch with Huggingface NLP Library (Full Tutorial)
2020-05-17 Planning to Explore via Self-Supervised World Models (Paper Explained)
2020-05-16 [News] Facebook's Real-Time TTS system runs on CPUs only!
2020-05-15 Weight Standardization (Paper Explained)
2020-05-14 [Trash] Automated Inference on Criminality using Face Images
2020-05-13 Faster Neural Network Training with Data Echoing (Paper Explained)
2020-05-12 Group Normalization (Paper Explained)
2020-05-11 Concept Learning with Energy-Based Models (Paper Explained)
2020-05-10 [News] Google's medical AI was super accurate in a lab. Real life was a different story.
2020-05-09 Big Transfer (BiT): General Visual Representation Learning (Paper Explained)
2020-05-08 Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)
2020-05-07 WHO ARE YOU? 10k Subscribers Special (w/ Channel Analytics)
2020-05-06 Reinforcement Learning with Augmented Data (Paper Explained)
2020-05-05 TAPAS: Weakly Supervised Table Parsing via Pre-training (Paper Explained)
2020-05-04 Chip Placement with Deep Reinforcement Learning (Paper Explained)
2020-05-03 I talk to the new Facebook Blender Chatbot
2020-05-02 Jukebox: A Generative Model for Music (Paper Explained)



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper
batchnorm
groupnorm
layer norm
group norm
batch norm
instance norm
fair
normalization
mean
standard deviation
minibatch
batch statistics
kernel
cnn
convolutional neural network