Positional normalization

Publikation: Bidrag til tidsskrift › Konferenceartikel › Forskning › fagfællebedømt

Boyi Li
Felix Wu
Kilian Q. Weinberger
Belongie, Serge

A popular method to reduce the training time of deep neural networks is to normalize activations at each layer. Although various normalization schemes have been proposed, they all follow a common theme: normalize across spatial dimensions and discard the extracted statistics. In this paper, we propose an alternative normalization method that noticeably departs from this convention and normalizes exclusively across channels. We argue that the channel dimension is naturally appealing as it allows us to extract the first and second moments of features extracted at a particular image position. These moments capture structural information about the input image and extracted features, which opens a new avenue along which a network can benefit from feature normalization: Instead of disregarding the normalization constants, we propose to re-inject them into later layers to preserve or transfer structural information in generative networks.

Originalsprog	Engelsk
Tidsskrift	Advances in Neural Information Processing Systems
Vol/bind	32
ISSN	1049-5258
Status	Udgivet - 2019
Eksternt udgivet	Ja
Begivenhed	33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019 - Vancouver, Canada Varighed: 8 dec. 2019 → 14 dec. 2019

Konference

Konference	33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019
Land	Canada
By	Vancouver
Periode	08/12/2019 → 14/12/2019
Sponsor	Citadel, Doc.AI, et al., Lambda, Lyft, Microsoft Research

Bibliografisk note

ID: 301823632

Datalogisk Institut