Activation gap generators in neural networks

Davel, Marelie H.

Activation gap generators in neural networks

Date

2019-12

Authors

Davel, Marelie H.

Publisher

In Proc. South African Forum for Artificial Intelligence Research (FAIR2019)

Abstract

No framework exists that can explain and predict the generalisation ability of DNNs in general circumstances. In fact, this question has not been addressed for some of the least complicated of neural network architectures: fully-connected feedforward networks with ReLU activations and a limited number of hidden layers. Building on recent work [2] that demonstrates the ability of individual nodes in a hidden layer to draw class specific activation distributions apart, we show how a simplified network architecture can be analysed in terms of these activation distributions, and more specifically, the sample distances or activation gaps each node produces. We provide a theoretical perspective on the utility of viewing nodes as activation gap generators, and define the gap conditions that are guaranteed to result in perfect classification of a set of samples. We support these conclusions with empirical results.

Keywords

Generalization, fully-connected feedforward networks, ac- tivation distributions, MLP

Citation

Marelie H. Davel, “Activation gap generators in neural networks“, In Proc. South African Forum for Artificial Intelligence Research (FAIR2019), pp64-76, Cape Town, South Africa, December 2019.

URI

http://hdl.handle.net/10394/33958

Collections

Faculty of Engineering

Full item page

Activation gap generators in neural networks

Date

Authors

Researcher ID

Supervisors

Journal Title

Journal ISSN

Volume Title

Publisher

Record Identifier

Abstract

Sustainable Development Goals

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By