NWU Institutional Repository

ReLU and sigmoidal activation functions

dc.contributor.author: Pretorius, Arnold M.
dc.contributor.author: Barnard, Etienne
dc.contributor.author: Davel, Marelie H.
dc.date.accessioned: 2020-01-27T13:27:30Z
dc.date.available: 2020-01-27T13:27:30Z
dc.date.issued: 2019-12
dc.description.abstract: The generalization capabilities of deep neural networks are not well understood, and in particular, the influence of activation functions on generalization has received little theoretical attention. Phenomena such as vanishing gradients, node saturation and network sparsity have been identified as possible factors when comparing different activation functions [1]. We investigate these factors using fully connected feedforward networks on two standard benchmark problems, and find that the most salient differences between networks with sigmoidal and ReLU activations relate to the way that class-distinctive information is propagated through a network. [en_US]
dc.identifier.citation: Arnold M. Pretorius, Etienne Barnard and Marelie H. Davel, “ReLU and sigmoidal activation functions”, In Proc. South African Forum for Artificial Intelligence Research (FAIR2019), pp. 37-48, Cape Town, South Africa, December 2019. [en_US]
dc.identifier.issn: 1613-0073
dc.identifier.uri: http://hdl.handle.net/10394/33957
dc.language.iso: en [en_US]
dc.publisher: In Proc. South African Forum for Artificial Intelligence Research (FAIR2019) [en_US]
dc.subject: Non-linear activation function [en_US]
dc.subject: Generalization [en_US]
dc.subject: Activation distribution [en_US]
dc.subject: Sparsity [en_US]
dc.title: ReLU and sigmoidal activation functions [en_US]
dc.type: Other [en_US]
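
The abstract describes comparing fully connected feedforward networks that differ only in their activation function (sigmoidal vs. ReLU) and examining phenomena such as node saturation and sparsity. The sketch below is a minimal, hypothetical illustration of that kind of comparison, not the authors' code: it builds two otherwise identical multilayer perceptrons in PyTorch and reports the fraction of near-zero and saturated hidden activations on random stand-in inputs. The layer sizes, depth, thresholds, and the use of random data in place of the paper's two benchmark problems are all assumptions.

    import torch
    import torch.nn as nn


    def make_mlp(act_cls, in_dim=784, hidden=256, depth=3, out_dim=10):
        """Fully connected feedforward network with a chosen non-linearity."""
        layers, dim = [], in_dim
        for _ in range(depth):
            layers += [nn.Linear(dim, hidden), act_cls()]
            dim = hidden
        layers.append(nn.Linear(dim, out_dim))
        return nn.Sequential(*layers)


    def hidden_activations(model, x):
        """Collect the output of every non-linearity as x passes through the net."""
        acts, h = [], x
        for layer in model:
            h = layer(h)
            if isinstance(layer, (nn.ReLU, nn.Sigmoid)):
                acts.append(h.detach())
        return acts


    torch.manual_seed(0)
    x = torch.randn(512, 784)  # random stand-in for a batch of benchmark inputs

    for name, net in [("ReLU", make_mlp(nn.ReLU)), ("sigmoid", make_mlp(nn.Sigmoid))]:
        acts = hidden_activations(net, x)
        # Sparsity: fraction of hidden units that are (near-)zero after the
        # non-linearity. Saturation (meaningful for the sigmoid's (0, 1) range):
        # fraction of units pushed towards the extremes.
        zero_frac = [float((a <= 1e-3).float().mean()) for a in acts]
        sat_frac = [float(((a <= 0.05) | (a >= 0.95)).float().mean()) for a in acts]
        print(name, "near-zero per layer:", zero_frac, "saturated per layer:", sat_frac)

Such a probe only surfaces the surface-level differences (dead ReLU units, saturated sigmoid units); the paper's argument concerns how class-distinctive information propagates through the layers, which would require analysing trained networks on the actual benchmark data.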

Files

License bundle

Name: license.txt
Size: 1.61 KB
Format: Item-specific license agreed upon to submission