Parametric studies of translation invariance and distortion robustness in Convolutional Neural Networks
Abstract
Although Convolutional Neural Networks (CNNs) are widely used, their translation invariance (ability to deal with translated inputs) is still subject to some controversy. We explore this question using translation-sensitivity maps to quantify how sensitive a standard CNN is to a translated input. We propose the use of cosine similarity as a sensitivity metric over Euclidean distance, and discuss the importance of restricting the dimensionality of either of these metrics when comparing architectures.

Our main focus is to investigate the effect of different architectural components of a standard CNN on that network's sensitivity to translation. To study the effect of max-pool kernel size on translation invariance, we train several CNN architectures with differently shaped max-pool kernels and compare their translation invariance. The results indicate that larger max-pool kernels result in more translation invariance than smaller ones. By varying convolutional kernel sizes and amounts of zero padding, we control the size of the feature maps produced, allowing us to quantify the extent to which these elements influence translation invariance. We also measure translation invariance at different locations within the CNN to determine the extent to which the convolutional and fully connected layers, respectively, contribute to the translation invariance of the CNN as a whole. Our analysis indicates that both convolutional kernel size and feature map size have a systematic influence on translation invariance. We also find that convolutional layers contribute less than expected to translation invariance when not specifically forced to do so.

The effects of various CNN components on distortion sensitivity are also analysed in this study, and we examine the differences between how CNNs deal with translation and distortion. Using distortion-sensitivity functions that we define, we are able to quantify how sensitive a network is to distorted inputs. The results indicate that larger max-pool kernels result in more distortion sensitivity for CNNs trained on MNIST, paralleling their effect on translation invariance. Convolutional kernel size has less of an effect on distortion sensitivity for CNNs trained on MNIST, while all networks, regardless of their architectural variations, learn to be less sensitive to distortion when trained on CIFAR10.

All in all, it seems that convolutional layers are not fully utilised to deal with spatial information if the training task is not difficult enough, forcing the (spatially unaware) fully connected layers to compensate by learning similar elements at different input locations. By reducing the feature map size to 1, thereby forcing the convolutional layers to better deal with translation, we obtain the most translation-invariant system studied, when evaluated on the unseen test set. We also observe a few similarities in how CNNs deal with translation and distortion, and attribute these similarities to the network's ability to pinpoint important features in general, rather than specifically to the movement of the kernel across the input during convolution.
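As a concrete illustration of the translation-sensitivity maps described above, the following is a minimal PyTorch sketch. It assumes a model whose forward pass returns the feature vector to be compared; the function name translation_sensitivity_map, the max_shift parameter, and the use of circular shifts via torch.roll are illustrative assumptions, not the thesis's exact procedure (which would more plausibly use shifted crops or zero-padded translations).

```python
import torch
import torch.nn.functional as F

def translation_sensitivity_map(model, image, max_shift=8):
    """Cosine-similarity-based translation-sensitivity map (sketch).

    For each (dy, dx) shift, compare the model's feature vector for the
    shifted image against the unshifted baseline. Values near 1 indicate
    translation invariance; lower values indicate sensitivity.
    """
    model.eval()
    with torch.no_grad():
        # Baseline features for the unshifted input; image is (C, H, W).
        base = model(image.unsqueeze(0)).flatten(1)
        size = 2 * max_shift + 1
        smap = torch.zeros(size, size)
        for i, dy in enumerate(range(-max_shift, max_shift + 1)):
            for j, dx in enumerate(range(-max_shift, max_shift + 1)):
                # Circular shift used here for simplicity; an assumption,
                # not necessarily the translation scheme used in the thesis.
                shifted = torch.roll(image, shifts=(dy, dx), dims=(-2, -1))
                feats = model(shifted.unsqueeze(0)).flatten(1)
                smap[i, j] = F.cosine_similarity(base, feats, dim=1).item()
    return smap
```

Plotting smap as a heat map over (dy, dx) yields the translation-sensitivity map; a Euclidean-distance variant would replace the cosine similarity, subject to the abstract's caveat about restricting dimensionality when comparing architectures.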