A Stochastic Max Pooling Strategy for Convolutional Neural Network Trained by Noisy Samples

Shuai Sun, Bin Hu, Zhou Yu, Xiaona Song

Abstract


The deep convolutional neural network (CNN) has made remarkable progress in image classification. However, this network performs poorly and even cannot converge in many actual applications, where the training and test samples contain lots of noises. To solve the problems, this paper puts forward a network training strategy based on stochastic max pooling. Unlike the traditional max pooling, the proposed strategy first ranks all the values in each receptive field, and then selects a random value from the top-n values as the pooling result. Compared with common pooling methods, stochastic max pooling can limit the pooling selection to a larger value that represents the main information of the pooling area which reduces the chance of introducing noises into the network, and enhances the robustness of extracting noisy image features. Experimental results show that the CNN used stochastic max pooling Strategy can converge better than traditional CNN and classified noisy images much more accurately than traditional pooling methods.

Keywords


image classification, deep learning, convolutional neural network (CNN), stochastic max pooling

Full Text:

PDF

References


Benkaddour, M.K.; Bounoua, A. (2017). Feature extraction and classification using deep convolutional neural networks, PCA and SVC for face recognition, Traitement du Signal, 34(1-2), 77-91, 2017.
https://doi.org/10.3166/ts.34.77-91

Boureau, Y.L.; Bach, F.; LeCun, Y.; Ponce, J. (2010). Learning mid-level features for recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2559- 2566, 2010.
https://doi.org/10.1109/CVPR.2010.5539963

Gidaris, S.; Komodakis, N. (2015). Object detection via a multi-region and semantic segmentationaware CNN model, Proceedings of the IEEE International Conference on Computer Vision, 1134- 1142, 2015.
https://doi.org/10.1109/ICCV.2015.135

Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition, 580-587, 2014.
https://doi.org/10.1109/CVPR.2014.81

He, K.; Zhang, X.; Ren, S.; Sun, J. (2016). Deep residual learning for image recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition, 770-778, 2016.
https://doi.org/10.1109/CVPR.2016.90

Krizhevsky, A.; Sutskever, I.; Hinton, G.E. (2012). Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, 25(2), 1097-1105, 2012.

Lakshmipathi, A.N.; Battula, B.P. (2018). Deep convolutional neural networks for product recommendation, Ingénierie des Systèmes d'Information, 23(6), 161-172, 2018.
https://doi.org/10.3166/isi.23.6.161-172

Nair, V.; Hinton, G.E. (2010). Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10), 807-814, 2010.

Neelapu, R.; Devi, G.L.; Rao, K.S. (2018). Deep learning based conventional neural network architecture for medical image classification, Traitement du Signal, 35(2), 169-182, 2018.
https://doi.org/10.3166/ts.35.169-182

Raguram, L.S.B.; Shanmugam, V.M. (2017). Deep belief networks for phoneme recognition in continuous Tamil speech-an analysis, Traitement du Signal, 34(3-4), 137-151, 2017.
https://doi.org/10.3166/ts.34.137-151

Ren, S.; He, K.; Girshick, R.; Sun, J. (2017). Faster R-CNN: towards real-time object detection with region proposal networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137-1149, 2017.
https://doi.org/10.1109/TPAMI.2016.2577031

Simonyan, K.; Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition, arXiv.

Srivastava, N.; Hinton, G.E.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, 15(1), 1929-1958, 2014.

Wen, Y.; Zhang, K.; Li, Z.; Qiao, Y. (2016). A discriminative feature learning approach for deep face recognition, European Conference on Computer Vision, 499-515, 2016.
https://doi.org/10.1007/978-3-319-46478-7_31

Zeiler, M.D.; Fergus, R. (2013). Stochastic pooling for regularization of deep convolutional neural networks, arXiv.




DOI: https://doi.org/10.15837/ijccc.2020.1.3712



Copyright (c) 2020 Shuai Sun, Bin Hu, Zhou Yu, Xiaona Song

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

CC-BY-NC  License for Website User

Articles published in IJCCC user license are protected by copyright.

Users can access, download, copy, translate the IJCCC articles for non-commercial purposes provided that users, but cannot redistribute, display or adapt:

  • Cite the article using an appropriate bibliographic citation: author(s), article title, journal, volume, issue, page numbers, year of publication, DOI, and the link to the definitive published version on IJCCC website;
  • Maintain the integrity of the IJCCC article;
  • Retain the copyright notices and links to these terms and conditions so it is clear to other users what can and what cannot be done with the  article;
  • Ensure that, for any content in the IJCCC article that is identified as belonging to a third party, any re-use complies with the copyright policies of that third party;
  • Any translations must prominently display the statement: "This is an unofficial translation of an article that appeared in IJCCC. Agora University  has not endorsed this translation."

This is a non commercial license where the use of published articles for commercial purposes is forbiden. 

Commercial purposes include: 

  • Copying or downloading IJCCC articles, or linking to such postings, for further redistribution, sale or licensing, for a fee;
  • Copying, downloading or posting by a site or service that incorporates advertising with such content;
  • The inclusion or incorporation of article content in other works or services (other than normal quotations with an appropriate citation) that is then available for sale or licensing, for a fee;
  • Use of IJCCC articles or article content (other than normal quotations with appropriate citation) by for-profit organizations for promotional purposes, whether for a fee or otherwise;
  • Use for the purposes of monetary reward by means of sale, resale, license, loan, transfer or other form of commercial exploitation;

    The licensor cannot revoke these freedoms as long as you follow the license terms.

[End of CC-BY-NC  License for Website User]


INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL (IJCCC), With Emphasis on the Integration of Three Technologies (C & C & C),  ISSN 1841-9836.

IJCCC was founded in 2006,  at Agora University, by  Ioan DZITAC (Editor-in-Chief),  Florin Gheorghe FILIP (Editor-in-Chief), and  Misu-Jan MANOLESCU (Managing Editor).

Ethics: This journal is a member of, and subscribes to the principles of, the Committee on Publication Ethics (COPE).

Ioan  DZITAC (Editor-in-Chief) at COPE European Seminar, Bruxelles, 2015:

IJCCC is covered/indexed/abstracted in Science Citation Index Expanded (since vol.1(S),  2006); JCR2018: IF=1.585..

IJCCC is indexed in Scopus from 2008 (CiteScore2018 = 1.56):

Nomination by Elsevier for Journal Excellence Award Romania 2015 (SNIP2014 = 1.029): Elsevier/ Scopus

IJCCC was nominated by Elsevier for Journal Excellence Award - "Scopus Awards Romania 2015" (SNIP2014 = 1.029).

IJCCC is in Top 3 of 157 Romanian journals indexed by Scopus (in all fields) and No.1 in Computer Science field by Elsevier/ Scopus.

 

 Impact Factor in JCR2018 (Clarivate Analytics/SCI Expanded/ISI Web of Science): IF=1.585 (Q3). Scopus: CiteScore2018=1.56 (Q2);

SCImago Journal & Country Rank

Editors-in-Chief: Ioan DZITAC & Florin Gheorghe FILIP.