Improving Offline Handwritten Digit Recognition Using Concavity-Based Features

Miran Karic, Goran Martinovic

Abstract


This paper examines the benefits of using concavity-based structural features in the recognition of handwritten digits. An overview of existing concavity features is presented and a new method is introduced. These features are used to complement gradient and chaincode features, both of which are among the best-performing features in handwritten digit recognition. Two support vector classifiers (SVCs) are chosen for the classification task as the top performers in previous work: the SVC with a radial basis function (RBF) kernel and the SVC with a polynomial kernel. For reference, we also used the k-nearest neighbor (k-NN) classifier. Results are obtained on the MNIST, USPS and DIGITS datasets. We also tested the dataset independence of the various feature vectors by combining different datasets. The introduced feature extraction method gives the best results in the majority of tests.
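To make the idea of a concavity feature concrete, the following is a minimal sketch of one generic, widely used formulation: each background pixel is labelled by the set of directions (up, down, left, right) in which a ray from it meets ink, and the labels are counted into a histogram. This is an illustration under stated assumptions, not the method introduced in the paper; the function names (`hits_ink`, `concavity_histogram`, `combine`), the 4-direction scheme, and the 16-bin encoding are all hypothetical choices made here.

```python
from typing import List

Image = List[List[int]]  # assumed encoding: 1 = ink (foreground), 0 = background

# Ray directions as (row step, column step): up, down, left, right.
DIRECTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def hits_ink(img: Image, r: int, c: int, dr: int, dc: int) -> bool:
    """Return True if a ray cast from (r, c) in direction (dr, dc) meets ink."""
    rows, cols = len(img), len(img[0])
    r, c = r + dr, c + dc
    while 0 <= r < rows and 0 <= c < cols:
        if img[r][c] == 1:
            return True
        r, c = r + dr, c + dc
    return False


def concavity_histogram(img: Image) -> List[int]:
    """Count background pixels by the set of directions in which ink is found.

    The result is a 16-bin histogram indexed by a 4-bit code
    (bit 0 = up, bit 1 = down, bit 2 = left, bit 3 = right).
    """
    hist = [0] * 16
    for r, row in enumerate(img):
        for c, pixel in enumerate(row):
            if pixel == 1:
                continue  # only background pixels carry concavity information
            code = 0
            for bit, (dr, dc) in enumerate(DIRECTIONS):
                if hits_ink(img, r, c, dr, dc):
                    code |= 1 << bit
            hist[code] += 1
    return hist


def combine(base_features: List[float], concavity: List[int]) -> List[float]:
    """Append concavity counts to an existing (e.g. gradient) feature vector."""
    return list(base_features) + [float(v) for v in concavity]
```

In this sketch a pixel enclosed on all four sides (as in the loop of a "0") falls into bin 15, while a pixel open only at the top (as in a "U" shape) falls into bin 14. The `combine` helper reflects the complementary use described above: the concavity counts are simply concatenated with another feature vector before classification.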

Keywords


Complementary features, concavity features, digit recognition, feature extraction, handwritten character recognition, off-line recognition.






DOI: https://doi.org/10.15837/ijccc.2013.2.303



Copyright (c) 2017 Miran Karic, Goran Martinovic

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL (IJCCC), With Emphasis on the Integration of Three Technologies (C & C & C), ISSN 1841-9836.
