Improving Offline Handwritten Digit Recognition Using Concavity-Based Features
Keywords:
Complementary features, concavity features, digit recognition, feature extraction, handwritten character recognition, off-line recognition.Abstract
This paper examines benefits of using concavity-based structural features in recognition of handwritten digits. An overview of existing concavity features is presented and a new method is introduced. These features are used as complementary features to gradient and chaincode features, both among the best performing features in handwritten digit recognition. Two support vector classifiers (SVCs) are chosen for classification task as the top performers in previous works; SVC with radial basis function (RBF) kernel and the SVC with polynomial kernel. For reference, we also used the k-nearest neighbor (k-NN) classifier. Results are obtained on MNIST, USPS and DIGITS datasets. We also tested dataset independency of various feature vectors by combining different datasets. The introduced feature extraction method gives the best results in majority of tests.References
C.-L. Liu, K. Nakashima, H. Sako, H. Fujisawa, Handwritten digit recognition: investigation of normalization and feature extraction techniques, Pattern Recognition, 37(2):265-279, 2004. http://dx.doi.org/10.1016/S0031-3203(03)00224-3
C.-L. Liu, K. Nakashima, H. Sako, H. Fujisawa, Handwritten digit recognition: benchmarking of state-of-the-art techniques, Pattern Recognition, 36(10):2271-2285, 2003. http://dx.doi.org/10.1016/S0031-3203(03)00085-2
M. H. Nguyen, F. de la Torre, Optimal feature selection for support vector machines, Pattern Recognition, 43(3):584-591, 2010. http://dx.doi.org/10.1016/j.patcog.2009.09.003
U. Kressel, J. SchĂźrmann, Pattern classification techniques based on function approximation, Handbook of Character Recognition and Document Image Analysis, 49-78, 1997.
B. P. Chacko, P. Babu Anto, Comparison of Statistical and Structural Features for Handwritten Numeral Recognition, Proc. of the Int. Conf. on Computational Intelligence and Multimedia Applications (ICCIMA 2007), Washington, DC (USA), 296-300, 2007. http://dx.doi.org/10.1109/ICCIMA.2007.173
H. Liu, X. Ding, Handwritten Character Recognition Using Gradient Feature and Quadratic Classifier with Multiple Discrimination Schemes, Proc. of the Eighth Int. Conf. on Document Analysis and Recognition (ICDAR '05), Washington, DC (USA), 19-25, 2005.
O. D. Trier, A. K. Jain, T. Taxt, Feature Extraction Methods for Character Recognition - A Survey, Pattern Recognition, 29(4):641-662, 1996. http://dx.doi.org/10.1016/0031-3203(95)00118-2
G. Vamvakas, B. Gatos, I. Pratikakis, N. Stamatopoulos, A. Roniotis, S. J. Perantonis, Hybrid off-line OCR for isolated handwritten Greek characters, Proc. of the Fourth IASTED Int. Conf. on Signal Processing, Pattern Recognition, and Applications, Innsbruck (Austria), 197-202, 2007.
J. Favata, G. Srikantan, S. Srihari, Handprinted character/digit recognition using a multiple feature/resolution philosophy, Fourth International Workshop on Frontiers in Handwriting Recognition, Taipei (Taiwan), 67-70, 1994.
Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-Based Learning Applied to Document Recognition, Proceedings of the IEEE, 86(11):2278-2324, 1998. http://dx.doi.org/10.1109/5.726791
J. J. Hull, A Database for Handwritten Text Recognition Research, Pattern Analysis and Machine Intelligence, 16(5):550-554, 1993. http://dx.doi.org/10.1109/34.291440
A. K. Seewald, Digits - A Dataset for Handwritten Digit Recognition, Austrian Research Institut for Artificial Intelligence Technical Report, Vienna (Austria), 2005.
C. Cortes, V. Vapnik, Support-Vector Networks, Machine Learning, 20(3):273-297, 1995. http://dx.doi.org/10.1007/BF00994018
L. Van der Maaten, A New Benchmark Dataset for Handwritten Character Recognition, Tilburg University Technical Report, 2009.
A. K. Seewald, On the Brittleness of Handwritten Digit Recognition Models, Technical Report, Seewald Solutions, Vienna (Austria), 2009.
M. Karic, Concavity paper source code. [Online] Cited 2011-08-30. Available at: http://www.etfos.hr/mkaric/conc.
R. V. D. Heiden, F. C. A. Gren, The Box-Cox metric for nearest neighbor classification improvement, Pattern Recognition, 30(2):273-279, 1997. http://dx.doi.org/10.1016/S0031-3203(96)00077-5
C. B. Barber, D. P. Dobkin, H. T. Huhdanpaa, The Quickhull Algorithm for Convex Hulls, ACM Trans. on Mathematical Software, 22(4):469-483, 1996. http://dx.doi.org/10.1145/235815.235821
C.-C. Chang, C.-J. Lin, LIBSVM: a library for support vector machines, ACM Trans. on Intelligent Systems and Technology, 2(3): 1-39, 2012. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm.
N. Otsu, A Threshold Selection Method from Gray-Level Histograms. IEEE Trans. on Systems, Man and Cybernetics, 9(1):62-66, 1979. http://dx.doi.org/10.1109/TSMC.1979.4310076
M. R. Gupta, N. P. Jacobson, E. K. Garcia, OCR binarization and image pre-processing for searching historical documents, Pattern Recognition, 40(2):389-397, 2007. http://dx.doi.org/10.1016/j.patcog.2006.04.043
Y. LeCun, The MNIST database of handwritten digits. [Online] Cited 2011-08-30. Available at: http://yann.lecun.com/exdb/mnist.
D. Keysers, Experimental results on the USPS database. [Online] Cited 2011-08-30. Available at: http://www-i6.informatik.rwth-aachen.de/keysers/Pubs/SPR2002/node10.html.
C. E. Rasmussen, C. K. I. Williams, Gaussian Processes for Machine Learning, 2nd ed., The MIT Press, 2006.
Published
Issue
Section
License
ONLINE OPEN ACCES: Acces to full text of each article and each issue are allowed for free in respect of Attribution-NonCommercial 4.0 International (CC BY-NC 4.0.
You are free to:
-Share: copy and redistribute the material in any medium or format;
-Adapt: remix, transform, and build upon the material.
The licensor cannot revoke these freedoms as long as you follow the license terms.
DISCLAIMER: The author(s) of each article appearing in International Journal of Computers Communications & Control is/are solely responsible for the content thereof; the publication of an article shall not constitute or be deemed to constitute any representation by the Editors or Agora University Press that the data presented therein are original, correct or sufficient to support the conclusions reached or that the experiment design or methodology is adequate.