Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30753
Study of Features for Hand-printed Recognition

Authors: Satish Kumar

Abstract:

The feature extraction method(s) used to recognize hand-printed characters play an important role in ICR applications. In order to achieve high recognition rate for a recognition system, the choice of a feature that suits for the given script is certainly an important task. Even if a new feature required to be designed for a given script, it is essential to know the recognition ability of the existing features for that script. Devanagari script is being used in various Indian languages besides Hindi the mother tongue of majority of Indians. This research examines a variety of feature extraction approaches, which have been used in various ICR/OCR applications, in context to Devanagari hand-printed script. The study is conducted theoretically and experimentally on more that 10 feature extraction methods. The various feature extraction methods have been evaluated on Devanagari hand-printed database comprising more than 25000 characters belonging to 43 alphabets. The recognition ability of the features have been evaluated using three classifiers i.e. k-NN, MLP and SVM.

Keywords: Database, features, classifier, Hand-printed, Devanagari

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1071003

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1317

References:


[1] V. K. Govindan and A. P. Shivaprasad, "Character Recognition - A Review", Pattern Recognition, Vol. 23, No. 7 , 1990.
[2] R. Plamondon and S. N. Srihari, "On-line and Off-line Handwriting Recognition: a Comprehensive Survey", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No.1,pp. 63-84, 2000.
[3] A. L. Koerich, R. Sabourin and C.Y. Suen, "Large Vocabulary Off-line Handwriting Recognition: a Survey", Pattern Analysis Applications, Vol. 6, pp. 97-121, 2003.
[4] N. Arica and T. Yarman-Vural, "An Overview of Character Recognition Focused on Off-line Handwriting", IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews, Vol. 31, No. 2, 2001.
[5] Satish Kumar, "The Headline Removal Algorithm and its Effect on Recognition of Devanagari Handwritten Characters", International Journal of Systemics, Cybernetics and Informatics, April 2009.
[6] U. Pal, B.B. Chaudhuri, "Indian script character recognition: a survey", Pattern Recognition, 37, pp. 1887-1899, 2004.
[7] R. Kirsch, "Computer Determination of the Constituent Structure of Biomedical Images," Computers and Biomedical Research, Vol 4, pp. 315-328, 1971.
[8] J. Birk, R. Kelley, N. Chen and L. Wilson, "Image Feature Extraction using Diameter-Limited Gradient Direction Histograms", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 1, pp. 228-235, 1979.
[9] W. K. Pratt, "Digital Image Processing", Third Edition, Wiley, New York , 2001.
[10] M. Bosker, "Omnidocument Technologies", Proceedings of the IEEE, Vol. 80, No. 7 , 1992.
[11] J. Cao, M. Ahmadi and M. Shridhar, Recognition of Handwritten Numerals with Multiple Feature and Multistage Classifier, Pattern Recognition, Vol. 28, No. 2, pp. 153-160, 1995.
[12] K. M. Kim, J.J. Park, Y.G. Song, I. C. Kim and C. Y. Suen, "Recognition of Handwritten Numerals Using a Combined Classifier with Hybrid Features", SSPR & SPR, LNCS 3138, pp. 992-1000, 2004.
[13] N. Arica and F. T. Yarman-Vural, "Optical Character Recognition for Cursive Handwriting", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, No. 6, 2002.
[14] M. H. Glauberman, "Character Recognition for Business Machines", Electronics, pp. 132-136, 1956.
[15] O. D. Trier, A. K. Jain and T. Taxt, "Feature Extraction Method for Character Recognition - a Survey", Pattern Recognition, Vol. 29, No. 4, pp. 641-662, 1996.
[16] M. Shridhar and A. Badreldin, "Recognition of Isolated and Connected Handwritten Numerals", Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pp. 142-146, 1984.
[17] L. Heutte, T. Paquet, J. Moreau, Y. Lecourtier and C. Olivier, "A Sturctural / Statistical Feature Based Vector for Handwritten Character Recognition", Pattern Recognition Letter, Vol. 19, pp. 629-641, 1998.
[18] C.- L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit Recognition: Benchmarking of State-of-the-Art", Pattern Recognition, No. 36, pp. 2271-2285 , 2003.
[19] L. Koerich, "Large Vocabulary Off-line Handwritten Word Recognition", Ph. D. Thesis, ├ëcole de Technologie Supérieure, Montreal-Canada , 2004.
[20] R. M. Bozinovic and S. N. Srihari, Offline Cursive Script Word Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 11, No. 1, pp. 68-83 , 1989.
[21] A. D. S. Britto Jr., R. Sabourin, E. Lethelier, F. Bortolozzi and C. Y. Suen, "Improvement in Handwritten Numeral String Recognition by Slant Normalization and Contextual Information", Proceedings of the Seventh International Workshop on Frontiers of Handwriting Recognition, Amsterdam-Netherlands, pp. 323-332 , 2000.
[22] E. Kavallieratou, N. Fakotakis and G. Kokkinakis, "Slant Estimation Algorithm for OCR Systems", Pattern Recognition, Vol. 34, pp. 2515- 2522, 2001.
[23] D. Guillevic and C. Y. Suen, "Cursive Script Recognition: A Sentence Level Recognition Scheme", Proceedings of International Workshop on Frontiers of Handwriting Recognition, pp. 216-223 , 1994.
[24] G. Nicchiotti and C. Scagliola, "Generalized Projections: A Tool for Cursive Handwriting Normalization", Proceedings of the Fifth International Conference on Document Analysis and Recognition, Bangalore, India, pp. 729-732 , 1999.
[25] S. Madhvanath, G. Kim and V. Govindaraju, "Chaincode Contour Processing for Handwritten Word Recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 21, No. 9, pp. 928-932 , 1999.
[26] F. Kimura, Y. Miyake, and M. Shridhar, "Handwritten ZIP Code Recognition using Lexicon Free Word Recognition Algorithm", International Conference on Document Analysis and Recognition, Montreal, Que., Canada, pp. 906-910 , 1995.
[27] Y. Wen, Y. Lu and P. Shi, "Handwritten Bangla Numeral Recognition System and its Application to Postal Automation", Pattern Recognition, Vol. 40, pp. 99-107, 2007.
[28] S. Knerr, L. Personnaz and G. Dreyfus, "Handwritten Digit Recognition by Neural Networks with Single-Layer Training", IEEE Transactions on Neural Networks, Vol. 3, No. 6, pp. 962-968, 1992.
[29] S.-B. Cho, "Neural-Network Classifiers for Recognizing Totally Unconstrained Handwritten Numerals", IEEE Transactions on Neural Networks, Vol. 4, No. 1, pp. 43-53, 1997.
[30] L. S. Davis, "Survey of Edge Detection Techniques", Computer Graphics and Image Processing, Vol. 4, pp. 248-270, 1975.
[31] G. Srikantan, S. W. Lam and S.N. Srihari, "Gradient Based Contour Encoding for Character Recognition", Pattern Recognition, Vol. 29, No. 7, pp. 1147-1160, 1996.
[32] C.- L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit Recognition: Benchmarking of State-of-the-Art", Pattern Recognition, No. 36, pp. 2271-2285 , 2003.
[33] H. Liu and X. Ding, "Handwritten Character Recognition using Gradient Feature and Quadratic Classifier with Multiple Discrimination Schemes", Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 19-25, 2005.
[34] H. Fujisawa and C.-L. Liu, "Directional Pattern Matching for Character Recognition Revisited", Proceedings of the Seventh International Conference on Document Analysis and Recognition, pp. 794-798, 2003.
[35] A. Kawamura, et al, "On-line Recognition of Freely Handwritten Japanese Characters using Directional Feature Densities", Proceedings of the Eleventh International Conference on Pattern Recognition, Vol. II, pp. 183-186, 1992.
[36] G. Borgefors, "Distance Transformations in Digital Images", Computer Vision, Graphics and Image Processing, Vol. 34, pp. 344-371, 1986.
[37] A. Negi, C. Bhagvati and B. Krishna, "An OCR System for Telugu", Proceedings of the Sixth International Conference on Document Processing, pp. 1110-1114, 2001.
[38] S. J. Smith, M. O. Bourgoin, K. Sims and H.L. Voorhees, "Handwritten Character Classification using Nearest Neighbor in Large Database", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.16, No. 9, 915-919, 1994.
[39] Zs. M. Kovics and R. Guerrieri, "Massively-Parallel Handwritten Character Recognition Based on the Distance Transform", Pattern Recognition, Vol. 28, No. 3, pp. 293-301, 1995.
[40] II-S. Oh, C.Y. Suen, "Distance Features for Neural Network-based Recognition of Handwritten Characters", International Journal on Document Analysis and Recognition, Vol.1, pp. 73-88, 1998.
[41] H. Freeman, "Computer Processing of Line Drawings", Computing Surveys, Vol. 6, pp. 57-97, 1974.
[42] R. C. Gonzalez and R. E. Woods, "Digital Image Processing", 2nd Ed., Pearson Education , 2002.
[43] F. Kimura and M. Shridhar, Handwritten Numeral Recognition Based on Multiple Algorithms, Pattern Recognition, Vol. 24, No. 10, pp. 969- 983,1991.
[44] A. Rosenfeld and J. L. Pfaltz, "Distance Functions on Digital Pictures", Pattern Recognition, Vol. 1, No. 1, pp. 33-61, 1968.
[45] C.-L. Liu, K. Nakashima, H. Sako and H. Fujisawa, "Handwritten Digit Recognition: Investigation of Normalization and Feature Extraction Techniques", Pattern Recognition, Vol. 37, pp. 265-279, 2004.
[46] N. Arica and F. T. Yarman-Vural, "Optical Character Recognition for Cursive Handwriting", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, No. 6, 2002.
[47] Y. Le Cun, O. Mattan, B. Boser, J.S. Denker et al, "Handwritten Zip Code Recognition with Multilayer Networks", Proceedings of International Conference on Pattern Recognition, Atlantic City, USA, Vol. 2, pp. 35-40, 1990.
[48] P. S. Deshpande, L. Malik and S. Arora, Journal of Computers, Vol. 3, No. 5, pp. 11-17, May 2008.
[49] S. K. Parui and B. Shaw, "Off-line Devanagari Handwritten Word Recognition: An HMM based approach", Proc. PReMI-2007(Springer), LNCS-4815, pp. 528-535, Dec. 2007.
[50] B. Shaw, S. K. Parui and M. Shridhar, "Off-line Handwritten Devanagari Word Recognition: A Segmentation Based Approach", IEEE ,2008.
[51] Satish Kumar, "Performance Comparison of Features on Devanagari Hand-printed Dataset", International Journal of Recent Trends in Engineering, Vol. 1, No. 2, pp. 33-37, May 2009.
[52] Satish Kumar, "Neighborhood Pixels Weights-A New Feature Extractor", International Journal of Computer Theory and Engineering, Vol. 1, No. 6, pp. 69-77, Feb 2010.
[53] Satish Kumar, " A Study of Discrimination Ability of Features on Handprinted Characters in Context to Noisy and Slanted Patterns" , Punjab Institute of Management and Technology(PIMT) Journal of Research, Vol. 2, No. 1, pp. 66-72, March 2009.
[54] Satish Kumar, "Devanagari Hand-printed Character Recognition using Multiple Features and Multi-stage Classifier", International Journal of Computer Information Systems and Industrial Management Applications (IJCISIM), Vol. 2, pp.039-055, 2010.