Application of Genetic Algorithms to Feature Subset Selection in a Farsi OCR
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 33122
Application of Genetic Algorithms to Feature Subset Selection in a Farsi OCR

Authors: M. Soryani, N. Rafat

Abstract:

Dealing with hundreds of features in character recognition systems is not unusual. This large number of features leads to the increase of computational workload of recognition process. There have been many methods which try to remove unnecessary or redundant features and reduce feature dimensionality. Besides because of the characteristics of Farsi scripts, it-s not possible to apply other languages algorithms to Farsi directly. In this paper some methods for feature subset selection using genetic algorithms are applied on a Farsi optical character recognition (OCR) system. Experimental results show that application of genetic algorithms (GA) to feature subset selection in a Farsi OCR results in lower computational complexity and enhanced recognition rate.

Keywords: Feature Subset Selection, Genetic Algorithms, Optical Character Recognition.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1060615

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1982

References:


[1] Oliveira, L. S., Benahmed, N., Sabourin, R., Bortolozzi, F., Suen, C. Y., "Feature Subset Selection Using Genetic Algorithms for Handwritten Digit Recognition" Proc. XIV Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI-01), P.362, 2001.
[2] Yang, J., Honavar, V., "Feature Subset Selection Using a Genetic Algorithm," Proc. IEEE Intelligent Systems, vol. 13, no. 2, pp. 44- 49, 1998.
[3] Sarfraz, M., Nawaz, S., N., Al-Khuraidly A., "Offline Arabic Text Recognition System" Proc. 2003 International Conference on Geometric Modeling and Graphics (GMAG'03), 2003.
[4] Deb, K., "Genetic Algorithm in Search and Optimization: the Technique and Applications" Proc. International Workshop on Soft Computing and Intelligent Systems, pp. 58-87, Calcutta, India, 1998.
[5] Kudo M, Sklansky J. , "Comparison of Algorithms that Select Features for Pattern Classifiers" Pattern Recognition, Vol.33, pp.25-41, 2000.
[6] Kim, G., Kim, S., "Feature Selection Using Genetic Algorithms for Handwritten Character Recognition" Proc. Seventh International Workshop on Frontiers in Handwritten Recognition, Amsterdam, 2000.
[7] Sural, S., Das, P. K., "A Genetic Algorithm for Feature Selection in a Neuro-Fuzzy OCR System" Proc. Sixth International Conference on Document Analysis and Recognition (ICDAR-01), P.0987, 2001.
[8] Morita, M., Sabourin, R., Bortolozzi, F., Suen, C. Y., "Unsupervised Feature Selection Using Multi-Objective Genetic Algorithms for Handwritten Word Recognition " Proc. Seventh International Conference on Document Analysis and Recognition (ICDAR-03), Vol.2, P.666, 2003.
[9] Shi, D., Shu, W., Liu, H., "Feature Selection for Handwritten Chinese Character Recognition Based on Genetic Algorithms" Proc. IEEE Int. Conference on Systems, Man, and Cybernetics, vol. 5, pp. 4201-6, 1998.
[10] Ebrahimi, A., Kabir, E., "A Two Step Method for the Recognition of Printed Subwords", Iranian Journal of Electrical and Computer Engineering, Vol.2, No.2, pp.57-62, 2005 (in Farsi).