Vision Based Hand Gesture Recognition Using Generative and Discriminative Stochastic Models
Authors: Mahmoud Elmezain, Samar El-shinawy
Abstract:
Many approaches to pattern recognition are founded on probability theory, and can be broadly characterized as either generative or discriminative according to whether or not the distribution of the image features. Generative and discriminative models have very different characteristics, as well as complementary strengths and weaknesses. In this paper, we study these models to recognize the patterns of alphabet characters (A-Z) and numbers (0-9). To handle isolated pattern, generative model as Hidden Markov Model (HMM) and discriminative models like Conditional Random Field (CRF), Hidden Conditional Random Field (HCRF) and Latent-Dynamic Conditional Random Field (LDCRF) with different number of window size are applied on extracted pattern features. The gesture recognition rate is improved initially as the window size increase, but degrades as window size increase further. Experimental results show that the LDCRF is the best in terms of results than CRF, HCRF and HMM at window size equal 4. Additionally, our results show that; an overall recognition rates are 91.52%, 95.28%, 96.94% and 98.05% for CRF, HCRF, HMM and LDCRF respectively.
Keywords: Statistical Pattern Recognition, Generative Model, Discriminative Model, Human Computer Interaction.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1088742
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2946References:
[1] M. Elmezain, A. Al-Hamadi and B. Michaelis, Real-Time Capable System for Hand Gesture Recognition Using Hidden Markov Models in Stereo Color Image Sequences, Journal of WSCG, Vol. 16, No. 1, pp. 65-72, 2008.
[2] M. Yang, N. Ahuja and M. Tabb, Extraction of 2D Motion Trajectories and its Application to Hand Gesture Recognition, EEE Trans. on PAMI, Vol. 24, No. 8, pp. 1061-1074, 2002.
[3] Yang, S. Sclaroff and S. Lee, Sign Language Spotting with a Threshold Model Based on Conditional Random Fields, IEEE Trans. on PAMI, Vol. 31(7), pp. 1264-1277, 2009.
[4] C. Sminchisescu, A. Kananujia and D. Metaxas, Conditional Models for Contextual Human Motion Recognition, Journal of CVIU, Vol.104, No. 2, pp. 210-220, 2006.
[5] J. Lafferty, A. McCallum and F. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling sequence Data, Conf. on ICML, pp. 282-289, 2001.
[6] L. P. Morency, A. Quattoni and T. Darrell, Latent-Dynamic Discriminative Models for Continuous Gesture Recognition, IIEEE Conf. on CVPR, pp. 1-8, 2007.
[7] X. Deyou, A Network Approach for Hand Gesture Recognition in Virtual Reality Driving Training System of SPG, Conf. on ICPR, pp.519-522, 2006.
[8] M. Elmezain, A. Al-Hamadi, and B. Michaelis, Spatio- Temporal Feature Extraction-Based Hand Gesture Recognition for Isolated American Sign Language and Arabic Numbers, IEEE Symposium on ISPA, pp. 254-259, 2009.
[9] R. R. Lawrence, A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceeding of the IEEE, Vol.77(2), pp. 257-286, 1989.
[10] K. Takahashi, S. Sexi and R. Oka, Spotting Recognition of Human Gestures From Motion Images, In Technical Report IE92-134, pp.9-16, 1992.
[11] A. Gunawardana, M. Mahajan, A. Acero and J. C .Platt, Hidden Conditional Random Fields for Phone Classification, Proceeding of European Conf. on Speech Communication and technology, pp. 1117-1120, 2005.
[12] A. Quattoni, S. Wang, L. P. Morency, M. Collins and T. Darrell, Hidden Conditional Random Fields,EEE Trans. on PAMI, Vol. 29, No.10, pp. 1848-1852, 2007.
[13] L. Rabiner, A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceedings of the IEEE, Vol. 77, No. 2, pp. 257-286, 1989.
[14] X. D. Huang, Y. Ariki, and M. Jack, Hidden Markov Models for Speech Recognition, Edinburgh University Press, 1990.
[15] S. Goronzy, HRobust Adaptation to Non-Native Accents in Automatic Speech Recognition, Lecture Notes in Computer Sciences, Springer, ISBN-13: 978- 540003250, 2002.
[16] M. Elmezain, A. Al-Hamadi, and B. Michaelis, Discriminative Models-Based Hand Gesture Recognition, International Conference on Machine Vision, pp. 123-127, 2009.
[17] A. McCallum, D. Freitag, and F. Pereira, Maximum Entropy Markov Models for Information Extraction and Segmentation,International Conference on Machine Learning, pp. 591-598, 2000.
[18] J. Lafferty, A. McCallum, and F. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling sequence Data, International Conference on ICML, pp. 282-289, 2001.
[19] A. McCallum, Efficiently Inducing Features of Conditional Random Fields, Conf. on Uncertainty in AI, 2003.
[20] L. P. Morency, A. Quattoni, C. M. Christoudias, and S. Wang, Hidden-state Conditional Random Field Library, EVersion 1.3c, http://pt.sourceforge.jp/projects/sfnet hcrf/,” 2008.