Visual Thing Recognition with Binary Scale-Invariant Feature Transform and Support Vector Machine Classifiers Using Color Information

Wei-Jong Yang; Wei-Hau Du; Pau-Choo Chang; Jar-Ferr Yang; Pi-Hsia Hung

Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 33122

Visual Thing Recognition with Binary Scale-Invariant Feature Transform and Support Vector Machine Classifiers Using Color Information

Authors: Wei-Jong Yang, Wei-Hau Du, Pau-Choo Chang, Jar-Ferr Yang, Pi-Hsia Hung

Abstract:

The demands of smart visual thing recognition in various devices have been increased rapidly for daily smart production, living and learning systems in recent years. This paper proposed a visual thing recognition system, which combines binary scale-invariant feature transform (SIFT), bag of words model (BoW), and support vector machine (SVM) by using color information. Since the traditional SIFT features and SVM classifiers only use the gray information, color information is still an important feature for visual thing recognition. With color-based SIFT features and SVM, we can discard unreliable matching pairs and increase the robustness of matching tasks. The experimental results show that the proposed object recognition system with color-assistant SIFT SVM classifier achieves higher recognition rate than that with the traditional gray SIFT and SVM classification in various situations.

Keywords: Color moments, visual thing recognition system, SIFT, color SIFT.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1132150

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1036

References:

[1] D. Zhang, K. H. Yap, S. Subbhuraam, “Mobile Product Recognition with Efferent Bag-of-Phrase Visual Search,” Communications, Control and Signal Processing (ISCCSP), pp. 65-68, 2014.
[2] W. Zhang, K. H. Yap, D. J. Zhang, Z. W. Miao, “Feature Weighting in Visual Product Recognition”, Proc. of IEEE International Symposium on Circuits and Systems, pp.734-737, 2015.
[3] S.-M. Huang and J.-F. Yang, “Improved Principal Component Regression for Face Recognition under Illumination Variations”, IEEE Signal Processing Letter, vol. 19, no. 4, pp. 179-182, April 2012.
[4] C.-Y. Su and J.-F. Yang, “Histogram of Gradient Phases: A New Local Descriptor for Face Recognition”, IET Computer Vision, vol. 8, no.6, pp.556-567, December 2014.
[5] D. Lowe, “Object Recognition from Local Scale-invariant Features”, Proceedings of the International Conference on Computer Vision,” pp. 1150–1157, 1999.
[6] D. Lowe, “Distinctive Image Features from Scale-invariant Key-points”, International Journal of Computer Vision, vol. 60, no. 2, pp.91–110, 2004.
[7] H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “Speed-Up Robust Features”, in Computer Vision and Image Understanding (CYIU), vol. 110, no. 3, pp. 349-359, 2008.
[8] J. Matas, O. Chum, M. Urban, T. Pajdla, “Robust Wide Baseline Stereo from Maximally Stable Extremal Regions”, Proc. of British Machine Vision Conference, pp. 384-396, 2002.
[9] N. Dalal, B. Triggs, “Histogram of Oriented Gradients for Human Detection”, Proc. of IEEE Conference on Computer Vision and pattern Recognition (CVPR’05), vol. 1, pp. 886-893, 2005.
[10] C. Harris, M. Stephens, “A Combined Corner and Edge Detector”, Proc, of the 4-th Alvey Vision Conference, pp. 147-151, 1988.
[11] R. O. Duda, P. E. Hart, “Use of the Hough Transform Translation to Detect Lines and Curves in Pictures,” Comm. ACM, vol. 15, pp. 11-15, 1972.
[12] J. Shi, C. Tomasi, “Good Feature to Track”, Prof. of IEEE Conference on Computer Vision and Pattern Recognition (CVPR94), pp. 593-600, 1994.
[13] P. M. Panchal, S. R. Panchal, S. K. Shah, “A Comparison of SIFT and SURF”, in International Journal of Innovative Research in computer and Communication Engineering vol. 1, no. 2, pp. 323-327, 2013.
[14] L, Bo, T. Whangbo, “A SIFT-Color Moments Descriptor for Object Recognition”, Proc. of International Conference on IT Convergence and Security (ICITCS), pp. 1-3, 2014.
[15] L. Y. Duan, F. Gao, J, Chen, J. Lin, T, Huang, “Compact Descriptor for Mobile Visual Search and MPEG CDVS Standardization”, Proc. of IEEE International Symposium on Circuits and Systems (ISCAS), pp. 885-888, 2013.
[16] S. Josef, Z. Andrew, “Efficient Visual Search of Videos Cast as Text Retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp.591-605, 2009.
[17] D. Nister and H. Stewenius, “Scalable Recognition with a vocabulary tree” in Proc. of IEEE Conference CVPR, pp. 2161-2168, 2006.
[18] C. C. Chung, C. J. Lin, “LibSVM” http://www.csie.ntu.edu.tw/~cjlin/libsvm/
[19] E. Nowak, F. Jurie, and B. Triggs, “Sampling Strategies for Bag-of-features Image Classification,” in Proc. Eur. Conf. Comput. Vis. (ECCV), pp. 490–503, 2006.
[20] C. Sharat. “Introduction to kd-trees”, University of Maryland Department of Computer Science.
[21] L. Kabbai, A. Azaza, M. Abdellaoui, A, Douik “Image Matching Based on LBP and SIFT Descriptor”, Prof. of IEEE Conference on Systems, Signals & Devices (SSD), pp.1-6, 2015.