Fast Search for MPEG Video Clips Using Adjacent Pixel Intensity Difference Quantization Histogram Feature
In this paper, we propose a novel fast search algorithm for short MPEG video clips from video database. This algorithm is based on the adjacent pixel intensity difference quantization (APIDQ) algorithm, which had been reliably applied to human face recognition previously. An APIDQ histogram is utilized as the feature vector of the frame image. Instead of fully decompressed video frames, partially decoded data, namely DC images are utilized. Combined with active search , a temporal pruning algorithm, fast and robust video search can be realized. The proposed search algorithm has been evaluated by 6 hours of video to search for given 200 MPEG video clips which each length is 15 seconds. Experimental results show the proposed algorithm can detect the similar video clip in merely 80ms, and Equal Error Rate (ERR) of 3 % is achieved, which is more accurately and robust than conventional fast video search algorithm.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1070425Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1227
 K. Kashino, T. Kurozumi, and H. Murase, "Quick AND/OR search for multimedia signals based on histogram features", IEICE Trans., J83-D-II, vol.12, 2000, pp. 2735-2744.
 S.S. Cheung and A. Zakhor, "Efficient video similarity measurement with video signature", IEEE Trans. on Circuits and System for Video Technology, vol.13, no.1, 2003, pp. 59-74.
 A. Hampapur, K. Hyun, and R. Bolle, "Comparison of sequence matching techniques for video copy detection", SPIE. Storage and Retrieval for Media Databases 2002, 4676, San Jose, CA, USA, 2002, pp. 194-201.
 V.V. Vinod, H. Murase, "Focused color intersection with efficient searching for object extraction", Pattern Recognition, vol. 30, no.10, 1997, pp. 1787-1797.
 K. Kotani, F.F. Lee, Q. Chen, and T. Ohmi, "Face recognition based on the adjacent pixel intensity difference quantization histogram method", Proc. 2003 Int. Symposium on Intelligent Signal Processing and Communication Systems, D7-4, Japan, 2003, pp. 877-880.
 AT&T Laboratories Cambridge, The Database of Faces, at http://www.cl. cam.ac.uk/research/dtg/attarchive/facedatabase.html.
 L. Agnihotre, N. Dimitrova, T. McGee, S. Jeannin, S. Schaffer, J. Nesvadba, "Evolvable visual commercial detector", IEEE. International Conference on Computer Vision and Pattern Recognition, vol. 2, 2003, pp. 79-84.
 R. Lienhart, C. Kuhmunch, W. Effelsberg, "On the detection and recognition of television commercials", In Proc. IEEE Conf. on Multimedia Computing and Systems, 1997, pp. 509-516.
 R. Mohan, "Video sequence matching", In Proc. of the International Conference on Audio, Speech and Signal Processing, vol.6, 1998, pp. 3679-3700.
 B. Yeo and B. Liu, "Rapid scene analysis on compressed videos", IEEE Trans. on Circuits and Systems for Video Technology, vol.5, no.6, 1995, pp. 533-544.