Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30184
Application of a Novel Audio Compression Scheme in Automatic Music Recommendation, Digital Rights Management and Audio Fingerprinting

Authors: Anindya Roy, Goutam Saha


Rapid progress in audio compression technology has contributed to the explosive growth of music available in digital form today. In a reversal of ideas, this work makes use of a recently proposed efficient audio compression scheme to develop three important applications in the context of Music Information Retrieval (MIR) for the effective manipulation of large music databases, namely automatic music recommendation (AMR), digital rights management (DRM) and audio finger-printing for song identification. The performance of these three applications has been evaluated with respect to a database of songs collected from a diverse set of genres.

Keywords: Audio compression, Music Information Retrieval, Digital Rights Management, Audio Fingerprinting.

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1201


[1] Rainer Typke, Frans Wiering, Remco C. Veltkamp, "A Survey of Music Information Retrieval Systems," International Symposium on Music Information Retrieval, 2006.
[2] A. Roy, G. Saha, "Compression using Joint Optimization based on Signal Statistics and Quantization Noise", Elesevier Computers and Electrical Engineering. Submitted for publication.
[3] Shankar Vembu and Stephan Baumann, "A Self-Organizing Map Based Knowledge Discovery for Music Recommendation Systems", Lecture notes in Computer Science - Computer Music Modeling and Retrieval, Springer Berlin /Heidelberg, vol. 3310/2005, pp. 119 -129.
[4] François Pachet and Pierre Roy, "Automatic Generation of Music Programs", Lecture notes in Computer Science - Principles and Practice of Constraint Programming CP99, Springer Berline / Heidelberg, vol. 1713/2004, pp. 331 - 345.
[5] François Pachet and J.-J.Aucouturier, "Scaling up music playlist generation", Proceedings of IEEE International Conference on Multimedia and Expo, 2002, vol. 1, pp. 105- 108.
[6] Qiong Liu, Reihaneh Safavi-Naini and Nicholas Paul Sheppard, "Digital rights management for content distribution" Proceedings of the Australasian information security workshop conference on ACSW frontiers 2003, vol. 21, pp. 49-58..
[7] Frank Hartung and Friedhelm Ramme, "Digital rights management and watermarking of multimedia content for M-commerce applications", IEEE Communication Magazine, vol. 38, no. 11, Nov. 2000, pp. 78-84.
[8] Jong Won Seok and Jin Woo Hong, "Audio watermarking for copyright protection of digital audio data", IEEE Electronics Letters, vol. 37, no. 1, Jan. 2001, pp. 60-61.
[9] P. Cano et al., "Audio Fingerprinting: Concepts And Applications", Studies in Computational Intelligence (SCI) 2, Springer-Verlag 2005, pp.233-245.
[10] Christopher J.C. Burges, Daniel Plastina, John C. Platt, Erin Renshaw, and Henrique S. Malvar, "Using Audio Fingerprinting for duplicate detection and thumbnail generation", International Conference on Audio, Speech and Signal Processing, 2005.
[11] Jason Freeman, "Fast Generation of Audio Signatures to Describe iTunes Libraries", Journal of New Music Research 2006, vol. 35, no. 1, pp. 51-61.
[12] Michail K. Tsatsanis and Georgios B. Giannakis, "Principal Component Filter Banks for Optimal Multiresolution Analysis", IEEE Trans. on Signal Proc., vol. 43, no.8, Aug.1995.
[13] P.Nasiopoulos, M.Yedlin and R.K.Ward, "A high performance fixedlength compression method using the Karhunen-Loeve transform", IEEE Trans. on Consumer Elec., vol. 41, no. 4, Nov. 1995, pp. 1189-1196.
[14] Anil K. Jain, "A Fast Karhunen Loeve Transform for a class of Random Processes", IEEE Trans. on Communications, vol. 24, no. 9, Sept. 1976, pp. 1023-1029.
[15] M. Loève, "Probability Theory - I", New York, Springer-Verlag, 1963.
[16] W. Kinsner, "Compression and Its Metrics for Multimedia", Proceedings of the First IEEE International Conference on Cognitive Informatics (ICCI02), pp. 1-15.
[17] Ahmed H. Tewfik, Deepen Sinha, and Paul Jorgensen, "On the Optimal Choice of a Wavelet for Signal Representation", IEEE Trans. on Info. Theory, vol. 38, no. 2, March 1992.
[18] Deepen Sinha and Ahmed H. Tewfik, "Low Bit Rate Transparent Audio Compression using Adapted Wavelets", IEEE Trans. on Signal Proc., vol. 41, no. 12, Dec. 1993.
[19] Jeff B. Burl, "Estimating the Basis Functions of the Karhunen-Loève Transform", IEEE Trans. on ASSP, vol. 37, no. 1, Jan. 1989.
[20] D. Pan, "A tutorial on MPEG/audio compression", Multimedia, IEEE, vol. 2, no. 2, Summer 1995.
[21] David Salomon, "Data Compression - The Complete Reference", Springer-Verlag, Third Edition, 2004.
[22] Masahiro Nakagawa and Makoto Miyahara, "Generalized Karhunen- Loeve Transformation I (Theoretical Consideration)", IEEE Trans. on Communications, vol. COM-35, no. 2, Feb. 1987.
[23] Yingbo Hua and Wanquan Liu, "Generalized Karhunen - Loeve Transform", IEEE Signal Proc. Letters, vol. 5, no. 6, June 1998.
[24] Gregory W. Wornell, "A Karhunen-Loève like Expansion for 1/f Proceses via Wavelets", IEEE Trans. on Information Theory, vol.36, no.4, July 1990, pp.859-861.
[25] N.J.Jayant and P.Noll, "Digital Coding of Waveforms", Prentice Hall, Engleood Clffs, NJ, 1984.
[26] Bryan E. Usevitch, "A Tutorial on Modern Lossy Wavelet Image Compression: Foundations of JPEG 2000", IEEE Signal Proc. Magazine, Sept. 2001, pp.22 - 35.
[27] Yukihiko Yamashita and Hidemitsu Ogawa, "Relative Karhunen-Lohe Transform", IEEE Trans. on Signal Proc., vol. 44, no. 2, Feb.1996.
[28] Louis L.Scharf and John K. Thomas, "Wiener Filters in Canonical Coordinates for Transform Coding, Filtering and Quantizing", IEEE Trans. on Signal Proc., vol. 46, no. 3, March 1998.