A Dictionary Learning Method Based On EMD for Audio Sparse Representation

Yueming Wang; Zenghui Zhang; Rendong Ying; Peilin Liu

Abstract:

Sparse representation has long been studied, and several dictionary learning methods have been proposed. Dictionary learning methods are widely used because they adapt to the data. This paper proposes a new dictionary learning method for audio. Signals are first decomposed into Intrinsic Mode Functions (IMFs) of different degrees using the Empirical Mode Decomposition (EMD) technique. These IMFs then form a learned dictionary. To reduce the size of the dictionary, the K-means method is applied to it to generate a K-EMD dictionary. Compared with the K-SVD algorithm, the K-EMD dictionary decomposes audio signals into structured components; as a result, the sparsity of the representation increases by 34.4% and the SNR of the recovered audio signals increases by 20.9%.

Keywords: Dictionary Learning, EMD, K-means Method, Sparse Representation.
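
The pipeline outlined in the abstract (EMD of each signal into IMFs, pooling the IMFs as dictionary atoms, then K-means to shrink the dictionary) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It rests on several assumptions that are not in the paper: envelopes are interpolated linearly rather than with cubic splines, sifting runs for a fixed number of passes instead of using a stopping criterion, K-means is plain Lloyd iteration, and all function names (`emd`, `kmeans`, `build_kemd_dictionary`) are hypothetical. The sparse coding step and the K-SVD comparison are omitted.

```python
import numpy as np

def local_extrema(x):
    """Return indices of strict interior local maxima and minima."""
    d = np.diff(x)
    maxima = np.where((d[:-1] > 0) & (d[1:] < 0))[0] + 1
    minima = np.where((d[:-1] < 0) & (d[1:] > 0))[0] + 1
    return maxima, minima

def sift_imf(x, n_sift=10):
    """Extract one IMF candidate by repeatedly removing the mean envelope."""
    h = x.copy()
    t = np.arange(len(x))
    for _ in range(n_sift):
        maxima, minima = local_extrema(h)
        if len(maxima) < 2 or len(minima) < 2:
            break
        # Assumption: linear envelopes; classical EMD uses cubic splines.
        upper = np.interp(t, maxima, h[maxima])
        lower = np.interp(t, minima, h[minima])
        h = h - 0.5 * (upper + lower)
    return h

def emd(x, max_imfs=6):
    """Simplified EMD: returns (imfs, residue) with imfs.sum(0) + residue == x."""
    residue = np.asarray(x, dtype=float).copy()
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residue)
        if len(maxima) < 2 or len(minima) < 2:
            break  # residue is (nearly) monotonic, stop decomposing
        imf = sift_imf(residue)
        imfs.append(imf)
        residue = residue - imf
    return np.array(imfs).reshape(len(imfs), len(residue)), residue

def kmeans(rows, k, n_iter=20, seed=0):
    """Plain Lloyd's K-means; returns the k cluster centers."""
    rng = np.random.default_rng(seed)
    centers = rows[rng.choice(len(rows), size=k, replace=False)].copy()
    for _ in range(n_iter):
        dist = ((rows[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            members = rows[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def build_kemd_dictionary(frames, k):
    """Pool unit-norm IMFs from all frames, then reduce them to k atoms."""
    atoms = []
    for frame in frames:
        imfs, _ = emd(frame)
        for imf in imfs:
            norm = np.linalg.norm(imf)
            if norm > 1e-12:
                atoms.append(imf / norm)
    if not atoms:
        raise ValueError("no IMFs could be extracted from the input frames")
    atoms = np.array(atoms)
    centers = kmeans(atoms, min(k, len(atoms)))
    norms = np.linalg.norm(centers, axis=1, keepdims=True)
    return centers / np.maximum(norms, 1e-12)  # unit-norm atoms, one per row

# Usage on synthetic two-tone frames (stand-ins for audio frames).
t = np.arange(256)
demo_frames = [np.sin(2 * np.pi * f * t / 256)
               + 0.5 * np.sin(2 * np.pi * 4 * f * t / 256 + 0.3)
               for f in (5, 7, 11, 13)]
dictionary = build_kemd_dictionary(demo_frames, k=3)
print(dictionary.shape)
```

Each row of the returned array is one dictionary atom. Normalizing the atoms before clustering keeps K-means from grouping IMFs by energy rather than by waveform shape; that is a design choice of this sketch, not something the abstract specifies.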

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1087632
