A Dictionary Learning Method Based On EMD for Audio Sparse Representation
Authors: Yueming Wang, Zenghui Zhang, Rendong Ying, Peilin Liu
Abstract:
Sparse representation has long been studied, and several dictionary learning methods have been proposed. Dictionary learning methods are widely used because they adapt to the signals they are trained on. This paper proposes a new dictionary learning method for audio. Signals are first decomposed into Intrinsic Mode Functions (IMFs) of different degrees using the Empirical Mode Decomposition (EMD) technique. These IMFs then form a learned dictionary. To reduce the size of the dictionary, the K-means method is applied to it, producing a K-EMD dictionary. Compared with the K-SVD algorithm, the K-EMD dictionary decomposes audio signals into structured components, so the sparsity of the representation increases by 34.4% and the SNR of the recovered audio signals increases by 20.9%.
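As a rough illustration of the dictionary-reduction step described in the abstract, the sketch below clusters a set of candidate atoms with plain K-means and keeps the unit-norm centroids as the reduced dictionary. It is not the authors' implementation. The EMD stage is omitted, the candidate atoms are synthetic noisy sinusoid frames standing in for IMF segments, and the function names (`farthest_point_init`, `kmeans_atoms`, `make_atoms`), the frame length, and the farthest-point initialization are all assumptions made for this example.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit Euclidean norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_point_init(atoms, k):
    """Deterministic initialization (an assumption of this sketch):
    start from the first atom, then repeatedly add the atom that is
    farthest from its nearest already-chosen centroid."""
    centroids = [list(atoms[0])]
    while len(centroids) < k:
        best = max(atoms, key=lambda a: min(sq_dist(a, c) for c in centroids))
        centroids.append(list(best))
    return centroids


def kmeans_atoms(atoms, k, iters=50):
    """Cluster candidate atoms with Lloyd's K-means; return k unit-norm centroids."""
    centroids = farthest_point_init(atoms, k)
    for _ in range(iters):
        # Assignment step: attach each atom to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for a in atoms:
            dists = [sq_dist(a, c) for c in centroids]
            clusters[dists.index(min(dists))].append(a)
        # Update step: each centroid becomes the mean of its cluster.
        new_centroids = []
        for cen, members in zip(centroids, clusters):
            if members:
                new_centroids.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centroids.append(cen)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # assignments no longer change
        centroids = new_centroids
    return [normalize(c) for c in centroids]


def make_atoms(n_per_freq=20, length=32, seed=1):
    """Synthetic stand-ins for IMF frames: noisy sinusoids at two frequencies."""
    rng = random.Random(seed)
    atoms = []
    for cycles in (2, 8):
        for _ in range(n_per_freq):
            frame = [math.sin(2 * math.pi * cycles * t / length) + rng.gauss(0, 0.05)
                     for t in range(length)]
            atoms.append(normalize(frame))
    return atoms


atoms = make_atoms()
dictionary = kmeans_atoms(atoms, k=2)
print(len(atoms), "candidate atoms ->", len(dictionary), "dictionary atoms")
```

In this toy setting the two centroids settle on the two sinusoid families, so the reduced dictionary keeps one representative atom per recurring waveform shape. In a real pipeline the candidate atoms would come from an EMD of audio frames, and the number of clusters would be chosen to trade dictionary size against reconstruction quality.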
Keywords: Dictionary Learning, EMD, K-means Method, Sparse Representation.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1087632