A New Time-Frequency Speech Analysis Approach Based On Adaptive Fourier Decomposition
Authors: Liming Zhang
Abstract:
This paper proposes a new time-frequency speech analysis approach based on adaptive Fourier decomposition (AFD). Because the fundamental frequency of a speech signal often fluctuates, classical spectrogram analysis based on the short-time Fourier transform (STFT) makes window size selection difficult. AFD is a recently developed signal decomposition theory designed for time-varying, non-stationary signals. Its distinguishing characteristic is that it provides an instantaneous frequency for each decomposed component, which simplifies time-frequency analysis. Experiments are conducted on a sample sentence from the TIMIT Acoustic-Phonetic Continuous Speech Corpus. The results show that the AFD-based time-frequency distribution outperforms the STFT-based one.
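The window size difficulty mentioned for the STFT can be illustrated with a small calculation. The sketch below is not taken from the paper: the helper name `stft_resolution` and the three window lengths are assumptions chosen for illustration, and the bin spacing assumes no zero-padding. The 16 kHz sampling rate is that of the TIMIT recordings.

```python
# Illustrative sketch only: the helper name and window lengths are
# assumptions, not values reported in the paper.

def stft_resolution(window_length, sample_rate):
    """Return (frame duration in ms, DFT bin spacing in Hz) for one STFT frame.

    Assumes the DFT length equals the window length (no zero-padding).
    """
    duration_ms = 1000.0 * window_length / sample_rate
    bin_spacing_hz = sample_rate / window_length
    return duration_ms, bin_spacing_hz


SAMPLE_RATE = 16000  # TIMIT speech is sampled at 16 kHz

# Compare a short, a medium, and a long analysis window.
resolutions = {n: stft_resolution(n, SAMPLE_RATE) for n in (128, 512, 2048)}

for n, (ms, hz) in resolutions.items():
    print(f"N={n:5d}: {ms:6.1f} ms per frame, {hz:8.4f} Hz per bin")
```

A longer window narrows the bin spacing but averages over more time, so a fluctuating fundamental frequency is smeared within the frame. A shorter window tracks the fluctuation but can no longer separate neighbouring harmonics. The product of the two quantities is fixed, which is why no single window length suits the whole utterance.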
Keywords: Adaptive Fourier decomposition, instantaneous frequency, speech analysis, time-frequency distribution.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1087404