Speech Enhancement of Vowels Based on Pitch and Formant Frequency
Authors: R. Rishma Rodrigo, R. Radhika, M. Vanitha Lakshmi
Abstract:
Numerous signal processing based speech enhancement systems have been proposed to improve intelligibility in the presence of noise. Traditionally, studies of neural vowel encoding have focused on the representation of formants (peaks in vowel spectra) in the discharge patterns of the population of auditory-nerve (AN) fibers. A method is presented for recording high-frequency speech components into a low-frequency region, to increase audibility for hearing loss listeners. The purpose of the paper is to enhance the formant of the speech based on the Kaiser window. The pitch and formant of the signal is based on the auto correlation, zero crossing and magnitude difference function. The formant enhancement stage aims to restore the representation of formants at the level of the midbrain. A MATLAB software’s are used for the implementation of the system with low complexity is developed.
Keywords: Formant estimation, formant enhancement, pitch detection, speech analysis.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1111801
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1647References:
[1] Akshay Rao, Laurel H. Carney, “Speech enhancement for hearing loss based on vowel cofing in auditory midbrain” in IEEE transactions on biomedical engineering, vol. 61, no. 7, July 2014.
[2] R. A. Cole, Y. Yan, B. Mak, M. Fanty, and T. Bailey, “The contribution of consonants versus vowels to word recognition in fluent speech,” in Proc. IEEE ICASSP, Atlanta, GA, USA, May. 1996, pp. 853–856.
[3] D. Kewley-Port, T. Z. Burkle, and J. H. Lee, “Contribution of consonant versus vowel information to sentence intelligibility for young normal hearing and elderly hearing-impaired listeners,” J. Acoust. Soc. Amer., vol. 122, no. 4, pp. 2365–75, Oct. 2007.
[4] M. J. Owren and G. C. Cardillo, “The relative roles of vowels and consonants in discriminating talker identity versus word meaning,” J. Acoust. Soc. Amer., vol. 119, no. 3, pp. 1727–39, Mar. 2006.
[5] G. Parikh and P. C. Loizou, “The influence of noise on vowel and consonant cues,” J. Acoust. Soc. Amer., vol. 118, no. 6, pp. 3874–3888, Dec. 2005.
[6] G. Fant, Acoustic Theory of Speech Production. Hague, The Netherlands: Mouton, 1960. Rao and Carney: Speech Enhancement for Listeners with Hearing Loss 2091
[7] D. B. Fry, A. S. Abramson, P. D. Eimas, and A. M. Liberman, “The identification and discrimination of synthetic vowels,” Language Speech, vol. 5, no. 4, pp. 171–189, Oct.–Dec. 1962.
[8] H. L. Helmholtz, On the Sensations of Tone as a Physiological Basis for the Theory of Music. Cambridge, U.K.: Cambridge Univ. Press, 2009.
[9] M. Ito, J. Tsuchida, and M. Yano, “On the effectiveness of whole spectral shape for vowel perception,” J. Acoust. Soc. Amer., vol. 110, no. 2, pp. 1141–1149, Aug. 2001.
[10] S. A. Zahorian and A. J. Jagharghi, “Spectral-shape features versus formants as acoustic correlates for vowels,” J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 1966–1982, 1993.
[11] J. D. Miller, “Auditory-perceptual interpretation of the vowel,” J. Acoust. Soc. Amer., vol. 85, no. 5, pp. 2114–2134, May. 1989.
[12] R. K. Potter and J. C. Steinberg, “Toward the specification of speech,” J. Acoust. Soc. Amer., vol. 22, no. 6, pp. 807–820, 1950.
[13] B. Mohr and W.-Y. Wang, “Perceptual distance and the specification of phonological features,” Phonetica, vol. 18, no. 1, pp. 31–45, 1968.
[14] L. C. W. Pols, L. J. T. V. D. Kamp, andR. Plomp, “Perceptual and physical space of vowel sounds,” J. Acoust. Soc. Amer., vol. 46, no. 2B, pp. 458– 467, Aug. 1969.
[15] G. Langner and C. E. Schreiner, “Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms,” J. Neurophysiol., vol. 60, no. 6, pp. 1799–1822, Dec. 1988.
[16] B. S. Krishna and M. N. Semple, “Auditory temporal processing: responses to sinusoidally amplitude-modulated tones in the inferior colliculus,” J. Neurophysiol., vol. 84, no. 1, pp. 255–273, Jul. 2000.
[17] P. C. Nelson and L. H. Carney, “Neural rate and timing cues for detection and discrimination of amplitude-modulated tones in the awake rabbit inferior colliculus,” J. Neurophysiol., vol. 97, no. 1, pp. 522–539, Jan.2007.