Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 31106
Voice Disorders Identification Using Hybrid Approach: Wavelet Analysis and Multilayer Neural Networks

Authors: L. Salhi, M. Talbi, A. Cherif


This paper presents a new strategy of identification and classification of pathological voices using the hybrid method based on wavelet transform and neural networks. After speech acquisition from a patient, the speech signal is analysed in order to extract the acoustic parameters such as the pitch, the formants, Jitter, and shimmer. Obtained results will be compared to those normal and standard values thanks to a programmable database. Sounds are collected from normal people and patients, and then classified into two different categories. Speech data base is consists of several pathological and normal voices collected from the national hospital “Rabta-Tunis". Speech processing algorithm is conducted in a supervised mode for discrimination of normal and pathology voices and then for classification between neural and vocal pathologies (Parkinson, Alzheimer, laryngeal, dyslexia...). Several simulation results will be presented in function of the disease and will be compared with the clinical diagnosis in order to have an objective evaluation of the developed tool.

Keywords: Neural Networks, Pitch, Wavelet Transform, formants, Pathological Voices

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2557


[1] V. Parsa and D. G. Jamieson, "Interactions between speech coders and disordered speech," Speech Communication, vol. 40, no. 7, pp. 365-385, 2003.
[2] S. B. Davis, "Acoustic characteristics of normal and pathological voices," Speech and Language: Advances inBasic Research and Practice, vol. 1, pp. 271-335, 1979.
[3] F. Plant, H Kessler, B Cheetham, J Earis, "Speech Monitoring of Infective Laryngitis", Proceedings of ICSLP96, Philadelphia, pp. 749 - 752 , 1996
[4] M.N. Viera, F.R. McInnes, M.A. Jack "Robust F0 and Jitter estimation in the Pathological voices ", Proceedings of ICSLP96, Philadelphia, pp.745 -748, 1996.
[5] J.Nayak, P.S.Bhat "Classification and analysis of speech abnormalities", ITBM-RBM 26 (2005) 319-327.
[6] S. Mallat, "A Theory for multiresolution signal decomposition: Wavelet representation", IEEE Trans. Pattern Analysis and Machine Intelligence. Vol. 11. No. 7 pp674-693 July 1989.
[7] B. Boyanov, S.Hadjitodorov: "Acoustic analysis of pathological voices: a voice analysis system for screening of laryngeal diseases", Proc. IEEE Engineering in Medical and Biology, (1997), vol. 16, no. 4, 74-82.
[8] J.J. Jiang,Yu Zhang: "Nonlinear dynamic analysis of speech from pathological subjects", Proc IEEE Electronics Letters, March (2002), vol.38, no.6.
[9] P.Yu, M.Ouaknine, J.Revis, and A.Giovanni, "Objective Voice Analysis for Dysphonic Patients: A Multiparametric Protocol Including Acoustic and Aerodynamic Measurements", Journal of Voice Vol. 15, No. 4, pp. 529-542, 2001
[10] J.Wang, Jo.Cheolwoo, "Performance of Gaussian Mixture Model as a classifier for Pathological Voice", proceeding of the ASST in Auckland 2006, pp 165-169.
[11] J.Kortelainen, K.Noponen, « Neural networks », Intelligent Systems 2005
[12] S.Lotfi, C.Adnène, "A Speech Processing Interface for Analysis of Pathological Voices", in proceeding of ICTTA conference, Damascus 2006.
[13] S.Lotfi, B.Haythem, C.Adnène, " Interface d-analyse vocale a l-identification de certaines pathologies d-origine neurologique et vocale", in proceeding of JTM conference, Tunis 2007.
[14] A.chérif, « Pitch detection and formant extraction of Arabic speech processing » Journal of applied acoustics, January 2001
[15] A.M. Gaouda, M. Salama, A. Chikhani, and M. Sultan, "Application of wavelet analysis for monitoring dynamic performance in industrial plants," North American Power Symposium, Oct. 1997, Laramie, Wyoming.
[16] C.Fredouille, G.Pouchoulin, J-F. Bonastre, M.Azzarello, A.Giovanni, A.Ghio "Application of automatic speaker recognition techniques to pathological voice assessment" in proceeding of international conference on acoustic speech and signal processing (ICASSP 2005)
[17] C. M. Bishop, "Neural Networks for Pattern Recognition". Oxford: Clarendon Press, 1996.