Hai Quang Hong Dam and Hai Ho and Minh Hoang Le Ngo
Blind Speech Separation Using SRP-PHAT Localization and Optimal Beamformer in Two-Speaker Environments
1529 - 1533
2016
10
8
International Journal of Computer and Information Engineering
https://publications.waset.org/pdf/10005265
https://publications.waset.org/vol/116
World Academy of Science, Engineering and Technology
This paper investigates blind speech separation from a mixture of two speakers. A voice activity detector based on the Steered Response Power with Phase Transform (SRP-PHAT) is presented to detect the activity of each speech source; the desired speech signals are then extracted from the mixture using an optimal beamformer. To evaluate the algorithm's effectiveness, a simulation using real speech recordings was performed in a double-talk situation where both speakers are active all the time. The evaluation shows that the proposed blind speech separation algorithm offers good interference suppression while maintaining low distortion of the desired signal.
Open Science Index 116, 2016
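
Illustrative note: the abstract names SRP-PHAT as the localization step. SRP-PHAT is commonly built by summing PHAT-weighted cross-correlations (GCC-PHAT) over microphone pairs at the lags implied by each candidate source position. The sketch below shows only that pairwise building block for two channels. It is not the authors' implementation: the function names `gcc_phat` and `estimate_delay`, the `max_lag` parameter, and the regularization constant are assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def gcc_phat(x, y, max_lag):
    """PHAT-weighted cross-correlation of x and y for lags -max_lag..max_lag.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(x) + len(y)                 # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)              # cross-power spectrum
    cross /= np.abs(cross) + 1e-12      # PHAT weighting: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)
    # Reorder so that index 0 corresponds to lag -max_lag.
    return np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))

def estimate_delay(x, y, max_lag):
    """Delay of x relative to y in samples (positive when x lags y)."""
    cc = gcc_phat(x, y, max_lag)
    return int(np.argmax(cc)) - max_lag
```

Under these assumptions, a full SRP-PHAT map would evaluate such correlations for every microphone pair and sum the values at the lags corresponding to each candidate direction; the direction with the highest summed response is taken as the source estimate.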