Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 2

Search results for: N. Ellouze

2 Automatic Segmentation of the Clean Speech Signal

Authors: A. Bouzid, M. A. Ben Messaoud, N. Ellouze

Abstract:

Speech segmentation is the detection of change points in order to partition an input speech signal into regions, each of which corresponds to only one speaker. In this paper, we apply two features based on the multi-scale product (MP) of the clean speech signal, namely the spectral centroid of the MP and the zero-crossing rate of the MP. We focus on multi-scale product analysis as an important tool for segment extraction. The multi-scale product is obtained by multiplying the wavelet transform coefficients of the speech signal at three successive dyadic scales. We evaluated our method on the Keele database. Experimental results show that the method performs well: the two simple features can find word boundaries and extract the segments of the clean speech.
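The multi-scale product and the two features named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the derivative-of-Gaussian kernel is an assumed stand-in for whatever wavelet the paper uses, the scales (1, 2, 4) are only one possible choice of three successive dyadic scales, and all function names are hypothetical.

```python
import cmath
import math

def dog_kernel(scale):
    """First derivative of a Gaussian at integer offsets (assumed stand-in wavelet)."""
    half = 4 * scale
    return [-t / scale**2 * math.exp(-t * t / (2.0 * scale**2))
            for t in range(-half, half + 1)]

def convolve_same(x, kernel):
    """Zero-padded convolution whose output has the same length as x."""
    half = len(kernel) // 2
    out = []
    for n in range(len(x)):
        acc = 0.0
        for k, w in enumerate(kernel):
            i = n - (k - half)
            if 0 <= i < len(x):
                acc += w * x[i]
        out.append(acc)
    return out

def multiscale_product(x, scales=(1, 2, 4)):
    """Pointwise product of the wavelet coefficients at three dyadic scales."""
    product = [1.0] * len(x)
    for s in scales:
        coeffs = convolve_same(x, dog_kernel(s))
        product = [p * c for p, c in zip(product, coeffs)]
    return product

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency in Hz, computed with a naive DFT."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        s = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(s))
    total = sum(mags)
    if total == 0:
        return 0.0
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total
```

In use, one would compute `multiscale_product` over the signal, split the result into short frames, and track `zero_crossing_rate` and `spectral_centroid` per frame. How those per-frame values are thresholded into boundaries is not stated in the abstract and is left open here.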

Keywords: spectral centroid, multiscale product, speech segmentation, zero crossings rate

1 Robust Features for Impulsive Noisy Speech Recognition Using Relative Spectral Analysis

Authors: Noureddine Ellouze, Hajer Rahali, Zied Hajaiej

Abstract:

The goal of speech parameterization is to extract the relevant information about what is being spoken from the audio signal. In speech recognition systems, Mel-Frequency Cepstral Coefficients (MFCC) and Relative Spectral Mel-Frequency Cepstral Coefficients (RASTA-MFCC) are the two main techniques used. This paper presents some modifications to the original MFCC method. The effectiveness of the proposed changes to MFCC, called Modified Function Cepstral Coefficients (MODFCC), was tested and compared against the original MFCC and RASTA-MFCC features. Prosodic features such as jitter and shimmer are added to the baseline spectral features. These techniques were tested with impulsive signals under various noise conditions within the AURORA databases.
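The prosodic features mentioned in the abstract can be illustrated with the commonly used "local" definitions of jitter and shimmer. The abstract does not say which variants the authors compute, so the formulas below are an assumption, and the function names are hypothetical. Both functions expect per-cycle measurements (pitch periods or peak amplitudes) already extracted from voiced speech.

```python
def local_perturbation(values):
    """Mean absolute difference of consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(a - b) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))

def local_jitter(periods):
    """Cycle-to-cycle variation of the pitch period (assumed 'local' definition)."""
    return local_perturbation(periods)

def local_shimmer(amplitudes):
    """Cycle-to-cycle variation of the peak amplitude (assumed 'local' definition)."""
    return local_perturbation(amplitudes)
```

A perfectly periodic voice gives zero for both measures, and the resulting values would be appended to each frame's spectral feature vector.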

Keywords: MFCC, auditory filter, impulsive noise, prosodic features, RASTA filter
