A High Quality Speech Coder at 600 bps

Yong Zhang; Ruimin Hu

Abstract:

This paper presents a vocoder that produces high-quality synthetic speech at 600 bps. To reduce the bit rate, the algorithm is based on a sinusoidally excited linear prediction model, which requires only a few coding parameters. Three consecutive frames are grouped into a superframe and quantized jointly with vector quantization to obtain high coding efficiency. Inter-frame redundancy is exploited by applying distinct quantization schemes to the different unvoiced/voiced frame combinations within the superframe. Experimental results show that the proposed coder delivers better quality than 2.4 kbps LPC10e and approximately the same quality as 2.4 kbps MELP, with high robustness.
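The superframe idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 20 ms frame duration, the random codebooks, the vector dimension, and the function names `bits_per_superframe`, `make_codebooks`, and `quantize_superframe` are all assumptions made for illustration, since the abstract gives none of these details. The sketch shows only two things: how small the bit budget of a three-frame superframe is at 600 bps, and how a codebook can be selected by the unvoiced/voiced pattern of the superframe before the three frames are quantized as one joint vector.

```python
import random

FRAMES_PER_SUPERFRAME = 3  # stated in the abstract


def bits_per_superframe(bit_rate_bps, frame_ms):
    """Bit budget for one superframe. frame_ms is an assumed value,
    the abstract does not state the frame duration."""
    return bit_rate_bps * FRAMES_PER_SUPERFRAME * frame_ms / 1000.0


def make_codebooks(dim, size, seed=0):
    """Build one random codebook per voicing pattern.
    Random entries are placeholders for trained codebooks (assumption)."""
    rng = random.Random(seed)
    patterns = [(a, b, c) for a in (0, 1) for b in (0, 1) for c in (0, 1)]
    return {
        p: [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(size)]
        for p in patterns
    }


def quantize_superframe(frames, voicing, codebooks):
    """Concatenate the three frame vectors and return the index and value
    of the nearest codeword in the codebook chosen by the voicing pattern."""
    joint = [x for frame in frames for x in frame]
    codebook = codebooks[tuple(voicing)]

    def squared_error(codeword):
        return sum((a - b) ** 2 for a, b in zip(joint, codeword))

    index = min(range(len(codebook)), key=lambda i: squared_error(codebook[i]))
    return index, codebook[index]


if __name__ == "__main__":
    # Hypothetical 20 ms frames: only a few dozen bits per superframe.
    print(bits_per_superframe(600, 20))

    # Two parameters per frame, so joint vectors have dimension 6.
    codebooks = make_codebooks(dim=6, size=16)
    frames = [[0.1, -0.2], [0.15, -0.1], [0.3, 0.0]]
    voicing = (1, 1, 0)  # voiced, voiced, unvoiced
    index, codeword = quantize_superframe(frames, voicing, codebooks)
    print(index, codeword)
```

Because the voicing pattern selects the codebook, each index is spent only on parameter combinations plausible for that pattern, which is one way distinct schemes per unvoiced/voiced combination can save bits.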

Keywords: Speech coding, vector quantization, linear prediction, mixed sinusoidal excitation

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1060551
