A High Quality Speech Coder at 600 bps
Authors: Yong Zhang, Ruimin Hu
This paper presents a vocoder to obtain high quality synthetic speech at 600 bps. To reduce the bit rate, the algorithm is based on a sinusoidally excited linear prediction model which extracts few coding parameters, and three consecutive frames are grouped into a superframe and jointly vector quantization is used to obtain high coding efficiency. The inter-frame redundancy is exploited with distinct quantization schemes for different unvoiced/voiced frame combinations in the superframe. Experimental results show that the quality of the proposed coder is better than that of 2.4kbps LPC10e and achieves approximately the same as that of 2.4kbps MELP and with high robustness.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1060551Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1868
 Ovens, M.J..Ponting, Turner.M.E, "Ultra low bit rate voice coding," IEE Seminar, Vol.4, pp 911 - 920, 2000
 Gwenael guilmin, Francois Capman, and et.al, "New NATO STANAG narrow band voice coder at 600 bit/s", IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.3, pp.689-692, 2006
 T.Wang, K.Koishida, V.Cuperman, and et.al, "A 1200/2400 bps coding suite based on MELP," Proc of IEEE Workshop on Speech Coding, Vol.1, pp. 122-126, 2002
 O.Gottesman, A.Gersho, "Enhanced Waveform Interpolative Coding at Low Bit-rate", IEEE Trans.Speech Audio Processing, vol.9, No.8, pp.242-250, 2001
 Minoru Kohata, "A New 1.2kbit/s speech coding method based on a sinusoidal harmonic vocoder," Systems and Computers in Japan, vol.31, No.14, pp.64-73, 2000
 Jian Cong, Suo Cong, "New speech encoding algorithm for ultra low bit rate at 600/300," IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.2, pp.709-712, 2006
 Ehsan Jahangiri, Shahrokh Ghaemmaghami, "Scalable speech coding at rates below 900 bps", IEEE International Conference on Multimedia & Expo, Vol.1, pp.85-88, 2008
 A.D.Subramaniam, B.D.Rao, "PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies," IEEE Trans. Speech Audio Processing, Vol. 11, No. 2, pp. 130-142, Mar. 2003.
 L.M. Supplee, R.P.Cohn, J.S.Collura, A.V.McCree, "MELP: The new federal standard at 2400 bits/s," IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.4, pp.1591-1954, 1997
 Thomas E.Tremain, "The Government Standard Linear Predictive Coding Algorithm: LPC-10," Speech Technology, No.2, pp.40-49, 1982