ENHANCING QUALITY OF CELP CODED SPEECH VIA WIDEBAND EXTENSION BY USING VOICING GMM INTERPOLATION AND IINM RE-SYNTHESIS

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review

10 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publication2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - PROCEEDINGS
Subtitle of host publicationSpeech Processing Neural Networks for Signal Processing
Pages241-244
Volume1
Publication statusPublished - May 2002

Publication series

Name
ISSN (Print)l520-6149

Conference

Title2002 IEEE International Conference on Acustics, Speech, and Signal Processing (ICASSP 2002)
LocationRenaissance Orlando Resort
PlaceUnited States
CityOrlando
Period13 - 17 May 2002

Abstract

This paper presents a procedure to improve the quality of narrowband (0-4khz) CELP coded speech. The procedure is based to refine the pitch periodicity and reinsert the high frequency components (4-8khz) in the narrowband CELP decoded speech. The narrowband CELP decoded speech is first analyzed with Harmonic+Noise analyzer and Lowband information are extracted. By exploiting the Lowband spectrum envelope and V/UV information, the highband (4-8khz) spectrum envelope is recovered statistically by using a voiced/unvoiced Gaussian mixture model with interpolation. Lowband information along with the estimated highband information is then fed to the Harmonic+Noise synthesizer to re-synthesize a wideband speech. The objective and subjective tests are performed to evaluate the quality of the re-synthesis wideband (0-8khz) speech. The results of the above experiments show that the re-synthesis wideband speech is pleasant to listen with crispy characteristics and preferred over the CELP coded speech.

Citation Format(s)

ENHANCING QUALITY OF CELP CODED SPEECH VIA WIDEBAND EXTENSION BY USING VOICING GMM INTERPOLATION AND IINM RE-SYNTHESIS. / Raza, Dar Ghulam ; Chan, Cheung-Fat.

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - PROCEEDINGS: Speech Processing Neural Networks for Signal Processing. Vol. 1 2002. p. 241-244.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review