ENHANCING QUALITY OF CELP CODED SPEECH VIA WIDEBAND EXTENSION BY USING VOICING GMM INTERPOLATION AND IINM RE-SYNTHESIS
Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - PROCEEDINGS |
Subtitle of host publication | Speech Processing Neural Networks for Signal Processing |
Pages | 241-244 |
Volume | 1 |
Publication status | Published - May 2002 |
Publication series
Name | |
---|---|
ISSN (Print) | l520-6149 |
Conference
Title | 2002 IEEE International Conference on Acustics, Speech, and Signal Processing (ICASSP 2002) |
---|---|
Location | Renaissance Orlando Resort |
Place | United States |
City | Orlando |
Period | 13 - 17 May 2002 |
Link(s)
Abstract
This paper presents a procedure to improve the quality of narrowband (0-4khz) CELP coded speech. The procedure is based to refine the pitch periodicity and reinsert the high frequency components (4-8khz) in the narrowband CELP decoded speech. The narrowband CELP decoded speech is first analyzed with Harmonic+Noise analyzer and Lowband information are extracted. By exploiting the Lowband spectrum envelope and V/UV information, the highband (4-8khz) spectrum envelope is recovered statistically by using a voiced/unvoiced Gaussian mixture model with interpolation. Lowband information along with the estimated highband information is then fed to the Harmonic+Noise synthesizer to re-synthesize a wideband speech. The objective and subjective tests are performed to evaluate the quality of the re-synthesis wideband (0-8khz) speech. The results of the above experiments show that the re-synthesis wideband speech is pleasant to listen with crispy characteristics and preferred over the CELP coded speech.
Citation Format(s)
ENHANCING QUALITY OF CELP CODED SPEECH VIA WIDEBAND EXTENSION BY USING VOICING GMM INTERPOLATION AND IINM RE-SYNTHESIS. / Raza, Dar Ghulam ; Chan, Cheung-Fat.
2002 IEEE International Conference on Acoustics, Speech, and Signal Processing - PROCEEDINGS: Speech Processing Neural Networks for Signal Processing. Vol. 1 2002. p. 241-244.Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review