TY - GEN
T1 - ENHANCING QUALITY OF CELP CODED SPEECH VIA WIDEBAND EXTENSION BY USING VOICING GMM INTERPOLATION AND HNM RE-SYNTHESIS
AU - Raza, Dar Ghulam
AU - Chan, Cheung-Fat
PY - 2002/9
Y1 - 2002/9
N2 - This paper presents a procedure to improve the quality of narrowband (0-4khz) CELP coded speech. The procedure is based to refine the pitch periodicity and reinsert the high frequency components (4-8khz) in the narrowband CELP decoded speech. The narrowband CELP decoded speech is first analyzed with Harmonic+Noise analyzer and Lowband information are extracted. By exploiting the Lowband spectrum envelope and V/UV information, the highband (4-8khz) spectrum envelope is recovered statistically by using a voiced/unvoiced gaussian mixture model with interpolation. Lowband information along with the estimated highband information is then fed to the Harmonic+Noise synthesizer to re-synthesize a wideband speech. The objective and subjective tests are performed to evaluate the quality of the re-synthesis wideband (0-8khz) speech. The results of the above experiments show that the re-synthesis wideband speech is pleasant to listen with crispy characteristics and preferred over the CELP coded speech.
AB - This paper presents a procedure to improve the quality of narrowband (0-4khz) CELP coded speech. The procedure is based to refine the pitch periodicity and reinsert the high frequency components (4-8khz) in the narrowband CELP decoded speech. The narrowband CELP decoded speech is first analyzed with Harmonic+Noise analyzer and Lowband information are extracted. By exploiting the Lowband spectrum envelope and V/UV information, the highband (4-8khz) spectrum envelope is recovered statistically by using a voiced/unvoiced gaussian mixture model with interpolation. Lowband information along with the estimated highband information is then fed to the Harmonic+Noise synthesizer to re-synthesize a wideband speech. The objective and subjective tests are performed to evaluate the quality of the re-synthesis wideband (0-8khz) speech. The results of the above experiments show that the re-synthesis wideband speech is pleasant to listen with crispy characteristics and preferred over the CELP coded speech.
UR - http://www.scopus.com/inward/record.url?scp=84960900404&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84960900404&origin=recordpage
M3 - RGC 32 - Refereed conference paper (with host publication)
BT - 2002 11th European Signal Processing Conference
T2 - 11th European Signal Processing Conference (Eusipco 2002)
Y2 - 3 September 2002 through 6 September 2002
ER -