TY - GEN
T1 - Bandwidth extension of narrowband speech based on blind model adaptation
AU - Yao, Sheng
AU - Chan, Cheung-Fat
PY - 2007
Y1 - 2007
N2 - Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems. © 2007 EURASIP.
AB - Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems. © 2007 EURASIP.
UR - http://www.scopus.com/inward/record.url?scp=84863763242&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84863763242&origin=recordpage
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9788392134022
SP - 2350
EP - 2354
BT - European Signal Processing Conference
T2 - 15th European Signal Processing Conference, EUSIPCO 2007
Y2 - 3 September 2007 through 7 September 2007
ER -