On the analysis and evaluation of prosody conversion techniques

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review

5 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings of the 2017 International Conference on Asian Language Processing (IALP)
EditorsRong Tong, Yue Zhang, Yanfeng Lu, Minghui Dong
PublisherIEEE
Pages44-47
ISBN (Electronic)9781538619810
ISBN (Print)9781538619827
Publication statusPublished - Dec 2017

Publication series

NameInternational Conference on Asian Language Processing
ISSN (Print)2159-1962
ISSN (Electronic)2159-1970

Conference

Title21st International Conference on Asian Language Processing (IALP 2017)
LocationNational University of Singapore
PlaceSingapore
CitySingapore
Period5 - 7 December 2017

Abstract

Voice conversion is a process of modifying the characteristics of source speaker such as spectrum or/and prosody, to sound as if it was spoken by another speaker. In this paper, we study the evaluation of prosody transformation, in particular, the evaluation of Fundamental Frequency (F0) conversion. F0 is an essential prosody feature that should be taken care of in a compressive voice conversion framework. So far, the evaluation of the converted prosody features is performed mainly by looking at Pearson Correlation Coefficient and Root Mean Square Error (RMSE). Unfortunately, these techniques do not explicitly measure the F0 alignment between the source and target signals. We believe that an evaluation measure that takes into account the time alignment of F0 is needed to provide a new perspective. Therefore, in this paper, we study a new technique to assess the accuracy of prosody transformation. In our experiments with different prosody transformation techniques, we report that the proposed evaluation approach achieves consistent results with the baseline evaluation metrics.

Research Area(s)

  • Prosody evaluation, prosody transformation, voice conversion, VOICE CONVERSION, TIME

Citation Format(s)

On the analysis and evaluation of prosody conversion techniques. / Sisman, Berrak; Lee, Grandee; Li, Haizhou; Tan, Kay Chen.

Proceedings of the 2017 International Conference on Asian Language Processing (IALP). ed. / Rong Tong; Yue Zhang; Yanfeng Lu; Minghui Dong. IEEE, 2017. p. 44-47 (International Conference on Asian Language Processing).

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review