Maximum model distance discriminative training for text-independent speaker verification

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review

1 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationIECON Proceedings (Industrial Electronics Conference)
Pages1769-1774
Volume2
Publication statusPublished - 2004

Publication series

Name
Volume2

Conference

TitleIECON 2004 - 30th Annual Conference of IEEE Industrial Electronics Society
PlaceKorea, Republic of
CityBusan
Period2 - 6 November 2004

Abstract

This paper presents the design and implementation of text-independent speaker verification. We apply the maximum model distance (MMD) algorithm to the Gaussian mixture model (GMM) training. The traditional maximum likelihood (ML) method only utilizes the labeled utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the model distance between the target and competing speakers, MMD could add the discriminative capability into the training procedure and then improve the verification performance. Based on the TIMIT corpus, we designed the verification experiments and the results show that the equal error rate (EER) could be reduced greatly compared with the traditional ML method. © 2004 IEEE.

Citation Format(s)

Maximum model distance discriminative training for text-independent speaker verification. / Hong, Q. Y.; Kwong, S.
IECON Proceedings (Industrial Electronics Conference). Vol. 2 2004. p. 1769-1774 TD6-1.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review