PROTOTYPICAL NETWORKS FOR SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review

10 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publication2020 IEEE International Conference on Acoustics, Speech, and Signal Processing
Subtitle of host publicationProceedings
PublisherIEEE
Pages6804-6808
Volume2020-May
ISBN (Electronic)978-1-5090-6631-5
ISBN (Print)978-1-5090-6632-2
Publication statusPublished - May 2020

Publication series

NameInternational Conference on Acoustics, Speech, and Signal Processing (ICASSP)
PublisherIEEE
ISSN (Print)1520-6149
ISSN (Electronic)2379-190X

Conference

Title45th International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020
Locationvirtual
City
Period4 - 8 May 2020

Abstract

Speaker verification aims to recognize target speakers with very few enrollment utterances. Conventional approaches learn a representation model to extract the speaker embeddings for verification. Recently, there are several new approaches in meta-learning which try to learn a shared metric space. Among these approaches, prototypical networks aim at learning a non-linear mapping from the input space to an embedding space with a predefined distance metric. In this paper, we investigate the use of prototypical networks in a small footprint text-independent speaker verification task. Our work is evaluated on SRE10 evaluation set. Experiments show that prototypical networks outperform the conventional method when the amount of data per training speaker is limited.

Research Area(s)

  • Meta learning, Prototypical networks, Speaker verification

Citation Format(s)

PROTOTYPICAL NETWORKS FOR SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION. / Ko, Tom; Chen, Yangbin; Li, Qing.

2020 IEEE International Conference on Acoustics, Speech, and Signal Processing: Proceedings. Vol. 2020-May IEEE, 2020. p. 6804-6808 9054471 (International Conference on Acoustics, Speech, and Signal Processing (ICASSP)).

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)peer-review