Language-specific Effects on Automatic Speech Recognition Errors for World Englishes

June Choe, Yiran Chen, May Pik Yu Chan, Aini Li, Xin Gao, Nicole Holliday

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

15 Downloads (CityUHK Scholars)

Abstract

Despite recent advancements in automated speech recognition (ASR) technologies, reports of unequal performance across speakers of different demographic groups abound. At the same time, the focus on performance metrics such as the Word Error Rate (WER) in prior studies limit the specificity and scope of recommendations that can be offered for system engineering to overcome these challenges. The current study bridges this gap by investigating the performance of Otter’s automatic captioning system on native and non-native English speakers of different language background through a linguistic analysis of segment-level errors. By examining language-specific error profiles for vowels and consonants motivated by linguistic theory, we find that certain categories of errors can be predicted from the phonological structure of a speaker’s native language. © 2022 Proceedings - International Conference on Computational Linguistics, COLING. All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the 29th International Conference on Computational Linguistics
PublisherInternational Committee on Computational Linguistics
Pages7177-7186
Publication statusPublished - Oct 2022
Externally publishedYes
Event29th International Conference on Computational Linguistics, COLING 2022 - Gyeongju, Korea, Republic of
Duration: 12 Oct 202217 Oct 2022
https://aclanthology.org/2022.coling-1

Publication series

NameProceedings - International Conference on Computational Linguistics, COLING
PublisherAssociation for Computational Linguistics (ACL)
ISSN (Print)2951-2093

Conference

Conference29th International Conference on Computational Linguistics, COLING 2022
PlaceKorea, Republic of
CityGyeongju
Period12/10/2217/10/22
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Language-specific Effects on Automatic Speech Recognition Errors for World Englishes'. Together they form a unique fingerprint.

Cite this