Abstract
Despite recent advancements in automated speech recognition (ASR) technologies, reports of unequal performance across speakers of different demographic groups abound. At the same time, the focus on performance metrics such as the Word Error Rate (WER) in prior studies limit the specificity and scope of recommendations that can be offered for system engineering to overcome these challenges. The current study bridges this gap by investigating the performance of Otter’s automatic captioning system on native and non-native English speakers of different language background through a linguistic analysis of segment-level errors. By examining language-specific error profiles for vowels and consonants motivated by linguistic theory, we find that certain categories of errors can be predicted from the phonological structure of a speaker’s native language. © 2022 Proceedings - International Conference on Computational Linguistics, COLING. All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 29th International Conference on Computational Linguistics |
| Publisher | International Committee on Computational Linguistics |
| Pages | 7177-7186 |
| Publication status | Published - Oct 2022 |
| Externally published | Yes |
| Event | 29th International Conference on Computational Linguistics, COLING 2022 - Gyeongju, Korea, Republic of Duration: 12 Oct 2022 → 17 Oct 2022 https://aclanthology.org/2022.coling-1 |
Publication series
| Name | Proceedings - International Conference on Computational Linguistics, COLING |
|---|---|
| Publisher | Association for Computational Linguistics (ACL) |
| ISSN (Print) | 2951-2093 |
Conference
| Conference | 29th International Conference on Computational Linguistics, COLING 2022 |
|---|---|
| Place | Korea, Republic of |
| City | Gyeongju |
| Period | 12/10/22 → 17/10/22 |
| Internet address |
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'Language-specific Effects on Automatic Speech Recognition Errors for World Englishes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver