Usability and Practicality of Speech Recording by Mobile Phones for Phonetic Analysis

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) > 32_Refereed conference paper (with ISBN/ISSN) > peer-review

1 Scopus Citations

Author(s)

Guan, Yihan; Li, Bin

Detail(s)

Original language: English
Title of host publication: 2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Publisher: IEEE
ISBN (Electronic): 9781728169941
ISBN (Print): 9781728169958
Publication status: Published - Jan 2021

Publication series

Name: International Symposium on Chinese Spoken Language Processing, ISCSLP

Conference

Title: 12th International Symposium on Chinese Spoken Language Processing (ISCSLP 2021)
Location: Virtual
Place: China
City: Hong Kong
Period: 24 - 26 January 2021

Abstract

High-quality speech recording is critical to phonetic analysis. However, when professional equipment or a sound-proof booth is not accessible, such as in random sampling or during the current pandemic period, is it reliable and valid to use non-professional devices to record speech data? We selected ten devices and examined the frequency range and signal-to-noise ratio (SNR) of the speech data they recorded. We also compared recordings made in a quiet room with recordings made in moderate-level noise. The results showed that all devices recorded a wide frequency range that covered speech frequencies well. However, their SNRs differed significantly, and environmental noise also appeared to affect recording quality. We then analyzed fine-grained phonetic parameters of the data recorded in the quiet room, including suprasegmental, segmental and phonation-related parameters. F0 was found to be relatively consistent across recordings from all devices, but certain differences were captured in F1, F2 and Center of Gravity (CoG). F3 and the parameters relevant to phonation analysis, on the other hand, showed high variation. Our findings therefore suggest that non-professional devices such as mobile phones are reliable substitutes for professional recorders, at least in prosodic analysis for general purposes. Caution should be taken when values of F3 and phonation-related parameters are involved.
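
For readers unfamiliar with two of the measures named in the abstract, the following is a minimal sketch of how SNR and CoG are commonly defined. It is not the authors' procedure, which the abstract does not describe. It assumes SNR is the power ratio between a speech stretch and a noise-only stretch of the same recording, and that CoG is the power-weighted mean frequency of a short frame. The function names and the synthetic signals are hypothetical and are not taken from the paper.

```python
import cmath
import math


def mean_power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in samples) / len(samples)


def snr_db(speech, noise):
    """SNR in dB: power of a speech stretch over power of a noise-only
    stretch (for example a pause) taken from the same recording."""
    return 10.0 * math.log10(mean_power(speech) / mean_power(noise))


def center_of_gravity(samples, sample_rate):
    """Power-weighted mean frequency in Hz of one frame, via a naive DFT.

    This is one common definition of CoG; analysis tools may use a
    different spectral weighting.
    """
    n = len(samples)
    weighted_sum = 0.0
    total_power = 0.0
    for k in range(n // 2 + 1):  # non-negative frequencies only
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(samples))
        power = abs(coeff) ** 2
        weighted_sum += (k * sample_rate / n) * power
        total_power += power
    return weighted_sum / total_power


# Synthetic stand-ins for real audio (assumed values, not data from the paper).
sr = 8000  # sampling rate in Hz
tone = [math.sin(2 * math.pi * 1000 * i / sr) for i in range(200)]
quiet = [0.01 * math.sin(2 * math.pi * 50 * i / sr) for i in range(160)]

print(round(snr_db(tone, quiet), 2))         # about 40 dB for these signals
print(round(center_of_gravity(tone, sr), 1))  # about 1000 Hz for a pure 1 kHz tone
```

The naive DFT keeps the sketch dependency-free. For real recordings, an FFT routine and a windowed frame would be the more usual choice.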

Research Area(s)

  • CoG, F0, mobile phone, SNR, vowel formants

Citation Format(s)

Usability and Practicality of Speech Recording by Mobile Phones for Phonetic Analysis. / Guan, Yihan; Li, Bin.

2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP). IEEE, 2021. 9362082 (International Symposium on Chinese Spoken Language Processing, ISCSLP).
