A corpus-based analysis of mixed code in Hong Kong speech

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review

4 Scopus Citations
View graph of relations

Author(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - 2012 International Conference on Asian Language Processing, IALP 2012
PublisherIEEE Computer Society
Pages165-168
Publication statusPublished - 2012

Conference

Title2012 International Conference on Asian Language Processing, IALP 2012
PlaceViet Nam
CityHanoi
Period13 - 15 November 2012

Abstract

We present a corpus-based analysis of the use of mixed code in Hong Kong speech. From transcriptions of Cantonese television programs, we identify English words embedded within Cantonese utterances, and investigate the motivations for such code-switching. Among the many motivations observed in previous research, we found that four alone account for more than 95% of the use of English words in our speech data across genres, genders, and age groups. We performed analyses over more than 60 hours of transcribed speech, resulting in one of the largest empirical studies to-date on this linguistic phenomenon. © 2012 IEEE.

Research Area(s)

  • Cantonese, Code-mixing, Code-switching, Corpus linguistics, English

Citation Format(s)

A corpus-based analysis of mixed code in Hong Kong speech. / Lee, John.
Proceedings - 2012 International Conference on Asian Language Processing, IALP 2012. IEEE Computer Society, 2012. p. 165-168 6473722.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review