Register-sensitive Translation: A Case Study of Mandarin and Cantonese

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45); 32_Refereed conference paper (with host publication); peer-review

4 Scopus Citations

Detail(s)

Original language: English
Title of host publication: Proceedings of the 13th Conference of the Association for Machine Translation in the Americas 2018
Editors: Colin Cherry, Graham Neubig
Publisher: Association for Machine Translation in the Americas
Pages: 89-96
Volume: 1
ISBN (Print): 9781510867260
Publication status: Published - Mar 2018

Publication series

Name: AMTA - Conference of the Association for Machine Translation in the Americas, Proceedings

Meeting

Title: 13th Conference of the Association for Machine Translation in the Americas (AMTA 2018)
Location: Aloft Boston Seaport
Place: United States
City: Boston
Period: 17 - 21 March 2018

Abstract

This paper describes an approach for translation between Chinese dialects that can produce target sentences at different registers. We focus on Mandarin as the source language, and Cantonese as the target. Mutually unintelligible, these two varieties of Chinese exhibit differences at both the lexical and syntactic levels, and the extent of the difference can vary considerably depending on the register of Cantonese. Since only a modest amount of parallel data is available, we adopt a knowledge-based approach and exploit lexical mappings and syntactic transformations from linguistics research. Our system parses a source sentence, uses register-annotated lexical mappings to translate words, and then performs word reordering through syntactic transformations. Evaluation shows that translation models that match the required register of the target sentences yield better translation quality.
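The pipeline outlined in the abstract (lexical mapping by register, followed by rule-based reordering) might be sketched roughly as below. This is only an illustration under stated assumptions, not the authors' system: the register labels `colloquial` and `formal`, the tiny lexicon, the single reordering rule, and all function names are hypothetical, and the input is assumed to be already segmented and tagged rather than produced by a parser.

```python
from typing import Dict, List, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag); the tagset is an assumption

# Hypothetical register-annotated lexicon: Mandarin word -> {register: Cantonese word}.
# Words with no entry for the requested register are copied through unchanged.
LEXICON: Dict[str, Dict[str, str]] = {
    "他": {"colloquial": "佢"},
    "是": {"colloquial": "係"},
    "不": {"colloquial": "唔"},
    "的": {"colloquial": "嘅"},
}


def map_words(tokens: List[Token], register: str) -> List[Token]:
    """Replace each word with its mapping for the requested register, if any."""
    return [(LEXICON.get(word, {}).get(register, word), pos) for word, pos in tokens]


def postpose_xian(tokens: List[Token]) -> List[Token]:
    """Illustrative reordering rule: move the adverb 先 to follow an adjacent verb."""
    out: List[Token] = []
    i = 0
    while i < len(tokens):
        follows_verb = i + 1 < len(tokens) and tokens[i + 1][1] == "VERB"
        if tokens[i] == ("先", "ADV") and follows_verb:
            out.extend([tokens[i + 1], tokens[i]])  # verb first, then 先
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


def translate(tokens: List[Token], register: str) -> str:
    """Map words for the given register, then reorder (colloquial register only here)."""
    mapped = map_words(tokens, register)
    if register == "colloquial":  # assumption: formal output keeps source word order
        mapped = postpose_xian(mapped)
    return "".join(word for word, _ in mapped)


sentence = [("他", "PRON"), ("不", "ADV"), ("是", "VERB"), ("學生", "NOUN")]
print(translate(sentence, "colloquial"))
print(translate(sentence, "formal"))
```

Keeping the register as a key inside each lexicon entry lets a single table serve several target registers; whether the paper's mappings are organized this way is not stated in the abstract.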

Research Area(s)

  • Machine translation

Citation Format(s)

Register-sensitive Translation: A Case Study of Mandarin and Cantonese. / Wong, Tak-sum; Lee, John.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas 2018. ed. / Colin Cherry; Graham Neubig. Vol. 1. Association for Machine Translation in the Americas, 2018. p. 89-96 (AMTA - Conference of the Association for Machine Translation in the Americas, Proceedings).