Skip to main navigation Skip to search Skip to main content

Register-sensitive Translation: A Case Study of Mandarin and Cantonese

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

This paper describes an approach for translation between Chinese dialects that can produce target sentences at different registers. We focus on Mandarin as the source language, and Cantonese as the target. Mutually unintelligible, these two varieties of Chinese exhibit differences at both the lexical and syntactic levels, and the extent of the difference can vary considerably depending on the register of Cantonese. Since only a modest amount of parallel data is available, we adopt a knowledge-based approach and exploit lexical mappings and syntactic transformations from linguistics research. Our system parses a source sentence, uses register-annotated lexical mappings to translate words, and then performs word reordering through syntactic transformations. Evaluation shows that translation models that match the required register of the target sentences yield better translation quality.
Original languageEnglish
Title of host publicationProceedings of the 13th Conference of the Association for Machine Translation in the Americas 2018
EditorsColin Cherry, Graham Neubig
PublisherAssociation for Machine Translation in the Americas
Pages89-96
Volume1
ISBN (Print)9781510867260
Publication statusPublished - Mar 2018
Event13th Conference of the Association for Machine Translation in the Americas (AMTA 2018) - Aloft Boston Seaport, Boston, United States
Duration: 17 Mar 201821 Mar 2018
http://www.conference.amtaweb.org/

Publication series

NameAMTA - Conference of the Association for Machine Translation in the Americas, Proceedings

Meeting

Meeting13th Conference of the Association for Machine Translation in the Americas (AMTA 2018)
Abbreviated titleAMTA 2018
PlaceUnited States
CityBoston
Period17/03/1821/03/18
Internet address

Research Keywords

  • Machine translation

Fingerprint

Dive into the research topics of 'Register-sensitive Translation: A Case Study of Mandarin and Cantonese'. Together they form a unique fingerprint.

Cite this