Abstract
This paper describes an approach for translation between Chinese dialects that can produce target sentences at different registers. We focus on Mandarin as the source language, and Cantonese as the target. Mutually unintelligible, these two varieties of Chinese exhibit differences at both the lexical and syntactic levels, and the extent of the difference can vary considerably depending on the register of Cantonese. Since only a modest amount of parallel data is available, we adopt a knowledge-based approach and exploit lexical mappings and syntactic transformations from linguistics research. Our system parses a source sentence, uses register-annotated lexical mappings to translate words, and then performs word reordering through syntactic transformations. Evaluation shows that translation models that match the required register of the target sentences yield better translation quality.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 13th Conference of the Association for Machine Translation in the Americas 2018 |
| Editors | Colin Cherry, Graham Neubig |
| Publisher | Association for Machine Translation in the Americas |
| Pages | 89-96 |
| Volume | 1 |
| ISBN (Print) | 9781510867260 |
| Publication status | Published - Mar 2018 |
| Event | 13th Conference of the Association for Machine Translation in the Americas (AMTA 2018) - Aloft Boston Seaport, Boston, United States Duration: 17 Mar 2018 → 21 Mar 2018 http://www.conference.amtaweb.org/ |
Publication series
| Name | AMTA - Conference of the Association for Machine Translation in the Americas, Proceedings |
|---|
Meeting
| Meeting | 13th Conference of the Association for Machine Translation in the Americas (AMTA 2018) |
|---|---|
| Abbreviated title | AMTA 2018 |
| Place | United States |
| City | Boston |
| Period | 17/03/18 → 21/03/18 |
| Internet address |
Research Keywords
- Machine translation
Fingerprint
Dive into the research topics of 'Register-sensitive Translation: A Case Study of Mandarin and Cantonese'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver