Does joint decoding really outperform cascade processing in English-to-Chinese transliteration generation? The role of syllabification

Yan Song, Chunyu Kit

    Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

    2 Citations (Scopus)

    Abstract

    Transliteration is a challengeable task aimed at converting a proper name into another language with phonetic equivalence. Since the conversion relates to the phonetic aspect of a text, syllabification is considered a major factor affecting the performance of a transliteration system. In grapheme-based approaches, there are two routines to transliterate, one is to perform in a pipeline of separate syllabification and other components in generation process step by step, the other is to synchronously segment syllables and generating transliteration options. Usually, joint decoding outperforms the cascade processing in many natural language processing missions, however, syllabification is a special component in transliteration task. Thus in this paper, we investigate the two routines with a systematic analysis and compare their results to illustrate the strength of syllabification. A phrase-based statistical machine translation framework for joint decoding and a conditional random field syllabification system are used in this work for our investigation, which shows a different scenario on the issue of joint decoding versus cascade processing in transliteration. © 2010 IEEE.
    Original languageEnglish
    Title of host publication2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
    Pages3323-3328
    Volume6
    DOIs
    Publication statusPublished - 2010
    Event2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010 - Qingdao, China
    Duration: 11 Jul 201014 Jul 2010

    Publication series

    Name
    Volume6

    Conference

    Conference2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
    Country/TerritoryChina
    CityQingdao
    Period11/07/1014/07/10

    Research Keywords

    • Joint decoding
    • Log-linear model
    • Statistical machine translation
    • Syllabification
    • Transliteration

    Fingerprint

    Dive into the research topics of 'Does joint decoding really outperform cascade processing in English-to-Chinese transliteration generation? The role of syllabification'. Together they form a unique fingerprint.

    Cite this