TY - GEN
T1 - Does joint decoding really outperform cascade processing in English-to-Chinese transliteration generation? The role of syllabification
AU - Song, Yan
AU - Kit, Chunyu
PY - 2010
Y1 - 2010
N2 - Transliteration is a challengeable task aimed at converting a proper name into another language with phonetic equivalence. Since the conversion relates to the phonetic aspect of a text, syllabification is considered a major factor affecting the performance of a transliteration system. In grapheme-based approaches, there are two routines to transliterate, one is to perform in a pipeline of separate syllabification and other components in generation process step by step, the other is to synchronously segment syllables and generating transliteration options. Usually, joint decoding outperforms the cascade processing in many natural language processing missions, however, syllabification is a special component in transliteration task. Thus in this paper, we investigate the two routines with a systematic analysis and compare their results to illustrate the strength of syllabification. A phrase-based statistical machine translation framework for joint decoding and a conditional random field syllabification system are used in this work for our investigation, which shows a different scenario on the issue of joint decoding versus cascade processing in transliteration. © 2010 IEEE.
AB - Transliteration is a challengeable task aimed at converting a proper name into another language with phonetic equivalence. Since the conversion relates to the phonetic aspect of a text, syllabification is considered a major factor affecting the performance of a transliteration system. In grapheme-based approaches, there are two routines to transliterate, one is to perform in a pipeline of separate syllabification and other components in generation process step by step, the other is to synchronously segment syllables and generating transliteration options. Usually, joint decoding outperforms the cascade processing in many natural language processing missions, however, syllabification is a special component in transliteration task. Thus in this paper, we investigate the two routines with a systematic analysis and compare their results to illustrate the strength of syllabification. A phrase-based statistical machine translation framework for joint decoding and a conditional random field syllabification system are used in this work for our investigation, which shows a different scenario on the issue of joint decoding versus cascade processing in transliteration. © 2010 IEEE.
KW - Joint decoding
KW - Log-linear model
KW - Statistical machine translation
KW - Syllabification
KW - Transliteration
UR - http://www.scopus.com/inward/record.url?scp=78149299108&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-78149299108&origin=recordpage
U2 - 10.1109/ICMLC.2010.5580674
DO - 10.1109/ICMLC.2010.5580674
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9781424465262
VL - 6
SP - 3323
EP - 3328
BT - 2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
T2 - 2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Y2 - 11 July 2010 through 14 July 2010
ER -