Abstract
In this paper, we discuss techniques to combine an interlingua translation framework with phrase-based statistical methods, for translation from Chinese into English. Our goal is to achieve high-quality translation, suitable for use in language tutoring applications. We explore these ideas in the context of a flight domain, for which we have a large corpus of English queries, obtained from users interacting with a dialogue system. Our techniques exploit a pre-existing English-to-Chinese translation system to automatically produce a synthetic bilingual corpus. Several experiments were conducted combining linguistic and statistical methods, and manual evaluation was conducted for a set of 460 Chinese sentences. The best performance achieved an "adequate" or better analysis (3 or above rating) on nearly 94% of the 409 parsable subset. Using a Rover scheme to combine four systems resulted in an "adequate or better" rating for 88% of all the utterances. © 2006 The Association for Machine Translation in the Americas.
Original language | English |
---|---|
Title of host publication | AMTA 2006 - Proceedings of the 7th Conference of the Association for Machine Translation of the Americas: Visions for the Future of Machine Translation |
Pages | 213-222 |
Publication status | Published - 2006 |
Externally published | Yes |
Event | 7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006 - Cambridge, MA, United States Duration: 8 Aug 2006 → 12 Aug 2006 |
Conference
Conference | 7th Biennial Conference of the Association for Machine Translation in the Americas, AMTA 2006 |
---|---|
Country/Territory | United States |
City | Cambridge, MA |
Period | 8/08/06 → 12/08/06 |