Detection of non-native sentences using machine-translated training data
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 93-96 |
Publication status | Published - 2007 |
Externally published | Yes |
Publication series
Name | NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers |
---|
Conference
Title | 2007 Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, NAACL-HLT 2007 |
---|---|
Place | United States |
City | Rochester |
Period | 22 - 27 April 2007 |
Link(s)
Abstract
Training statistical models to detect nonnative sentences requires a large corpus of non-native writing samples, which is often not readily available. This paper examines the extent to which machinetranslated (MT) sentences can substitute as training data. Two tasks are examined. For the native vs non-native classification task, nonnative training data yields better performance; for the ranking task, however, models trained with a large, publicly available set of MT data perform as well as those trained with non-native data.
Bibliographic Note
Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].
Citation Format(s)
Detection of non-native sentences using machine-translated training data. / Lee, John; Zhou, Ming; Liu, Xiaohua.
NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers. Association for Computational Linguistics (ACL), 2007. p. 93-96 (NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers).
NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers. Association for Computational Linguistics (ACL), 2007. p. 93-96 (NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review