L1-L2 Parallel Treebank of Learner Chinese : Overused and Underused Syntactic Structures
Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | LREC 2018, Eleventh International Conference on Language Resources and Evaluation Proceedings |
Editors | Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga |
Publisher | European Language Resources Association (ELRA) |
Pages | 4106-4110 |
ISBN (Print) | 9791095546009 |
Publication status | Published - May 2018 |
Publication series
Name | LREC 2018, Eleventh International Conference on Language Resources and Evaluation |
---|
Conference
Title | Eleventh International Conference on Language Resources and Evaluation (LREC 2018) |
---|---|
Location | Phoenix Seagaia Conference Center |
Place | Japan |
City | Miyazaki |
Period | 7 - 12 May 2018 |
Link(s)
Abstract
We present a preliminary analysis on a corpus of texts written by learners of Chinese as a foreign language (CFL), annotated in the form of an L1-L2 parallel dependency treebank. The treebank consists of parse trees of sentences written by CFL learners (“L2 sentences”), parse trees of their target hypotheses (“L1 sentences”), and word alignment between the L1 sentences and L2 sentences. Currently, the treebank consists of 600 L2 sentences and 697 L1 sentences. We report the most overused and underused syntactic relations by the CFL learners, and discuss the underlying learner errors.
Research Area(s)
- learner corpus, parallel treebank, Chinese as a foreign language
Citation Format(s)
L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures. / Li, Keying; Lee, John.
LREC 2018, Eleventh International Conference on Language Resources and Evaluation Proceedings. ed. / Nicoletta Calzolari; Khalid Choukri; Christopher Cieri; Thierry Declerck; Sara Goggi; Koiti Hasida; Hitoshi Isahara; Bente Maegaard; Joseph Mariani; Hélène Mazo; Asuncion Moreno; Jan Odijk; Stelios Piperidis; Takenobu Tokunaga. European Language Resources Association (ELRA), 2018. p. 4106-4110 (LREC 2018, Eleventh International Conference on Language Resources and Evaluation).
LREC 2018, Eleventh International Conference on Language Resources and Evaluation Proceedings. ed. / Nicoletta Calzolari; Khalid Choukri; Christopher Cieri; Thierry Declerck; Sara Goggi; Koiti Hasida; Hitoshi Isahara; Bente Maegaard; Joseph Mariani; Hélène Mazo; Asuncion Moreno; Jan Odijk; Stelios Piperidis; Takenobu Tokunaga. European Language Resources Association (ELRA), 2018. p. 4106-4110 (LREC 2018, Eleventh International Conference on Language Resources and Evaluation).
Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with host publication) › peer-review