L1-L2 Parallel Treebank of Learner Chinese : Overused and Underused Syntactic Structures

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review

View graph of relations

Detail(s)

Original languageEnglish
Title of host publicationLREC 2018, Eleventh International Conference on Language Resources and Evaluation Proceedings
EditorsNicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
PublisherEuropean Language Resources Association (ELRA)
Pages4106-4110
ISBN (Print)9791095546009
Publication statusPublished - May 2018

Publication series

NameLREC 2018, Eleventh International Conference on Language Resources and Evaluation

Conference

TitleEleventh International Conference on Language Resources and Evaluation (LREC 2018)
LocationPhoenix Seagaia Conference Center
PlaceJapan
CityMiyazaki
Period7 - 12 May 2018

Abstract

We present a preliminary analysis on a corpus of texts written by learners of Chinese as a foreign language (CFL), annotated in the form of an L1-L2 parallel dependency treebank. The treebank consists of parse trees of sentences written by CFL learners (“L2 sentences”), parse trees of their target hypotheses (“L1 sentences”), and word alignment between the L1 sentences and L2 sentences. Currently, the treebank consists of 600 L2 sentences and 697 L1 sentences. We report the most overused and underused syntactic relations by the CFL learners, and discuss the underlying learner errors.

Research Area(s)

  • learner corpus, parallel treebank, Chinese as a foreign language

Citation Format(s)

L1-L2 Parallel Treebank of Learner Chinese: Overused and Underused Syntactic Structures. / Li, Keying; Lee, John.
LREC 2018, Eleventh International Conference on Language Resources and Evaluation Proceedings. ed. / Nicoletta Calzolari; Khalid Choukri; Christopher Cieri; Thierry Declerck; Sara Goggi; Koiti Hasida; Hitoshi Isahara; Bente Maegaard; Joseph Mariani; Hélène Mazo; Asuncion Moreno; Jan Odijk; Stelios Piperidis; Takenobu Tokunaga. European Language Resources Association (ELRA), 2018. p. 4106-4110 (LREC 2018, Eleventh International Conference on Language Resources and Evaluation).

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review