Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 1139703 |
Journal / Publication | Frontiers in Psychology |
Volume | 14 |
Online published | 15 Feb 2023 |
Publication status | Published - 2023 |
Link(s)
DOI | DOI |
---|---|
Attachment(s) | Documents
Publisher's Copyright Statement
|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85149491308&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(845ad1d4-40d8-424c-b828-49fcc3584091).html |
Abstract
Grammatical complexity has received extensive attention in second language acquisition. Although computational tools have been developed to analyze grammatical complexity, most relevant studies investigated this construct in the context of English as a second language. In response to an increasing number of L2 Chinese learners, it is important to extend the investigation of grammatical complexity in L2 Chinese. To promote relevant research, we evaluated the new computational tool, Stanza, on its accuracy of part-of-speech tagging for L2 Chinese writing. We particularly focused on eight grammatical features closely related to L2Chinese development. Then, we reported the precisions, recalls, and F-scores for the individual grammatical features and offered a qualitative analysis of systematic tagging errors. In terms of the precision, three features have high rates, over 90% (i.e.,ba and bei markers, classifiers, -de as noun modifier marker). For recall, four features have high rates, over 90% (i.e., aspect markers, ba and bei markers, classifiers, -de as noun modifier marker). Overall, based on the F-scores, Stanza has a good tagging performance on ba and bei markers, classifiers, and -de as a noun modifier marker. This evaluation provides research implications for scholars who plan to use this computational tool to study L2 Chinese development in second language acquisition or applied linguistics in general.
Research Area(s)
- part of speech tagging, SLA, corpus linguistics, language development, grammatical features, Chinese as a second language
Citation Format(s)
Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context. / Lan, Ge; Pan, Xiaofei; Sun, Yachao et al.
In: Frontiers in Psychology, Vol. 14, 1139703, 2023.
In: Frontiers in Psychology, Vol. 14, 1139703, 2023.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Download Statistics
No data available