TY - JOUR
T1 - Universal Dependencies for Mandarin Chinese
AU - Poiret, Rafaël
AU - Wong, Tak-Sum
AU - Lee, John
AU - Gerdes, Kim
AU - Leung, Herman
PY - 2023/6
Y1 - 2023/6
N2 - This article presents a Universal Dependency (UD) annotation scheme for Mandarin Chinese, as well as the current UD Chinese HK treebank. Our focus is mainly on parts-of-speech tags and syntactic relations, with a quite large array of phenomena investigated. The main goal is to make transparent the linguistic consideration behind our annotation choices, and show how we articulated these choices with the criteria of Universal Dependencies. This scheme has been developed with reference to two other dependency schemes for this language, i.e. the Chinese Stanford Dependencies (Chang et al., 2009) and the Chinese Dependency Treebank (HIT-SCIR, 2010). We provide mappings between our scheme and the two others. The content of the UD Chinese HK treebank is discussed in relation to the other UD treebanks for Chinese, and the inter-annotator agreement on POS and dependency annotation is reported. Our proposed scheme is motivated by reasoned linguistic analysis, is suitable for cross-linguistic comparison, and produced a high level of agreement between annotators.© The Author(s), under exclusive licence to Springer Nature B.V. 2021, corrected publication 2022
AB - This article presents a Universal Dependency (UD) annotation scheme for Mandarin Chinese, as well as the current UD Chinese HK treebank. Our focus is mainly on parts-of-speech tags and syntactic relations, with a quite large array of phenomena investigated. The main goal is to make transparent the linguistic consideration behind our annotation choices, and show how we articulated these choices with the criteria of Universal Dependencies. This scheme has been developed with reference to two other dependency schemes for this language, i.e. the Chinese Stanford Dependencies (Chang et al., 2009) and the Chinese Dependency Treebank (HIT-SCIR, 2010). We provide mappings between our scheme and the two others. The content of the UD Chinese HK treebank is discussed in relation to the other UD treebanks for Chinese, and the inter-annotator agreement on POS and dependency annotation is reported. Our proposed scheme is motivated by reasoned linguistic analysis, is suitable for cross-linguistic comparison, and produced a high level of agreement between annotators.© The Author(s), under exclusive licence to Springer Nature B.V. 2021, corrected publication 2022
KW - Chinese
KW - Universal dependencies
KW - Treebank
KW - Annotation scheme
UR - http://www.scopus.com/inward/record.url?scp=85120608857&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85120608857&origin=recordpage
U2 - 10.1007/s10579-021-09564-2
DO - 10.1007/s10579-021-09564-2
M3 - RGC 21 - Publication in refereed journal
SN - 1574-020X
VL - 57
SP - 673
EP - 710
JO - Language Resources and Evaluation
JF - Language Resources and Evaluation
IS - 2
ER -