Character Set Construction for Chinese Language Learning

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

1 Citation (Scopus)
85 Downloads (CityUHK Scholars)

Abstract

To promote efficient learning of Chinese characters, pedagogical materials may present not only a single character, but a set of characters that are related in meaning and in written form. This paper investigates automatic construction of these character sets. The proposed model represents a character as averaged word vectors of common words containing the character. It then identifies sets of characters with high semantic similarity through clustering. Human evaluation shows that this representation outperforms direct use of character embeddings, and that the resulting character sets capture distinct semantic ranges.
Original languageEnglish
Title of host publicationProceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications
PublisherAssociation for Computational Linguistics
Pages59–63
ISBN (Electronic)978-1-954085-11-4
ISBN (Print)9781954085114
Publication statusPublished - 20 Apr 2021
Event16th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2021) - Virtual
Duration: 20 Apr 2021 → …
https://sig-edu.org/bea/current

Conference

Conference16th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2021)
Abbreviated titleBEA2021
Period20/04/21 → …
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Character Set Construction for Chinese Language Learning'. Together they form a unique fingerprint.

Cite this