SCA-CLS : A New Semantic-Context-Aware Framework for Community-Oriented Lexical Simplification
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings, Part I |
Editors | Fei Liu, Nan Duan, Qingting Xu, Yu Hong |
Publisher | Springer, Cham |
Pages | 69-81 |
ISBN (electronic) | 9783031446931 |
ISBN (print) | 9783031446924 |
Publication status | Published - 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 14302 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (electronic) | 1611-3349 |
Conference
Title | 12th National CCF Conference on Natural Language Processing and Chinese Computing (NLPCC 2023) |
---|---|
Place | China |
City | Foshan |
Period | 12 - 15 October 2023 |
Link(s)
Abstract
Community-oriented lexical simplification aims to transform complex words within a sentence into semantically consistent but simple substitute words from a community-specific vocabulary. Most state-of-the-art contextual word embedding models generate substitutes by extracting contextual information of complex words. Although these models take context into account, they fail to capture rich semantics of complex words with polysemy, resulting in many spurious and semantically non-equivalent candidates. Thus, this paper proposes a novel Semantic-Context-Aware framework for Community-oriented Lexical Simplification (SCA-CLS), which integrates gloss (sense definition) into BERT to identify the actual sense of the complex word (especially for polysemy) in current context and ranks substitutes by proposed gloss similarity. In addition, a new complexity feature is proposed to enhance substitute ranking. Experiment results on Wikipedia dataset show that SCA-CLS outperforms the state-of-the-art Merge-Sort model on both substitute generation and ranking tasks, indicating its effectiveness for community-oriented lexical simplification. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Research Area(s)
- BERT, Gloss, Lexical simplification, Ranking, Semantic
Citation Format(s)
SCA-CLS: A New Semantic-Context-Aware Framework for Community-Oriented Lexical Simplification. / Li, Rongying; Xie, Wenxiu; Lee, John et al.
Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings, Part I. ed. / Fei Liu; Nan Duan; Qingting Xu; Yu Hong. Springer, Cham, 2023. p. 69-81 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14302 LNAI).
Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings, Part I. ed. / Fei Liu; Nan Duan; Qingting Xu; Yu Hong. Springer, Cham, 2023. p. 69-81 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14302 LNAI).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review