Projects per year
Abstract
The importance of multiword expressions (MWEs) for language learning is well established. While MWE research has been evaluated on various downstream tasks such as syntactic parsing and machine translation, its applications in computer-assisted language learning has been less explored. This paper investigates the selection of MWEs for graded vocabulary lists. Widely used by language teachers and students, these lists recommend a language acquisition sequence to optimize learning efficiency. We automatically generate these lists using difficulty-graded corpora and MWEs extracted based on semantic compositionality. We evaluate these lists on their ability to facilitate text comprehension for learners. Experimental results show that our proposed method generates higher-quality lists than baselines using collocation measures. ©2023 Association for Computational Linguistics
| Original language | English |
|---|---|
| Title of host publication | The 19th Workshop on Multiword Expressions (MWE 2023) |
| Subtitle of host publication | Proceedings of the Workshop |
| Publisher | Association for Computational Linguistics |
| Pages | 81–86 |
| ISBN (Electronic) | 978-1-959429-59-3 |
| DOIs | |
| Publication status | Published - 6 May 2023 |
| Event | 19th Workshop on Multiword Expressions (MWE 2023) - Hybrid, Dubrovnik, Croatia Duration: 6 May 2023 → 6 May 2023 https://multiword.org/mwe2023/ |
Publication series
| Name | Workshop on Multiword Expressions, MWE - Proceedings |
|---|
Conference
| Conference | 19th Workshop on Multiword Expressions (MWE 2023) |
|---|---|
| Place | Croatia |
| City | Dubrovnik |
| Period | 6/05/23 → 6/05/23 |
| Internet address |
Funding
This work was partly supported by the Language Fund from the Standing Committee on Language Education and Research (project EDB(LE)/P&R/EL/203/14) and by the General Research Fund (project 11207320).
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Automatic Generation of Vocabulary Lists with Multiword Expressions'. Together they form a unique fingerprint.Projects
- 2 Finished
-
LF: A Text Difficulty Analysis Tool for Developing Extra-Curricular Reading Materials
LEE, J. S. Y. (Principal Investigator / Project Coordinator) & LIU, M. (Co-Investigator)
1/10/21 → 31/12/24
Project: Research
-
GRF: Semantic Modeling for Sentence-level Readability Assessment
LEE, J. S. Y. (Principal Investigator / Project Coordinator), LIU, M. (Co-Investigator) & Sun, W. (Co-Investigator)
1/01/21 → 17/06/24
Project: Research