Abstract
A lexical simplification (LS) system substitutes difficult words in a text with simpler ones to make it easier for the user to understand. In the typical LS pipeline, the Substitution Ranking step determines the best substitution out of a set of candidates. Most current systems do not consider the user’s vocabulary proficiency, and always aim for the simplest candidate. This approach may overlook less-simple candidates that the user can understand, and that are semantically closer to the original word. We propose a personalized approach for Substitution Ranking to identify the candidate that is the closest synonym and is non-complex for the user. In experiments on learners of English at different proficiency levels, we show that this approach enhances the semantic faithfulness of the output, at the cost of a relatively small increase in the number of complex words.
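The ranking idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Candidate` class, the `similarity` scores, and the `known_words` set standing in for a model of the user's vocabulary are all hypothetical names introduced here for illustration. Among the candidates the user is assumed to understand, the sketch picks the one closest in meaning to the original word.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set


@dataclass(frozen=True)
class Candidate:
    """A substitution candidate (hypothetical structure, for illustration only)."""
    word: str
    similarity: float  # assumed semantic closeness to the original word; higher is closer


def rank_personalized(candidates: Iterable[Candidate],
                      known_words: Set[str]) -> Optional[Candidate]:
    """Return the closest synonym among candidates the user is assumed to know.

    `known_words` is an assumption: a stand-in for whatever model predicts
    which words are non-complex for this particular user.
    """
    understandable = [c for c in candidates if c.word in known_words]
    if not understandable:
        return None  # no candidate is non-complex for this user
    return max(understandable, key=lambda c: c.similarity)


# Made-up candidates and scores for a difficult original word.
candidates = [
    Candidate("big", 0.60),
    Candidate("huge", 0.80),
    Candidate("colossal", 0.95),
]

beginner = rank_personalized(candidates, {"big"})
intermediate = rank_personalized(candidates, {"big", "huge"})
```

Under these made-up scores, a user who knows only "big" receives "big", while a user who also knows "huge" receives the semantically closer "huge", rather than the single simplest candidate for everyone.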
Original language | English |
---|---|
Title of host publication | INLG 2019 - The 12th International Conference on Natural Language Generation |
Subtitle of host publication | Proceedings of the Conference |
Editors | Kees van Deemter, Chenghua Lin, Hiroya Takamura |
Publisher | Association for Computational Linguistics |
Pages | 258-267 |
ISBN (Print) | 9781950737949 |
Publication status | Published - Oct 2019 |
Event | 12th International Conference on Natural Language Generation (INLG 2019), National Museum of Emerging Science and Innovation (Miraikan), Tokyo, Japan. Duration: 29 Oct 2019 → 1 Nov 2019. https://www.inlg2019.com/ |
Publication series
Name | INLG - International Conference on Natural Language Generation, Proceedings of the Conference |
---|---|
Conference
Conference | 12th International Conference on Natural Language Generation (INLG 2019) |
---|---|
Country/Territory | Japan |
City | Tokyo |
Period | 29 Oct 2019 → 1 Nov 2019 |