Enhancing Automatic Readability Assessment with Pre-training and Soft Labels for Ordinal Regression

Jinshan Zeng, Yudong Xie, Xianglong Yu, John S. Y. Lee, Ding-Xuan Zhou

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

10 Citations (Scopus)
116 Downloads (CityUHK Scholars)

Abstract

The readability assessment task aims to assign a difficulty grade to a text. While neural models have recently demonstrated impressive performance, most do not exploit the ordinal nature of the difficulty grades, and make little effort for model initialization to facilitate fine-tuning. We address these limitations with soft labels for ordinal regression, and with model pre-training through prediction of pairwise relative text difficulty. We incorporate these two components into a model based on hierarchical attention networks, and evaluate its performance on both English and Chinese datasets. Experimental results show that our proposed model outperforms competitive neural models and statistical classifiers on most datasets.
Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics
Subtitle of host publicationEMNLP 2022
EditorsYoav Goldberg, Zornitsa Kozareva, Yue Zhang
PublisherAssociation for Computational Linguistics
Pages4586-4597
Publication statusPublished - Dec 2022
Event2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) - Hybrid, Abu Dhabi, United Arab Emirates
Duration: 7 Dec 202211 Dec 2022
https://2022.emnlp.org/

Conference

Conference2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period7/12/2211/12/22
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Enhancing Automatic Readability Assessment with Pre-training and Soft Labels for Ordinal Regression'. Together they form a unique fingerprint.

Cite this