Robustness of Hybrid Models in Cross-domain Readability Assessment
Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Proceedings of the 20th Workshop of the Australasian Language Technology Association |
Subtitle of host publication | ALTA 2022 |
Publisher | Australasian Language Technology Association |
Pages | 62-67 |
Publication status | Published - Dec 2022 |
Conference
Title | 20th Annual Workshop of the Australasian Language Technology Association (ALTA 2022) |
---|---|
Location | Flinders University |
Place | Australia |
City | Adelaide |
Period | 14 - 16 December 2022 |
Link(s)
Document Link | Links
|
---|---|
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(86e3eba5-8164-429b-a454-c41c11a03ff7).html |
Abstract
Recent studies in automatic readability assessment have shown that hybrid models — models that leverage both linguistically motivated features and neural models — can outperform neural models. However, most evaluations on hybrid models have been based on in-domain data in English. This paper provides further evidence on the contribution of linguistic features by reporting the first direct comparison between hybrid, neural and linguistic models on cross-domain data. In experiments on a Chinese dataset, the hybrid model outperforms the neural model on both in-domain and cross-domain data. Importantly, the hybrid model exhibits much smaller performance degradation in the cross-domain setting, suggesting that the linguistic features are more robust and can better capture salient indicators of text difficulty.
Citation Format(s)
Robustness of Hybrid Models in Cross-domain Readability Assessment. / Lim, Ho Hung; Cai, Tianyuan; Lee, John S. Y. et al.
Proceedings of the 20th Workshop of the Australasian Language Technology Association: ALTA 2022. Australasian Language Technology Association, 2022. p. 62-67.Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review