Identifying the cause of performance issues of Pretrained Language Model for Educational Technology

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Online self-learning platforms, such as questionanswering (Q&A) websites, are popular technology for learning, and utilising Pretrained Language Models (PLMs) to maintain their content qualities is a common practice. However, it is challenging to identify the cause that affects the performance of PLMs. In this study, we propose using machine common sense to identify the cause affecting the performance of PLMs. We conducted an empirical experiment with three PLMs using a publicly available dataset. We select 45000 data points as the training data and 1000 as the testing data. We first train the PLMs with training data, then run machine common sense tests to examine their reasoning abilities. We define the causal relationship between content quality and reasoning ability, and use the cause to derive Metamorphic Relations (MR) of Metamorphic Testing (MT) for creating follow-up testing datasets. We analyse the changes between source and follow-up outputs to see whether the identified cause affects the performance. Results show that the reasonableness of the content is the cause that affects the performance of PLM, which has reasoning abilities. In addition, the proposed approach in this study is effective for identifying and validating the cause that affects the performance of PLMs, even on devices with limited computer resources. Future research can apply our approach and seek different machine common sense tests and counterfactual analysing techniques to identify different causes of performance issues of different PLMs. © 2025 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2025 International Symposium on Educational Technology
Subtitle of host publicationISET 2025
PublisherIEEE
Pages179-183
ISBN (Electronic)9798331595500
ISBN (Print)979-8-3315-9551-7
DOIs
Publication statusPresented - 24 Jul 2025
Event11th International Symposium on Educational Technology (ISET 2025) - Shangri-La Bangkok, Bangkok, Thailand
Duration: 22 Jul 202525 Jul 2025
https://hksmic.org.hk/iset/2025/

Publication series

NameProceedings - International Symposium on Educational Technology, ISET
ISSN (Print)2766-2128
ISSN (Electronic)2766-2144

Conference

Conference11th International Symposium on Educational Technology (ISET 2025)
Abbreviated titleISET
PlaceThailand
CityBangkok
Period22/07/2525/07/25
Internet address

Funding

This work is supported in part by the General Research Fund of the Research Grants Council of Hong Kong and the research funds of the City University of Hong Kong (6000796 6000796, 9229109, 9229098, 9220103, 9229029).

Research Keywords

  • Content quality prediction
  • causality
  • machine common sense
  • metamorphic testing
  • self-learning platform
  • pre-trained language model

Fingerprint

Dive into the research topics of 'Identifying the cause of performance issues of Pretrained Language Model for Educational Technology'. Together they form a unique fingerprint.

Cite this