Skip to main navigation Skip to search Skip to main content

Automatic Evaluation of Multiple-Choice Items for Reading Comprehension: Effects of Question and Distractor Categories

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Automatic generation of multiple-choice (MC) items for reading comprehension can support language learning by providing large amounts of practice materials. To enable rapid development of MC generation models, automatic assessment is essential since it is time-consuming to manually evaluate question and distractor quality. Although Text Informativity (TI) has been adopted as an automatic evaluation metric, the ability of Large Language Models (LLMs) to estimate the TI scores of different categories of questions and distractors has not yet been thoroughly analyzed. This paper investigates LLM performance in calculating TI scores for the range of questions and distractors defined in the PIRLS (Progress in International Reading Literacy Study) and STARC (Structured Annotations for Reading Comprehension) frameworks. We show that automatically estimated TI scores may result in systematic preferences for some question and distractor categories, and recommend that TI scores be used for within-category comparisons only.
©ELRA Language Resources Association (ELRA), 2026
Original languageEnglish
Title of host publicationProceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026
EditorsArturo Montejo-Ráez, Cristina Grisot, Joanna Blochowiak
PublisherEuropean Language Resources Association (ELRA)
Pages170-174
ISBN (Electronic)978-2-493814-85-2
Publication statusPublished - 11 May 2026
Event15th International Conference on Language Resources and Evaluation - Palau de Congressos de Palma, Palma, Spain
Duration: 11 May 202616 May 2026
https://lrec2026.info/

Conference

Conference15th International Conference on Language Resources and Evaluation
Abbreviated titleLREC 2026
PlaceSpain
CityPalma
Period11/05/2616/05/26
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

Research Keywords

  • multiple-choice items
  • text informativity
  • question generation
  • distractor generation

Fingerprint

Dive into the research topics of 'Automatic Evaluation of Multiple-Choice Items for Reading Comprehension: Effects of Question and Distractor Categories'. Together they form a unique fingerprint.

Cite this