Towards Robust Knowledge Tracing Models via k-Sparse Attention

Shuyan Huang, Zitao Liu*, Xiangyu Zhao, Weiqi Luo, Jian Weng

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

35 Citations (Scopus)

Abstract

Knowledge tracing (KT) is the problem of predicting students' future performance based on their historical interaction sequences. With the advanced capability of capturing contextual long-term dependency, attention mechanism becomes one of the essential components in many deep learning based KT (DLKT) models. In spite of the impressive performance achieved by these attentional DLKT models, many of them are often vulnerable to run the risk of overfitting, especially on small-scale educational datasets. Therefore, in this paper, we propose sparseKT, a simple yet effective framework to improve the robustness and generalization of the attention based DLKT approaches. Specifically, we incorporate a k-selection module to only pick items with the highest attention scores. We propose two sparsification heuristics: (1) soft-thresholding sparse attention and (2) top-K sparse attention. We show that our sparseKT is able to help attentional KT models get rid of irrelevant student interactions and improve the predictive performance when compared to 11 state-of-the-art KT models on three publicly available real-world educational datasets. To encourage reproducible research, we make our data and code publicly available at https://github.com/pykt-team/pykt-toolkit. © 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM
Original languageEnglish
Title of host publicationSIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
PublisherAssociation for Computing Machinery
Pages2441–2445
ISBN (Print)9781450394086
DOIs
Publication statusPublished - 18 Jul 2023
Event46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023) - Hybrid, Taipei International Convention Center, Taipei, Taiwan, China
Duration: 23 Jul 202327 Jul 2023
https://sigir.org/sigir2023/

Publication series

NameSIGIR - Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)
Abbreviated titleSIGIR '23
PlaceTaiwan, China
CityTaipei
Period23/07/2327/07/23
Internet address

Research Keywords

  • knowledge tracing
  • student modeling
  • AI in education
  • sparse attention
  • deep learning

Fingerprint

Dive into the research topics of 'Towards Robust Knowledge Tracing Models via k-Sparse Attention'. Together they form a unique fingerprint.

Cite this