Large language model ChatGPT versus small deep learning models for self-admitted technical debt detection : Why not together?
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Journal / Publication | Software - Practice and Experience |
Online published | 28 Jun 2024 |
Publication status | Online published - 28 Jun 2024 |
Link(s)
Abstract
Given the increasing complexity and volume of Self-Admitted Technical Debts (SATDs), how to efficiently detect them becomes critical in software engineering practice for improving code quality and project efficiency. Although current deep learning methods have achieved good performance in detecting SATDs in code comments, they lack explanation. Large language models such as ChatGPT are increasingly being applied to text classification tasks due to their ability to provide explanations for classification results, but it is unclear how effective ChatGPT is for SATD classification. As the first in-depth study of ChatGPT for SATD detection, we evaluate ChatGPT's effectiveness, compare it with small deep learning models, and find that ChatGPT performs better on Recall, while small models perform better on Precision. Furthermore, to enhance the performance of these approaches, we propose a novel fusion approach named FSATD which combines ChatGPT with small models for SATD detection so as to provide reliable explanations. Through extensive experiments on 62,276 comments from 10 open-source projects, we show that FSATD outperforms existing methods in performance of F1-score in cross-project scenarios. Additionally, FSATD allows for flexible adjustment of fusion strategies, adapting to different requirements of various application scenarios, and can achieve the best Precision, Recall, or F1-score. © 2024 John Wiley & Sons Ltd.
Research Area(s)
- ChatGPT, fusion, performance and interpretability, self-admitted technical debt, small deep learning models
Citation Format(s)
Large language model ChatGPT versus small deep learning models for self-admitted technical debt detection: Why not together? / Li, Jun; Li, Lixian; Liu, Jin et al.
In: Software - Practice and Experience, 28.06.2024.
In: Software - Practice and Experience, 28.06.2024.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review