Detection of Fault-Prone Classes Using Logistic Regression Based Object-Oriented Metrics Thresholds

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review

14 Scopus Citations
View graph of relations

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages93-100
ISBN (Print)9781509037131
Publication statusPublished - 21 Sept 2016

Conference

Title2nd IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016
PlaceAustria
CityVienna
Period1 - 3 August 2016

Abstract

Background: In the plethora of studies, the objectorientedmetrics have been empirically validated to assess thedesign properties and quantify the high-level quality attributessuch as fault-proneness, either at the method or class granularitylevels of software. Motivation: A more precise value of an objectorientedmetric can be used as an indicator for the developers tomake the informed decisions regarding the detection of designflaws and classify the fault-proneness classes. Method: Benderused an approach in the domain of epidemiology studies to derivethe threshold values for the risk factors. In our study, we followthe Bender's approach and propose a model to derive thethresholds for a set of software design metrics via non-linearfunctions, which are described through logistic regressioncoefficients. Subsequently, we perform four types of analysis andthree experiments in order to evaluate and compare theeffectiveness of derived thresholds in the domain of classificationof fault proneness classes. We use the Precision, Recall, Fmeasureand classification accuracy performance measures toassess the effectiveness of derived metrics thresholds. Results: Wecompare the derive threshold values of DIT, CA, LCOM andNPM metrics with their existing data distribution basedthreshold values, and observed the significant increase in theclassification accuracy of fault-prone classes. For example, DIT(27%), Ca (2%), NPM (2%) and LCOM (15%) for the Ant-1.5project. Conclusion: The analysis results suggest that theproposed model can be applied to derive the thresholds of otherobject-oriented metrics which present either with or withoutheavy-tailed distribution, however, the proposed model to derivethresholds cannot generalize for all the systems due to variationin data characteristics.

Research Area(s)

  • Classification, Complexity, Fault Proneness, Logistic Regression, Object-Oriented Metrics, Performance Measures.

Citation Format(s)

Detection of Fault-Prone Classes Using Logistic Regression Based Object-Oriented Metrics Thresholds. / Hussain, Shahid; Keung, Jacky; Khan, Arif Ali et al.
Proceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016. Institute of Electrical and Electronics Engineers Inc., 2016. p. 93-100 7573729.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with host publication)peer-review