Abstract
In the domain of software fault prediction, class membership probability of a selected classifier and the factors related to its estimation can be considered as necessary information for tester to take informed decisions about software quality issues. The objective of this study is to empirically investigate the class membership probability estimation capability of 15 classifiers/fault predictors on 12 datasets of open source projects retrieved from PROMISE repository. We empirically validate the effect of dataset characteristics and set of metrics on the performance of classifiers in estimating the class membership probability. We used Receiver Operating Characteristics-Area under Curve (ROC-AUC) value and overall accuracy as benchmarks to evaluate and compare the performance of classifiers. We apply Friedman's, post-hoc Nemenyi and Analysis of Means (ANOM) test to compare the significant performance of classifiers. We conclude that ADTree and RandomForest outperform, while ZeroR classifier cannot show significant performance for estimation of class membership probability.
Original language | English |
---|---|
Title of host publication | SAC '16 Proceedings of the 31st Annual ACM Symposium on Applied Computing |
Publisher | Association for Computing Machinery |
Pages | 1550-1553 |
ISBN (Print) | 978-1-4503-3739-7 |
DOIs | |
Publication status | Published - 4 Apr 2016 |
Event | SAC '16 the 31st Annual ACM Symposium on Applied Computing - , Italy Duration: 4 Apr 2016 → 8 Apr 2016 http://www.sigapp.org/sac/sac2016/ |
Conference
Conference | SAC '16 the 31st Annual ACM Symposium on Applied Computing |
---|---|
Country/Territory | Italy |
Period | 4/04/16 → 8/04/16 |
Internet address |
Research Keywords
- Accuracy
- Chidamber and Kemerer metrics
- Classifiers
- Estimation
- Performance
- Probability
- Weka