Abstract
Background: There are too many design options for software effort estimators. How can we best explore them all? Aim: We seek aspects on general principles of effort estimation that can guide the design of effort estimators. Method: We identified the essential assumption of analogy-based effort estimation, i.e., the immediate neighbors of a project offer stable conclusions about that project. We test that assumption by generating a binary tree of clusters of effort data and comparing the variance of supertrees versus smaller subtrees. Results: For 10 data sets (from Coc81, Nasa93, Desharnais, Albrecht, ISBSG, and data from Turkish companies), we found: 1) The estimation variance of cluster subtrees is usually larger than that of cluster supertrees; 2) if analogy is restricted to the cluster trees with lower variance, then effort estimates have a significantly lower error (measured using MRE, AR, and Pred(25) with a Wilcoxon test, 95 percent confidence, compared to nearest neighbor methods that use neighborhoods of a fixed size). Conclusion: Estimation by analogy can be significantly improved by a dynamic selection of nearest neighbors, using only the project data from regions with small variance. © 1976-2012 IEEE.
| Original language | English |
|---|---|
| Article number | 5728833 |
| Pages (from-to) | 425-438 |
| Journal | IEEE Transactions on Software Engineering |
| Volume | 38 |
| Issue number | 2 |
| DOIs | |
| Publication status | Published - 2012 |
| Externally published | Yes |
Research Keywords
- analogy
- k-NN
- Software cost estimation
Fingerprint
Dive into the research topics of 'Exploiting the essential assumptions of analogy-based effort estimation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver