2007
DOI: 10.1109/tr.2007.896761
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Empirical Study of Count Models for Software Fault Prediction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(25 citation statements)
references
References 14 publications
0
24
0
Order By: Relevance
“…With the training set in place, we choose regressions for model selection. Our three candidate regressions are all generalized linear regressions previously used [12] for predicting failure count data: negative binomial regression, Poisson regression, and the logistic regression. The negative binomial and Poisson regressions estimate the number of failures for a given file, by which we rank for our prioritization.…”
Section: Model Selection and Validationmentioning
confidence: 99%
“…With the training set in place, we choose regressions for model selection. Our three candidate regressions are all generalized linear regressions previously used [12] for predicting failure count data: negative binomial regression, Poisson regression, and the logistic regression. The negative binomial and Poisson regressions estimate the number of failures for a given file, by which we rank for our prioritization.…”
Section: Model Selection and Validationmentioning
confidence: 99%
“…Moreover, the effectiveness of this method for predicting the number of bugs has been validated by prior studies [22,23,27,28].…”
Section: Lrmentioning
confidence: 94%
“…Because multiple liner regression (MLR) models have proven useful in software defect prediction [9,27,28], in this paper we define a simple MLR model predicting the scores of a given set of software entities, as described below. …”
Section: B Description Of Our Approachmentioning
confidence: 99%
“…Gao and Khoshgoftaar [18] empirically evaluated eight statistical count models for software quality prediction. They showed that with a very large number of zero response variables, the zero inflated and hurdle-count models are more appropriate.…”
Section: Related Workmentioning
confidence: 99%