2015
DOI: 10.1109/tkde.2014.2324590

Towards Effective Bug Triage with Software Data Reduction Techniques

Abstract: Software companies spend over 45 percent of cost in dealing with software bugs. An inevitable step of fixing bugs is bug triage, which aims to correctly assign a developer to a new bug. To decrease the time cost in manual work, text classification techniques are applied to conduct automatic bug triage. In this paper, we address the problem of data reduction for bug triage, i.e., how to reduce the scale and improve the quality of bug data. We combine instance selection with feature selecti…

citations
Cited by 132 publications
(61 citation statements)
references
References 59 publications
“…Four algorithms that perform well on text data [13,19] and software data are chosen, namely Information Gain (IG), χ² statistic (CH) [14], Symmetrical Uncertainty attribute evaluation (SU), and Relief-F Attribute selection (RF). The chi-squared distribution, also known as the chi-square or χ² distribution, with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables.…”
Section: Feature Selection (mentioning)
confidence: 99%
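As an illustration of the χ² statistic (CH) named in the statement above, the following is a minimal sketch of scoring terms by their association with a class label (here, the assigned developer). It is not the implementation used by the cited or citing papers. The function name `chi2_term_scores`, the toy bug reports, the developer names, and the choice to take the maximum score over classes are all assumptions made for this example.

```python
from collections import Counter


def chi2_term_scores(docs, labels):
    """Score each term with the chi-square statistic of its 2x2
    term/class contingency table, keeping the maximum over classes.

    Hypothetical helper for illustration only.
    """
    n = len(docs)
    # Use term presence per document (not term frequency).
    doc_terms = [set(doc.lower().split()) for doc in docs]
    class_counts = Counter(labels)
    term_counts = Counter(t for terms in doc_terms for t in terms)
    joint = Counter(
        (t, y) for terms, y in zip(doc_terms, labels) for t in terms
    )

    scores = {}
    for term, n_term in term_counts.items():
        best = 0.0
        for cls, n_cls in class_counts.items():
            a = joint[(term, cls)]  # term present, class is cls
            b = n_term - a          # term present, other class
            c = n_cls - a           # term absent, class is cls
            d = n - a - b - c       # term absent, other class
            denom = (a + b) * (c + d) * (a + c) * (b + d)
            if denom:
                best = max(best, n * (a * d - b * c) ** 2 / denom)
        scores[term] = best
    return scores


# Toy data (assumed): four bug report summaries and their fixers.
reports = [
    "crash on startup",
    "crash in parser",
    "button color wrong",
    "button misaligned",
]
developers = ["alice", "alice", "bob", "bob"]

scores = chi2_term_scores(reports, developers)
# Keep the two highest-scoring terms as the reduced feature set.
top_terms = sorted(scores, key=scores.get, reverse=True)[:2]
```

In this toy data, terms that appear only in one developer's reports receive the highest scores, which is the property a feature selection step relies on when it discards low-scoring words.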
“…Due to the large number of daily bugs and the lack of expertise in all of them, manual triage is expensive in time and labor cost and low in accuracy. To overcome the limitations of existing work, an automatic bug triage approach was proposed [19]. This approach applies text classification techniques in order to predict the relevant developer for bug reports without tossing.…”
Section: Introduction (mentioning)
confidence: 99%
“…Reported bugs should be analyzed to determine whether they are valid or not, correct or not, and unique or not. This is called a bug triage process [1].…”
Section: Introduction (mentioning)
confidence: 99%
“…In their study, textual descriptions were transformed to feature vectors, which were the inputs of classification models. However, the study of [1] shows that not all the features are important for bug report prioritization. Their results motivated us to adopt feature selection methods on bug report prioritization.…”
Section: Introduction (mentioning)
confidence: 99%
“…For large-scale open source software projects, the number of daily bugs is so large that the triaging process becomes very difficult and challenging [10]. Software companies spend over 45 percent of cost in fixing bugs. There are two challenges related to bug data that may affect the effective use of bug repositories in software development tasks, namely the large scale and the low quality.…”
mentioning
confidence: 99%