2018 IEEE International Conference on Software Maintenance and Evolution (ICSME) 2018
DOI: 10.1109/icsme.2018.00046
|View full text |Cite
|
Sign up to set email alerts
|

Are Bug Reports Enough for Text Retrieval-Based Bug Localization?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
50
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(56 citation statements)
references
References 34 publications
5
50
0
Order By: Relevance
“…Herbold et al (2020) independently confirmed the results by Herzig et al (2013) and demonstrated how this and other issues negatively impact defect prediction data. However, while both Herzig et al (2013) and Herbold et al (2020) study the impact of mislabels of defect prediction, any software repository mining research that studies defects suffers from similar consequences, e.g., bug localization (e.g., Marcus et al 2004, Lukins et al 2008, Rao and Kak 2011, Mills et al 2018. In the literature, there are several approaches that try to address the issue of mislabels in issue systems through machine learning.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Herbold et al (2020) independently confirmed the results by Herzig et al (2013) and demonstrated how this and other issues negatively impact defect prediction data. However, while both Herzig et al (2013) and Herbold et al (2020) study the impact of mislabels of defect prediction, any software repository mining research that studies defects suffers from similar consequences, e.g., bug localization (e.g., Marcus et al 2004, Lukins et al 2008, Rao and Kak 2011, Mills et al 2018. In the literature, there are several approaches that try to address the issue of mislabels in issue systems through machine learning.…”
Section: Related Workmentioning
confidence: 99%
“…If a feature request is misclassified as bug this may hold up a release. Second, there are many Mining Software Repositories (MSR) approaches that rely on issue types, especially the issue type bug, e.g., for bug localization (e.g., Marcus et al 2004, Lukins et al 2008, Rao and Kak 2011, Mills et al 2018 or the labeling of commits as defective with the SZZ algorithm (Śliwerski et al 2005) and the subsequent use of these labels, e.g., for defect prediction (e.g., Hall et al 2012, Hosseini et al 2017 or the creation of fine-grained data (e.g., Just et al 2014). Mislabelled issues threaten the validity of the research and would also degenerate the performance of approaches based on this data that are implemented in tools and used by practitioners.…”
Section: Introductionmentioning
confidence: 99%
“…We can simplify Equation (50) analogously to Equation (47) and get a constant cost model with a 1-to-m mapping between artifacts and defects…”
Section: Initializations Of the Cost Modelmentioning
confidence: 99%
“…This shows that in order to resolve ≈ 40% of bug report, a developer has to fix code in more than one source file, which is not uncommon [32,22]. It is also known that not all files in a bug-fixing commit may be directly related to that bug report [36]. However, the authors of the original study took all files in a bug-fixing commit.…”
Section: Data Preprocessingmentioning
confidence: 99%
“…There also exists a small percentage (≈ 0.07%) of outlier files associated with more than a 100 bugs. Note that this does not necessarily imply that these files are error-prone: they may not be related to an actual fix [36], as discussed above. Using the same rationale as in the previous paragraph, we retain all of the files.…”
Section: Data Preprocessingmentioning
confidence: 99%