Proceedings of the 14th International Conference on Predictive Models and Data Analytics in Software Engineering 2018
DOI: 10.1145/3273934.3273936
|View full text |Cite
|
Sign up to set email alerts
|

A Public Unified Bug Dataset for Java

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
25
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(26 citation statements)
references
References 35 publications
1
25
0
Order By: Relevance
“…For experimentation, we considered forty open source datasets relating to the defect prediction from tera-PROMISE repository [39]. The list and the class imbalance percentages of datasets are shown in Table 1.…”
Section: Experimentation and Resultsmentioning
confidence: 99%
“…For experimentation, we considered forty open source datasets relating to the defect prediction from tera-PROMISE repository [39]. The list and the class imbalance percentages of datasets are shown in Table 1.…”
Section: Experimentation and Resultsmentioning
confidence: 99%
“…The PROMISE dataset has been validated and used several times in different research papers, e.g., concerning bug prediction [15,14].…”
Section: G Defectsmentioning
confidence: 99%
“…The process of detecting and removing soft-ware defects is an important step in guaranteeing the fulfilment of end user satisfaction [20] and reducing the economic liability associated with releasing flawed software products [26]. Furthermore, building an efficient defect prediction model has raised increased interest from researchers in their quest to learn from the pre-70 vious defects and to use this knowledge to predict future ones [15]. Defects were also heavily investigated in the literature, with several studies considering defects as a dependent variable affected by various independent variables, such 75 as patterns [52,22,6], bad design [57] and the presence of code smells [30].…”
Section: Introductionmentioning
confidence: 99%
“…However, public datasets may have some issues such as lack of data quality. For example, there are two versions of PROMISE (Ferenc et al, 2018) dataset. Simplified PROMISE Source Code (SPSC) and PROMISE Source Code (PSC).…”
Section: B Datasetsmentioning
confidence: 99%