2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER) 2019
DOI: 10.1109/saner.2019.8667991
|View full text |Cite
|
Sign up to set email alerts
|

BEARS: An Extensible Java Bug Benchmark for Automatic Program Repair Studies

Abstract: Benchmarks of bugs are essential to empirically evaluate automatic program repair tools. In this paper, we present BEARS, a project for collecting and storing bugs into an extensible bug benchmark for automatic repair studies in Java. The collection of bugs relies on commit building state from Continuous Integration (CI) to find potential pairs of buggy and patched program versions from open-source projects hosted on GitHub. Each pair of program versions passes through a pipeline where an attempt of reproducin… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
67
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 104 publications
(74 citation statements)
references
References 17 publications
0
67
1
Order By: Relevance
“…There are some benchmarks that were rarely used or never used so far: this is partially explained by the fact that some benchmarks were recently published (e.g. Bears [25]), thus they were not available when some repair tools were published.…”
Section: State Of Affairs On Test-suite-based Automatic Repair Tools mentioning
confidence: 99%
See 1 more Smart Citation
“…There are some benchmarks that were rarely used or never used so far: this is partially explained by the fact that some benchmarks were recently published (e.g. Bears [25]), thus they were not available when some repair tools were published.…”
Section: State Of Affairs On Test-suite-based Automatic Repair Tools mentioning
confidence: 99%
“…Bears [25] contains 251 bugs from 72 different GitHub projects with an average size of 62,597 lines of Java code. It was created by mining software repositories based on commit building state from Travis Continuous Integration.…”
Section: Subject Benchmarks Of Bugsmentioning
confidence: 99%
“…The performance achieved by TBar on Defects4J may not be reached on a bigger, more diverse and more representative dataset. To address this threat, new benchmarks such as Bugs.jar [61] and Bears [46] should be investigated.…”
Section: Threats To Validitymentioning
confidence: 99%
“…On the contrary, the developer test is shorter and directly targets the changed behavior, which is better. JSOUP#3676B13 19 : This change is a pull request (i.e. a set of commits) and introduces 5 new behavioral changes.…”
Section: Rq4: How Do Human and Generated Tests That Detect Behavioralmentioning
confidence: 99%