2021
DOI: 10.1007/s10664-021-09996-y
|View full text |Cite
|
Sign up to set email alerts
|

Predicting unstable software benchmarks using static source code features

Abstract: Software benchmarks are only as good as the performance measurements they yield. Unstable benchmarks show high variability among repeated measurements, which causes uncertainty about the actual performance and complicates reliable change assessment. However, if a benchmark is stable or unstable only becomes evident after it has been executed and its results are available. In this paper, we introduce a machine-learning-based approach to predict a benchmark’s stability without having to execute it. Our approach … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 18 publications
(2 citation statements)
references
References 118 publications
0
2
0
Order By: Relevance
“…We chose 2014 as a cutoff point because this was the year the TensorFlow system was initially released. I2: Making use of DL as a core contribution of the paper and explicitly reporting on the used code representation approach. To illustrate this criterion, we discuss the following study as a counterexample [26]. In this study, Laaber et al.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We chose 2014 as a cutoff point because this was the year the TensorFlow system was initially released. I2: Making use of DL as a core contribution of the paper and explicitly reporting on the used code representation approach. To illustrate this criterion, we discuss the following study as a counterexample [26]. In this study, Laaber et al.…”
Section: Methodsmentioning
confidence: 99%
“…� I2: Making use of DL as a core contribution of the paper and explicitly reporting on the used code representation approach. To illustrate this criterion, we discuss the following study as a counterexample [26]. In this study, Laaber et al tackled an SE task (predictability of system performance) and the authors used an artificial neural network (ANN) as a DL model for that task.…”
Section: Inclusion Criteriamentioning
confidence: 99%