Proceedings of the 25th International Symposium on Software Testing and Analysis 2016
DOI: 10.1145/2931037.2931040
|View full text |Cite
|
Sign up to set email alerts
|

Threats to the validity of mutation-based test assessment

Abstract: Much research on software testing and test techniques relies on experimental studies based on mutation testing. In this paper we reveal that such studies are vulnerable to a potential threat to validity, leading to possible Type I errors; incorrectly rejecting the Null Hypothesis. Our findings indicate that Type I errors occur, for arbitrary experiments that fail to take countermeasures, approximately 62% of the time. Clearly, a Type I error would potentially compromise any scientific conclusion. We show that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
140
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 105 publications
(145 citation statements)
references
References 60 publications
5
140
0
Order By: Relevance
“…An emerging question regards the optimal use of mutants when comparing testing methods, i.e., whether methods should be applied on the original (clean) or on the mutant versions of the programs. Similarly, the relation of specific kinds of mutants, such as the subsuming [56] and hard to kill [57] ones, with real faults and their actual contribution within the testing process form other important aspects that we plan to investigate.…”
Section: Discussionmentioning
confidence: 99%
“…An emerging question regards the optimal use of mutants when comparing testing methods, i.e., whether methods should be applied on the original (clean) or on the mutant versions of the programs. Similarly, the relation of specific kinds of mutants, such as the subsuming [56] and hard to kill [57] ones, with real faults and their actual contribution within the testing process form other important aspects that we plan to investigate.…”
Section: Discussionmentioning
confidence: 99%
“…The actual differences are thin and are due to the selection procedure. Disjoint mutants are a subset with minimum joint killings, approximated through a greedy heuristic [1], [6]. Surface mutants [12] are also approximated by a similar heuristic.…”
Section: B Set-based Mqismentioning
confidence: 99%
“…We used the Codeflaws benchmark [13] that involves programs selected from an on-line programming contests 1 . In Codeflaws, every faulty program version is unique and has two instances, the 'faulty' and the 'fixed' one.…”
Section: A Programs and Faultsmentioning
confidence: 99%
See 2 more Smart Citations