Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: Companion Proceedings 2020
DOI: 10.1145/3377812.3390793
|View full text |Cite
|
Sign up to set email alerts
|

Importance-driven deep learning system testing

Abstract: Deep Learning (DL) systems are key enablers for engineering intelligent applications due to their ability to solve complex tasks such as image recognition and machine translation. Nevertheless, using DL systems in safety-and security-critical applications requires to provide testing evidence for their dependable operation. Recent research in this direction focuses on adapting testing criteria from traditional software engineering as a means of increasing confidence for their correct behaviour. However, they ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 28 publications
(50 reference statements)
0
12
0
Order By: Relevance
“…DeepXplore [83] introduced the neuron coverage metric to measures the percentage of activated neurons or a given test suite and DNN model, and generates new test inputs that can maximize the metric to test DL systems. Many others [70,[84][85][86][87][88] extended the coverage concept and proposed to use them on many different scenarios. Model testing has also been leveraged for many other domains such as image classification [79,89], automatic speech recognition [90], text classification [74], and machine translation [91,92].…”
Section: Effects O F Configurable Parametersmentioning
confidence: 99%
“…DeepXplore [83] introduced the neuron coverage metric to measures the percentage of activated neurons or a given test suite and DNN model, and generates new test inputs that can maximize the metric to test DL systems. Many others [70,[84][85][86][87][88] extended the coverage concept and proposed to use them on many different scenarios. Model testing has also been leveraged for many other domains such as image classification [79,89], automatic speech recognition [90], text classification [74], and machine translation [91,92].…”
Section: Effects O F Configurable Parametersmentioning
confidence: 99%
“…We select the state-of-the-art coverage criteria as the baselines for the evaluation, i.e., Neuron Coverage (NC) [39], 𝑘-Multisection Neuron Coverage (KMNC) [31], Neuron Boundary Coverage (NBC) [31], Likelihood-based Surprise Coverage (LSC) [31], Distance-based Surprise Coverage (DSC) [23] and Importance-Driven Coverage (IDC) [15]. Due to that the original IDC only supports Keras models, we implemented a PyTorch version of IDC for the comparison.…”
Section: Baselinesmentioning
confidence: 99%
“…The critical neurons between the layers form the CDP. Compared with the existing neuron-based coverage metric (e.g., NC [39], 𝑘-multisection Neuron Coverage [31], IDC [15]), CDP considers not only the critical neurons in one layer but also the relationships among layers. Moreover, the CDP is clearly related to decision-making.…”
Section: Introductionmentioning
confidence: 99%
“…[50,60,213]). One approach proposed a technique to select test cases based on a metric of importance [65], whereas others proposed techniques to identify corner cases [22], adversarial examples [204] or likely failure scenarios [107]. Finally, a few approaches proposed techniques for test input prioritization to select the most important ones and reduce the cost of labeling [34,54] or reduce the performance cost of training and testing huge amounts of data [186].…”
Section: Software Testing (115 Studies)mentioning
confidence: 99%
“…14 out of 20 focused on test coverage metrics (e.g. [22,75,190], whereas the rest of metrics were reported only by one study each: diversity [187], importance [65], suspiciousness [52], probability of sufficiency [36], and disagreement [216].…”
Section: Software Testing (115 Studies)mentioning
confidence: 99%