2018
DOI: 10.1007/978-3-319-99229-7_33
“Boxing Clever”: Practical Techniques for Gaining Insights into Training Data and Monitoring Distribution Shift

Abstract: Training data has a significant influence on the behaviour of an artificial intelligence algorithm developed using machine learning techniques. Consequently, any argument that the trained algorithm is, in some way, fit for purpose ought to include consideration of data as an entity in its own right. We describe some simple techniques that can provide domain experts and algorithm developers with insights into training data and which can be implemented without specialist computer hardware. Specifically, we consi…

Cited by 12 publications (11 citation statements)
References 9 publications (9 reference statements)
“…Moreover, there are different ways to define the set of hyper-rectangles. For example, the "boxing clever" method in [Ashmore and Hill, 2018], initially proposed for designing training datasets, divides the input space into a series of representative boxes. When the hyper-rectangles are sufficiently fine-grained with respect to the Lipschitz constant of the DNN, the method in [Wicker et al, 2018] becomes an exhaustive search and has a provable guarantee on its result.…”
Section: Safety Coverage
confidence: 99%
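The box-based partitioning described above can be sketched in a few lines. This is a minimal illustration of dividing an input space into a uniform grid of hyper-rectangles and counting training points per box, not the exact procedure from [Ashmore and Hill, 2018]; the function names and the choice of a uniform grid are assumptions for illustration.

```python
import numpy as np

def box_indices(points, lower, upper, bins):
    """Assign each point to a hyper-rectangular box on a uniform grid.

    points: (n, d) array; lower/upper: per-dimension bounds; bins: boxes per dimension.
    Returns an (n, d) array of integer box coordinates.
    """
    scaled = (points - lower) / (upper - lower)   # normalise each dimension to [0, 1]
    idx = np.floor(scaled * bins).astype(int)     # uniform grid index per dimension
    return np.clip(idx, 0, bins - 1)              # keep boundary points inside the grid

def box_occupancy(points, lower, upper, bins):
    """Count training points per box; empty or sparse boxes reveal under-covered regions."""
    idx = box_indices(points, lower, upper, bins)
    boxes, counts = np.unique(idx, axis=0, return_counts=True)
    return {tuple(box): int(c) for box, c in zip(boxes, counts)}

rng = np.random.default_rng(0)
data = rng.uniform(0.0, 1.0, size=(200, 2))       # toy 2-D training set
occ = box_occupancy(data,
                    lower=np.array([0.0, 0.0]),
                    upper=np.array([1.0, 1.0]),
                    bins=4)
# Boxes absent from `occ` contain no training data at all.
```

A domain expert can then inspect which boxes are empty or thinly populated, which is one practical way to reason about where a trained model's behaviour is unsupported by data.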
“…Therefore, although it is reasonable to believe that the resulting trained models can perform well on new inputs close to the training data, it is also understandable that the trained models might not perform correctly on inputs where there is no neighbouring training data. While techniques are sought to achieve better generalisability for DNN training algorithms, including various regularisation techniques (see e.g., for a comprehensive overview), as suggested in e.g., [Amodei et al, 2016, Ashmore and Hill, 2018, Moreno-Torres et al, 2012], it is also meaningful (particularly for the certification of safety-critical systems) to be able to identify those inputs on which the trained models should not have high confidence. Technically, such inputs can be formally defined as being both topologically far away from the training data in the input space and classified with high probability by the trained models.…”
Section: Distributional Shift, Out-of-distribution Detection and Run-time Monitoring
confidence: 99%
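The notion of an input being "topologically far away from training data" can be made concrete with a simple nearest-neighbour distance check. This is a minimal sketch, assuming a Euclidean metric and a hand-picked threshold; it is not a method prescribed by any of the cited papers, and the function names are illustrative.

```python
import numpy as np

def nearest_training_distance(x, training_data):
    """Euclidean distance from a query point to its nearest training example."""
    return float(np.min(np.linalg.norm(training_data - x, axis=1)))

def flag_out_of_distribution(x, training_data, threshold):
    """Flag inputs that lie far from every training example."""
    return nearest_training_distance(x, training_data) > threshold

train = np.array([[0.0, 0.0], [1.0, 1.0], [0.5, 0.5]])

in_dist = flag_out_of_distribution(np.array([0.45, 0.55]), train, threshold=0.5)
ood = flag_out_of_distribution(np.array([5.0, 5.0]), train, threshold=0.5)
# in_dist is False (close to [0.5, 0.5]); ood is True (far from all training data).
```

A run-time monitor could apply such a check before trusting a high-confidence prediction, though in practice the metric and threshold would need calibration for the application at hand.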
“…Whilst in [36], coverage is enforced over finite partitions of the input space, relying on predefined sets of application-specific scenario attributes. The "boxing clever" technique in [37] focuses on the distribution of training data and divides the input domain into a series of representative boxes. In [38], the difference between the test dataset and the training dataset is measured by quantifying the difference between the DNNs' activation patterns.…”
Section: Generation of Adversarial Examples for DNNs
confidence: 99%
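Quantifying the difference between activation patterns, as [38] is described as doing, can be illustrated with a toy single-layer example: binarise the ReLU on/off pattern of a hidden layer and measure what fraction of test inputs produce a pattern never seen during training. This is a simplified sketch of the general idea under assumed names and a one-layer model, not the measure used in [38].

```python
import numpy as np

def activation_patterns(inputs, weights, bias):
    """Binary ReLU on/off pattern of one hidden layer, one row per input."""
    pre_activation = inputs @ weights + bias
    return (pre_activation > 0).astype(int)

def unseen_pattern_fraction(train_inputs, test_inputs, weights, bias):
    """Fraction of test inputs whose activation pattern never occurs on training data."""
    train_patterns = {tuple(p) for p in activation_patterns(train_inputs, weights, bias)}
    test_patterns = activation_patterns(test_inputs, weights, bias)
    unseen = sum(tuple(p) not in train_patterns for p in test_patterns)
    return unseen / len(test_inputs)

# Toy 2-unit layer: identity weights, zero bias.
weights = np.array([[1.0, 0.0], [0.0, 1.0]])
bias = np.zeros(2)
train_in = np.array([[1.0, 1.0], [-1.0, -1.0]])   # patterns (1,1) and (0,0)
test_in = np.array([[1.0, -1.0], [2.0, 2.0]])     # patterns (1,0) and (1,1)
frac = unseen_pattern_fraction(train_in, test_in, weights, bias)
# frac == 0.5: one of the two test patterns was never seen in training.
```

A high unseen-pattern fraction suggests the test set exercises internal behaviours the training data never did, which is one signal of distributional difference.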
“…To be more specific, the intrinsic features of NN-based software (e.g., the NN model's architectural details and the working mechanism of the NN) should be carefully considered when setting the testing criteria. That is, testing criteria should be defined comprehensively and explicitly, considering not only test case coverage but also the robustness of NN-based system performance (for instance, testing how the NN responds when input data change slightly) and the features of the training data set, such as the data density issue mentioned in [146].…”
Section: Limitations and Suggestions for Testing and Verifying of NN-
confidence: 99%