2017 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W) 2017
DOI: 10.1109/dsn-w.2017.47
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation and Mitigation of Soft-Errors in Neural Network-Based Object Detection in Three GPU Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(29 citation statements)
references
References 33 publications
0
12
0
Order By: Relevance
“…In the literature, several works have been proposed in the last years for addressing the reliability assessment of CNNs. Concerning physical-based FI, we can cite [2], where the reliability dependence on three different Graphics Processing Unit (GPU) architectures (Kepler, Maxwell, and Pascal) was evaluated executing the Darknet Neural Network [8] when exposed to atmospheric-like neutrons. In [9], the authors analyse the reliability of a 54-layer Deep Neural Network injecting faults in the network weights and input data using an accelerated neutron beam for studying transient errors and FI tests to simulate permanent faults.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In the literature, several works have been proposed in the last years for addressing the reliability assessment of CNNs. Concerning physical-based FI, we can cite [2], where the reliability dependence on three different Graphics Processing Unit (GPU) architectures (Kepler, Maxwell, and Pascal) was evaluated executing the Darknet Neural Network [8] when exposed to atmospheric-like neutrons. In [9], the authors analyse the reliability of a 54-layer Deep Neural Network injecting faults in the network weights and input data using an accelerated neutron beam for studying transient errors and FI tests to simulate permanent faults.…”
Section: Related Workmentioning
confidence: 99%
“…Nowadays, there is an incentive for pushing CNNs from the cloud to the edge devices and, in particular, for real-time safety-critical systems, e.g., in autonomous driving. Several recent studies have demonstrated that hardware faults induced by external perturbations (i.e., in a harsh environment) can significantly impact the inference leading to CNN prediction failures [2], [3]. Therefore, ensuring the reliability of CNNs is crucial, especially when deployed in safety-and mission-critical applications, such as robotics, aeronautics, smart healthcare, and autonomous driving.…”
Section: Introductionmentioning
confidence: 99%
“…Although several studies [17][8] [18][19] [18] have evaluated and analyzed the reliability of DNN models, and many techniques have been proposed to mitigation the soft errors in GPUs based on software solutions. For instance, Triple Modular Redundancy (TMR), Double Modular Redundancy (DMR), and Algorithm-Based Fault Tolerance (ABFT).…”
Section: Introductionmentioning
confidence: 99%
“…However, TMR introduces large overhead in hardware design, especially for massively parallel processing array. Moreover, ECC can only be used in a memory unit and not in a logic unit owing to its inability to maintain the error correction message transfer between the layers [4]. For its part, fault-aware training has been tested and found to be effective for many DNN inference accelerator fault-patterns, while also having little impact on hardware design.…”
Section: Introductionmentioning
confidence: 99%