2011 Design, Automation &Amp; Test in Europe 2011
DOI: 10.1109/date.2011.5763096
|View full text |Cite
|
Sign up to set email alerts
|

Architectures for online error detection and recovery in multicore processors

Abstract: The huge investment in the design and production of multicore processors may be put at risk because the emerging highly miniaturized but unreliable fabrication technologies will impose significant barriers to the life-long reliable operation of future chips. Extremely complex, massively parallel, multi-core processor chips fabricated in these technologies will become more vulnerable to: (a) environmental disturbances that produce transient (or soft) errors, (b) latent manufacturing defects as well as aging/wea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
62
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 106 publications
(64 citation statements)
references
References 46 publications
0
62
0
Order By: Relevance
“…This side information enables significant savings in redundancy [13]. In fact, this method of testing the performance of a device followed by some configuration is rather popular in practice, used in multi-core CPUs [14], analog-to-digital converters [15], sense-amplifiers [16], selfreplicating automatons [17], parallel computing [18], [19], etc. This paper can be seen as an attempt to provide theoretical foundations for the static defect scenario.…”
Section: B Relation To Prior Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This side information enables significant savings in redundancy [13]. In fact, this method of testing the performance of a device followed by some configuration is rather popular in practice, used in multi-core CPUs [14], analog-to-digital converters [15], sense-amplifiers [16], selfreplicating automatons [17], parallel computing [18], [19], etc. This paper can be seen as an attempt to provide theoretical foundations for the static defect scenario.…”
Section: B Relation To Prior Workmentioning
confidence: 99%
“…In particular, for t = 2, we can first choose ε = 3/2. No matter how we choose the values of π 2 , π 3 and π 4 , to satisfy (14) we must have ρ ≥ 1/2.…”
Section: Covering Conversementioning
confidence: 99%
“…These many-core architectures have opened new horizons in microprocessor industry and are going to be a dominant trend in general purpose microprocessors [1][2][3][4][5], and also in specialized processing units such as Graphics Processing Units (GPUs) and Network Processing Units [6,7]. However, extensive miniaturization and large scale integration in new fabrication technologies have led to unpleasant side effects such as production yield drop, infant mortality, and accelerated aging [8][9][10] in all nano-scale semiconductor devices including many-core processors. In fact, because of non-aggressive burn-in testing, more pronounced aging effects, and incomplete testing and verification processes due to increased time-to-market pressure in new fabrication technologies, systems fabricated in these technologies may experience faults (including early defects or latent faults) and fail any time in the field.…”
Section: Introductionmentioning
confidence: 99%
“…Other types of design defects that are not complicated can be fixed by the extensive testing that happens on the manufacturing side (Gizopoulos et al, 2011). With the above design bugs, there can be four classifications:…”
Section: Design Bug Modelmentioning
confidence: 99%
“…Fault isolation using hardware and software codesign is done by Khan and Kundu (2010). Isolating defects at various levels is explained in detail by Gizopoulos et al (2011). In design bugs related fault isolation, the exact location of faults and the context of the faulty event should be narrowed down.…”
Section: Fault Isolationmentioning
confidence: 99%