2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) 2013
DOI: 10.1109/dsn.2013.6575308
|View full text |Cite
|
Sign up to set email alerts
|

Generative software-based memory error detection and correction for operating system data structures

Abstract: Abstract-Recent studies indicate that the number of system failures caused by main memory errors is much higher than expected. In contrast to the commonly used hardware-based countermeasures, for example using ECC memory, softwarebased fault-tolerance measures are much more flexible and can exploit application knowledge, such as the criticality of specific data structures. This paper presents a software-based memory error protection approach, which we used to harden the eCos operating system in a case study. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(20 citation statements)
references
References 33 publications
0
19
0
Order By: Relevance
“…Yet, an interesting aspect is the possibility to configure kernel internals, which have different resilience behaviors regarding to soft errors. Here, the essential system component is the scheduler [12]. eCos offers different real-time schedulers to choose from: a multi-level queue (MLQ) scheduler and a bitmap scheduler.…”
Section: A Dynamic Operating System: Ecos (Posix)mentioning
confidence: 99%
See 2 more Smart Citations
“…Yet, an interesting aspect is the possibility to configure kernel internals, which have different resilience behaviors regarding to soft errors. Here, the essential system component is the scheduler [12]. eCos offers different real-time schedulers to choose from: a multi-level queue (MLQ) scheduler and a bitmap scheduler.…”
Section: A Dynamic Operating System: Ecos (Posix)mentioning
confidence: 99%
“…The robustness of the eCos kernel against memory faults is improved by Generic Object Protection [12]. In-memory kernel objects are enriched by a CRC32 error-detecting code, which is allocated together with each instance of a kernel data structure, such as the scheduler and thread objects.…”
Section: A Hardening Ecosmentioning
confidence: 99%
See 1 more Smart Citation
“…These techniques, though they improve system reliability, still require costly ECC hardware for detecting and identifying memory pages with errors. Other works have attempted to reduce the impact of memory errors on system reliability by writing more reliable software [39], modifying the OS memory allocator [40], or using a compiler to generate a more error-tolerant version of the program [41,42]. Other algorithmic solutions (e.g., memory bounds checks [43], watchdog timers [43], and checkpoint recovery [44][45][46]) have also been applied to achieve resilience to memory errors.…”
Section: B Related Workmentioning
confidence: 99%
“…As an initial technique, we instrumented jino to enrich references with parity information. This technique has some advantages over reference replication [6], as it does not inflate the reference size and so the attack surface of the program, which is the reason why we have selected this approach.…”
Section: Protection Techniquesmentioning
confidence: 99%