2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06) 2006
DOI: 10.1109/srds.2006.38

Recovering from Distributable Thread Failures with Assured Timeliness in Real-Time Distributed Systems

Abstract: This thesis considers the problem of recovering from failures of distributable threads with assured timeliness. When a node hosting a portion of a distributable thread fails, it causes orphans, i.e., thread segments that are disconnected from the thread's root. A termination model is considered for recovering from such failures. In this model, the orphans must be detected and cleaned up, and a failure-exception notification must be delivered to the farthest contiguous surviving thread segment for resuming thread …
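
The termination model in the abstract can be illustrated with a small sketch. It assumes a distributable thread is modeled as a linear chain of segments, each hosted on one node, with the root at index 0. The names `Segment` and `classify_after_failure` are hypothetical and do not come from the paper. The sketch shows only which segments become orphans and which segment would receive the failure-exception notification. It does not model the paper's thread integrity protocols or any timeliness guarantee.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass(frozen=True)
class Segment:
    """One portion of a distributable thread (hypothetical model)."""
    index: int  # position in the invocation chain; 0 is the thread's root
    node: str   # identifier of the node hosting this segment


def classify_after_failure(
    chain: List[Segment], failed_nodes: Iterable[str]
) -> Tuple[Optional[Segment], List[Segment]]:
    """Return (notification_target, orphans) for a set of failed nodes.

    notification_target: the farthest segment still contiguously connected
        to the root, or None if nothing failed or the root itself was lost.
    orphans: surviving segments downstream of the first failure, which are
        disconnected from the root and would need to be cleaned up.
    """
    failed = set(failed_nodes)

    # Find the first segment in the chain whose host node has failed.
    first_failed = next(
        (i for i, seg in enumerate(chain) if seg.node in failed), None
    )
    if first_failed is None:
        return None, []  # no failure affects this thread

    # The segment just before the break is the farthest contiguous survivor.
    target = chain[first_failed - 1] if first_failed > 0 else None

    # Segments past the break that are still alive are orphans.
    orphans = [seg for seg in chain[first_failed + 1:] if seg.node not in failed]
    return target, orphans


if __name__ == "__main__":
    chain = [Segment(i, n) for i, n in enumerate(["A", "B", "C", "D"])]
    target, orphans = classify_after_failure(chain, {"B"})
    print("notify:", target)
    print("orphans:", orphans)
```

With nodes A, B, C, D and a failure of B, the segment on A is the notification target and the segments on C and D are orphans. A thread that revisits a node (for example A, B, A) is handled the same way: the later segment on A survives but is still an orphan, because its path to the root runs through the failed node.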

Cited by 18 publications (14 citation statements)
References 13 publications
“…Binoy Ravindran, Edward Curley, Jonathan Anderson, and E. Douglas Jensen who considered the problem of recovering from failures of distributable threads in distributed real-time systems. They presented a scheduling algorithm called HUA and two thread integrity protocols called D-TPR and W-TPR [5].…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…This abort model differs from previous authors in two points [39], [42], [75]. First, we do not assume immediate invocation of the abort handler upon resumption of execution.…”
Section: Abort Model
Citation type: mentioning
confidence: 99%
“…In this paper, we consider the problem of scheduling threads in the presence of the previously mentioned uncertainties, focusing particularly on (arbitrary) node failures. Past efforts on thread scheduling (e.g., [2,12,13]) consider a paradigm broadly called independent node scheduling, where threads are scheduled at nodes using propagated thread scheduling parameters and without any interaction with other nodes (thereby not considering node failures during scheduling). Fault-management is separately addressed by thread integrity protocols [14] that run concurrent to thread execution.…”
Section: Contributions: Assured Thread Timeliness In the Presence Of
Citation type: mentioning
confidence: 99%
“…Fault-management is separately addressed by thread integrity protocols [14] that run concurrent to thread execution. Thread integrity protocols detect failures of the thread abstraction, delivering failure-exception notifications [2,13]. This approach avoids the overhead of inter-node communication, and is therefore message-efficient and tractable (solely from the thread scheduling standpoint).…”
Section: Contributions: Assured Thread Timeliness In the Presence Of
Citation type: mentioning
confidence: 99%