Paulo Guedes scite author profile

Workstation clusters are becoming an interesting alternative to dedicated multiprocessors. In this environment, the probability of a failure, during an application's exeeution, increases with the execution time and the number of workstations used. If no provision is made for handling failures, it is unlikely that long running applications will terminate successfully. One solution to this problem is process checkpointing.This paper presents a checkpoint protocol for a multithreaded distributed shared memory system based on the entry consistency memory model. The protocol allows transparent recovery from single node failures and, in some cases, from multiple node failures. A simple mechanism is used to determine if the system can be brought to a consistent state in the event of multiple machine crashes.The protocol keeps a distributed log of shared data accessesin the volatile memory of the processes, taking advantage of the independent failure characteristics of workstation clusters. Periodically, or whenever the log reaches a highwater mark, each process checkpoints its state, independently from the others. The protocol needs no extra messages during the failure-fke period, since atl checkpoint control information is piggybacked on the memory coherence protocol messages.

show abstract

Efficient and flexible object sharing

Castro

Sequeira

Costa

et al.

View full text Add to dashboard Cite

PerDiS: Design, Implementation, and Use of a PERsistent DIstributed Store

Ferreira

Shapiro

Blondel

et al. 2000

View full text Add to dashboard Cite

Verifying workflow processes against organization security policies

Ribeiro

Guedes

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Paulo Guedes

Lightweight logging for lazy release consistent distributed shared memory

A checkpoint protocol for an entry consistent shared memory system

Efficient and flexible object sharing

PerDiS: Design, Implementation, and Use of a PERsistent DIstributed Store

Verifying workflow processes against organization security policies

Contact Info

Product

Resources

About