2014 International Workshop on Data Intensive Scalable Computing Systems 2014
DOI: 10.1109/discs.2014.13
|View full text |Cite
|
Sign up to set email alerts
|

Efficient, Failure Resilient Transactions for Parallel and Distributed Computing

Abstract: Scientific simulations are moving away from using centralized persistent storage for intermediate data between workflow steps towards an all online model. This shift is motivated by the relatively slow IO bandwidth growth compared with compute speed increases. The challenges presented by this shift to Integrated Application Workflows are motivated by the loss of persistent storage semantics for node-to-node communication.One step towards addressing this semantics gap is using transactions to logically delineat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 13 publications
(12 reference statements)
0
1
0
Order By: Relevance
“…The study in Reference [26] exploits the reduction style processing pattern in analytics applications and reduces the complications of keeping checkpoints of the simulation and the analytics consistent. Research efforts in Reference [27] use a synchronous two-phase commit transactions protocol to tolerate failures in high performance and distributed computing systems. In comparison to these efforts, our data resilience approach specifically targets data staging based in-situ workflows and is more flexible, asynchronous, and scalable.…”
Section: Related Workmentioning
confidence: 99%
“…The study in Reference [26] exploits the reduction style processing pattern in analytics applications and reduces the complications of keeping checkpoints of the simulation and the analytics consistent. Research efforts in Reference [27] use a synchronous two-phase commit transactions protocol to tolerate failures in high performance and distributed computing systems. In comparison to these efforts, our data resilience approach specifically targets data staging based in-situ workflows and is more flexible, asynchronous, and scalable.…”
Section: Related Workmentioning
confidence: 99%