2015 IEEE 11th International Conference on E-Science 2015
DOI: 10.1109/escience.2015.13
|View full text |Cite
|
Sign up to set email alerts
|

MemEFS: An Elastic In-memory Runtime File System for eScience Applications

Abstract: Abstract-Data-intensive scientific workflows exhibit inter-task dependencies that generate file-based communication schemes. In such scenarios, traditional disk-based storage systems often limit overall application performance and scalability. To overcome the storage bottleneck, in-memory runtime distributed file systems speed up application I/O. Such systems are deployed statically onto a fixed number of compute nodes and act as a distributed, fast I/O cache for the runtime generated data. Such static deploym… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 27 publications
0
6
0
Order By: Relevance
“…For public infrastructure, elasticity could reduce operational costs by reducing resource waste [4], [5], and improve the ability to meet Quality-of-Service guarantees (such as high performance and availability) by using appropriate autoscaling policies [6]. For private infrastructure, elasticity could reduce operational costs by increasing resource utilization [7], or throttle throughput to meet demands across applications [8].…”
Section: Problem Statement and Conceptual Contributionmentioning
confidence: 99%
“…For public infrastructure, elasticity could reduce operational costs by reducing resource waste [4], [5], and improve the ability to meet Quality-of-Service guarantees (such as high performance and availability) by using appropriate autoscaling policies [6]. For private infrastructure, elasticity could reduce operational costs by increasing resource utilization [7], or throttle throughput to meet demands across applications [8].…”
Section: Problem Statement and Conceptual Contributionmentioning
confidence: 99%
“…For achieving this, variants of elastic MapReduce [50], [51] have been developed. Furthermore, more domain-specific elasticity schemes have been proposed: machine learning on Spark [52], elastic stream processing [53] or even scientific workflows [22]. Similarly to such systems, JoyGraph supports user-defined policies for dynamically scaling the number of compute nodes during runtime.…”
Section: Related Workmentioning
confidence: 99%
“…For public infrastructure, elasticity could reduce operational costs by reducing resource waste [19], [20], and improve the ability to meet Quality-of-Service guarantees (such as high performance and availability) by using appropriate auto-scaling policies [21]. For private infrastructure, elasticity could reduce operational costs by increasing resource utilization [22], or throttle throughput to meet demands across applications [23].…”
Section: Introductionmentioning
confidence: 99%
“…As the size of RAM continues to increase and its cost decreases over time, some recent work has referred to "RAM [as] the new disk," and disk as the new tape storage [23]. This trend in RAM capacity and cost has motivated filesystems based entirely in memory, including MemEFS [28] and RAMCloud [14]. Another study [10] concludes that data accessed every 5 hours should be placed in RAM, suggesting that many intermediate and temporary files created by large data-intensive computations could be stored in memory for their entire lifetime, avoiding secondary storage completely.…”
Section: Related Workmentioning
confidence: 99%