2018 IEEE International Conference on Big Data (Big Data) 2018
DOI: 10.1109/bigdata.2018.8622616
|View full text |Cite
|
Sign up to set email alerts
|

ARCHIE: Data Analysis Acceleration with Array Caching in Hierarchical Storage

Abstract: Scientific data analysis typically involves reading massive amounts of data generated by simulations, experiments, and observations. A significant bottleneck in this process is reading such data because the data files are stored on the rotating disks. Recent supercomputing systems are adding non-volatile storage layers to fill the performance gap between fast main memory and the slow disk-based storage. Software libraries for managing this hierarchy not only need to read data efficiently, but also reduce user-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…In read caching, we have implemented caching and prefetching chunks of data based on the history of HDF5 chunk accesses [5] . The cached chunks are stored as binary files in a faster persistent storage device, such as an SSD-based burst buffer.…”
Section: Read Caching In Data Elevatormentioning
confidence: 99%
“…In read caching, we have implemented caching and prefetching chunks of data based on the history of HDF5 chunk accesses [5] . The cached chunks are stored as binary files in a faster persistent storage device, such as an SSD-based burst buffer.…”
Section: Read Caching In Data Elevatormentioning
confidence: 99%
“…Recent works such as Hermes [19], UniviStor [20] and ARCHIE [18] provide buffering solution on all stages of storage layers. Hermes offers three data placement policies to manage caching and prefetching of flat data files.…”
Section: Related Workmentioning
confidence: 99%
“…On read, they need to populate the Burst Buffer through manual stage-in operations. Since prefetching data to an intermediate storage is also necessary for many postprocessing applications, systems that support one-way read operation, such as ARCHIE, have also been proposed [18]. Hermes and UniviStor are new systems that offer multi-tiered buffering solutions [19], [20].…”
Section: Introductionmentioning
confidence: 99%