2021
DOI: 10.1145/3469379.3469392
|View full text |Cite
|
Sign up to set email alerts
|

Predicting file lifetimes for data placement in multi-tiered storage systems for HPC

Abstract: The emergence of Exascale machines in HPC will have the foreseen consequence of putting more pressure on the storage systems in place, not only in terms of capacity but also bandwidth and latency. With limited budget we cannot imagine using only storage class memory, which leads to the use of a heterogeneous tiered storage hierarchy. In order to make the most efficient use of the high performance tier in this storage hierarchy, we need to be able to place user data on the right tier and at the right time. In t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…Several state-of-the-art multi-level caching techniques were proposed in the literature [24]- [27] which mainly concentrates on caching the data based on prediction by analyzing the characteristics of I/O accesses. The authors of [28] predicted the life time of files by analyzing the access frequency of the files. The authors in [29] proposed WorkflowRL method which manages the data in the multilevel storage systems based on reinforcement learning.…”
Section: B Multi-level Caching Methodsmentioning
confidence: 99%
“…Several state-of-the-art multi-level caching techniques were proposed in the literature [24]- [27] which mainly concentrates on caching the data based on prediction by analyzing the characteristics of I/O accesses. The authors of [28] predicted the life time of files by analyzing the access frequency of the files. The authors in [29] proposed WorkflowRL method which manages the data in the multilevel storage systems based on reinforcement learning.…”
Section: B Multi-level Caching Methodsmentioning
confidence: 99%
“…Knowledge regarding file lifetimes can enable the development and implementation of cache eviction policies superior to the Least Recently Used (LRU) policy. [12,15].…”
Section: File Lifetimesmentioning
confidence: 99%
“…This work could also be expanded upon by developing machine learning models that can predict when a certain file is likely to be evicted from the cache. Such models have been shown to be effective in creating more precise and detailed file lifetime data [15]. A more effective model could inform design choices for better distributed storage caches.…”
Section: Summary and Next Stepsmentioning
confidence: 99%
“…Thus, it is beneficial to know how long files tend to remain open. Knowledge regarding file lifetimes can enable the development and implementation of cache eviction policies superior to the Least Recently Used (LRU) policy [12,15]…”
mentioning
confidence: 99%