Proceedings of the Seventh ACM Symposium on Cloud Computing 2016
DOI: 10.1145/2987550.2987553
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Caching in Big SQL using the HDFS Cache

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 34 publications
(27 citation statements)
references
References 15 publications
0
23
0
Order By: Relevance
“…However, PACMan does not allow applications to specify hot data in memory for subsequent efficient accesses and does not implement cache admission policies. Big SQL [16] is an SQL-on-hadoop system that utilizes HDFS cache for caching table partitions. Big SQL presents two algorithms, namely SLRU-K and EXD, that explore the tradeoff of caching objects based on recency and frequency of data accesses.…”
Section: Distributed File Systems and Tieringmentioning
confidence: 99%
See 3 more Smart Citations
“…However, PACMan does not allow applications to specify hot data in memory for subsequent efficient accesses and does not implement cache admission policies. Big SQL [16] is an SQL-on-hadoop system that utilizes HDFS cache for caching table partitions. Big SQL presents two algorithms, namely SLRU-K and EXD, that explore the tradeoff of caching objects based on recency and frequency of data accesses.…”
Section: Distributed File Systems and Tieringmentioning
confidence: 99%
“…All these decisions will be handled through pluggable downgrade and upgrade policies, elaborated in Sections 4-6. The decisions are made at the granularity of files (rather than blocks) since previous work [5,16] has shown that performance improvement is attained only when entire files are present in a higher tier (called the "all-or-nothing" property in [5]).…”
Section: Adaptive Tiered Storage Managementmentioning
confidence: 99%
See 2 more Smart Citations
“…they perform within a constant factor k of the optimal offline algorithm. More recently, cost-based algorithms have been used for adaptive caching in Hadoop-based analytics systems [19]. Unlike algorithms for data items with identical costs, there is no provably optimal, polynomial time cache eviction algorithm [12].…”
Section: Related Workmentioning
confidence: 99%