2014 IEEE International Conference on Big Data (Big Data) 2014
DOI: 10.1109/bigdata.2014.7004234
|View full text |Cite
|
Sign up to set email alerts
|

VENU: Orchestrating SSDs in hadoop storage

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
3
2
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…The Apache Hadoop community [36] has also proposed such kind of centralized cache management scheme in HDFS, which is an explicit caching mechanism that allows users to specify paths to be cached by HDFS. Some recent studies [20,25,26,35] also pay attention to incorporate heterogeneous storage media (e.g. SSD and parallel file system) in HDFS.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The Apache Hadoop community [36] has also proposed such kind of centralized cache management scheme in HDFS, which is an explicit caching mechanism that allows users to specify paths to be cached by HDFS. Some recent studies [20,25,26,35] also pay attention to incorporate heterogeneous storage media (e.g. SSD and parallel file system) in HDFS.…”
Section: Related Workmentioning
confidence: 99%
“…SSD and parallel file system) in HDFS. Authors in [25] deal with data distribution in the presence of nodes that do not have uniform storage characteristics; whereas, [26] caches data in SSD. Researchers in [35] present HDFS-specific optimizations for PVFS and [32] propose to store cold data of Hadoop cluster to network-attached file systems.…”
Section: Related Workmentioning
confidence: 99%
“…In the Hadoop ecosystem, several studies investigated the performance improvement of a variety of MapReduce workloads when introducing SSD into the ecosystem as a tier [59], [60], [61] [12] or as a cache [8]. To examine the difference in SSD utilization usage approaches impact on MapReduce workloads, [62] and [61] compared the performance improvement when applying SSD as a tier and as a cache.…”
Section: Storage Capability and Workload Characteristicmentioning
confidence: 99%
“…[18] explore the benefits and limitations of in-storage processing on SSDs (the execution of applications on processors in the storage controller). Other researchers focus on incorporating SSDs into HDFS using caching mechanisms to achieve better performance [20,28,32,35]. A few works also discuss SSDs' impact on Hadoop [17,25], sometimes focusing on using SSDs as the sole storage device under HDFS.…”
Section: Related Workmentioning
confidence: 99%