2009 IEEE International Conference on Cluster Computing and Workshops 2009
DOI: 10.1109/clustr.2009.5289196

Implementing WebGIS on Hadoop: A case study of improving small file I/O performance on HDFS

Cited by 148 publications (74 citation statements)
References 8 publications
“…Reference [9] and [10] were put forward their own solutions in view of the different application, in fact the thought of the two papers is similar: the basic idea is to combine small files into large ones to reduce the file number and build index for each file. in the original HDFS basis to add a small document processing module, when a file arrived, judge whether the file belongs to a small file, if it is, it gave it to the document processing module processing, otherwise store in HDSF directly.…”
Section: Other Solutions (mentioning)
confidence: 99%
“…Papers [9] and [10] which mentioned in Section four are proposed for specific applications solutions, they are not common technical solutions.…”
Section: Conclusion and Proposed Solution (mentioning)
confidence: 99%
“…Preprocessing is also required for each addition of new input image set. Small files cannot be directly accessed in SequenceFile, whole SequenceFile has to be processed to obtain an image data as one element [14].…”
Section: L a S S (mentioning)
confidence: 99%
“…Even the improved LATE speculative execution could end up increasing the degree of congestion within a busy cluster, if speculative copies are simply assigned to machines that are already close to maximum resource utilization. Resource Aware Programming in Hadoop has prettify one of the Explore Challenges [14] [15] in Cloud Computing. Programming in Hadoop is centralized, and initiated.…”
Section: Resource Aware Scheduling (mentioning)
confidence: 99%