Proceedings of the 10th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing 2014
DOI: 10.4108/icst.collaboratecom.2014.257304
|View full text |Cite
|
Sign up to set email alerts
|

HConfig: Resource Adaptive Fast Bulk Loading in HBase

Abstract: Abstract-NoSQL (Not only SQL) data stores become a vital component in many big data computing platforms due to its inherent horizontal scalability. HBase is an open-source distributed NoSQL store that is widely used by many Internet enterprises to handle their big data computing applications (e.g. Facebook handles millions of messages each day with HBase). Optimizations that can enhance the performance of HBase are of paramount interests for big data applications that use HBase or Big Table like key-value stor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…The schema used follows the aforementioned design; i.e., each row in this table has a rowkey that is the concatenation of a cluster ID and the lowerleft coordinates of a cell in the uniform space, and contains a single column with all data points (in the original space) mapped to that cell. Moreover, instead of inserting the data points into the table one by one, the standard HBase bulk loading [5] technique is used.…”
Section: A the Stos Methodologymentioning
confidence: 99%
See 2 more Smart Citations
“…The schema used follows the aforementioned design; i.e., each row in this table has a rowkey that is the concatenation of a cluster ID and the lowerleft coordinates of a cell in the uniform space, and contains a single column with all data points (in the original space) mapped to that cell. Moreover, instead of inserting the data points into the table one by one, the standard HBase bulk loading [5] technique is used.…”
Section: A the Stos Methodologymentioning
confidence: 99%
“…We also use the standard HBase bulk loading [5] technique to load the STOS-organised data into an HBase table. If the distribution of the row-keys of the HBase table is uniform, then the [5]-based loads are very efficient.…”
Section: B Performance Assessment: Stos Vs Index Overheadsmentioning
confidence: 99%
See 1 more Smart Citation
“…Due to the space constraint, we only include the results of read-intensive workloads. Also some detail on critical parameters and their impact on bulk loading performance of HBase are reported in [6]. Fig.4 shows the read throughput by varying the heap size from the 1GB default to 2GB, 4GB, 6GB.…”
Section: A Tuning Configuration Parametersmentioning
confidence: 99%