2016 IEEE High Performance Extreme Computing Conference (HPEC) 2016
DOI: 10.1109/hpec.2016.7761577
|View full text |Cite
|
Sign up to set email alerts
|

From NoSQL Accumulo to NewSQL Graphulo: Design and utility of graph algorithms inside a BigTable database

Abstract: Abstract-Google BigTable's scale-out design for distributed key-value storage inspired a generation of NoSQL databases. Recently the NewSQL paradigm emerged in response to analytic workloads that demand distributed computation local to data storage. Many such analytics take the form of graph algorithms, a trend that motivated the GraphBLAS initiative to standardize a set of matrix math kernels for building graph algorithms. In this article we show how it is possible to implement the GraphBLAS kernels in a BigT… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 33 publications
0
11
0
Order By: Relevance
“…To use Graphulo through D4M, the first step is to bind to a database, requesting a Graphulo object: Graphulo have been shown to scale well to multi-node Accumulo instances [10] and outperform the client-side alternative in many cases. Our extensive performance results indicate that D4M-Graphulo can be used in cases where data size makes operations impossible to complete client-side due to memory constraints [11] [12]. We have performed significant experiments with the D4M-Graphulo tool and have compared it to numerous parallel processing paradigms.…”
Section: Graphulomentioning
confidence: 99%
“…To use Graphulo through D4M, the first step is to bind to a database, requesting a Graphulo object: Graphulo have been shown to scale well to multi-node Accumulo instances [10] and outperform the client-side alternative in many cases. Our extensive performance results indicate that D4M-Graphulo can be used in cases where data size makes operations impossible to complete client-side due to memory constraints [11] [12]. We have performed significant experiments with the D4M-Graphulo tool and have compared it to numerous parallel processing paradigms.…”
Section: Graphulomentioning
confidence: 99%
“…During scans, the user can execute arbitrary code in the form of iterators that run server-side as data streams from each partition in parallel. Iterator code can even initiate scans on or write entries to additional tables, a fact we previously exploited in the Graphulo matrix math library [20,21].…”
Section: Accumulo Implementation Of Laradbmentioning
confidence: 99%
“…Solutions for big data problems generally involve distributed computation and need to take the full advantage of data locality. Therefore, instead of using an external system, performing computations inside a database system is a preferable solution [7]. One approach to perform big data computations inside a database system is using NewSQL databases [8].…”
Section: Introductionmentioning
confidence: 99%
“…These type of databases seek solutions to provide scalability of NoSQL systems while retaining the SQL guarantees (ACID properties) of relational databases. However, even though using a NewSQL database can be a good alternative, some researchers take a different approach and seek solutions based on performing big data computations inside NoSQL databases [7]. To that extent, the Graphulo library [9] realizing the kernel operations of Graph Basic Linear Algebra Subprogram (GraphBLAS) [10] in Accumulo NoSQL database is recently developed.…”
Section: Introductionmentioning
confidence: 99%