2014 IEEE High Performance Extreme Computing Conference (HPEC) 2014
DOI: 10.1109/hpec.2014.7040945
|View full text |Cite
|
Sign up to set email alerts
|

Achieving 100,000,000 database inserts per second using Accumulo and D4M

Abstract: Abstract-The Apache Accumulo database is an open source relaxed consistency database that is widely used for government applications. Accumulo is designed to deliver high performance on unstructured data such as graphs of network data. This paper tests the performance of Accumulo using data from the Graph500 benchmark. The Dynamic Distributed Dimensional Data Model (D4M) software is used to implement the benchmark on a 216-node cluster running the MIT SuperCloud software stack. A peak performance of over 100,0… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
36
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3
2

Relationship

3
6

Authors

Journals

citations
Cited by 50 publications
(39 citation statements)
references
References 12 publications
3
36
0
Order By: Relevance
“…Q-UEL owes a considerable debt to many efforts concerning biomedical and general biological information and the SW , and will benefit from relevant advances in high performance "big data processing" (e.g. Ref [53]), though it is notable that this large body of work rarely touches upon probability.…”
Section: Other Efforts After or Relevant To Pcastmentioning
confidence: 99%
“…Q-UEL owes a considerable debt to many efforts concerning biomedical and general biological information and the SW , and will benefit from relevant advances in high performance "big data processing" (e.g. Ref [53]), though it is notable that this large body of work rarely touches upon probability.…”
Section: Other Efforts After or Relevant To Pcastmentioning
confidence: 99%
“…For example, Kepner et al compared different database technologies such as Cassandra, Oracle and Accumulo, with the latter offering the best performance. Even though they achieve 115 million inserts per second in an Accumulo database, they require a vast amount of resources, specifically, 216 Accumulo nodes and 1,296 ingest processes, with an average performance of 100,000 entries/second per ingest process [11].…”
Section: A State Of the Artmentioning
confidence: 99%
“…of the four largest computing ecosystems: supercomputing, enterprise computing, big data, and traditional databases. The MIT SuperCloud has spurred the development of a number of cross-ecosystem innovations in high performance databases [31], [32], database management [33], data protection [34], database federation [35], [36], data analytics [37] and system monitoring [38].…”
Section: Experimental Environmentmentioning
confidence: 99%