2015 IEEE High Performance Extreme Computing Conference (HPEC) 2015
DOI: 10.1109/hpec.2015.7396313
|View full text |Cite
|
Sign up to set email alerts
|

A cloud-based approach to big graphs

Abstract: Data sizes in today's Big Data age presents a profound scalability challenge to modeling networks as graphs. Historically, memory-based solutions were utilized to cope with high latency incurred by irregular data access common in many natural networks. But current data rates impose both economic and environmental challenges to continually expand the total aggregate system memory to "fit" the graph. Graph scalability has wide-reaching impact since network analysis has expanded beyond its traditional fields into… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2016
2016
2017
2017

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 20 publications
0
10
0
Order By: Relevance
“…Most users deploy Accumulo on a sizable cluster. Graphs as large as 70 trillion edges (1.1 PB) have been analyzed by clusters as large as 1200 machines and 57.6 TB collective memory [25]. This work evaluates single-node performance as a proof of concept and an indicator for how multi-node performance might scale.…”
Section: Performancementioning
confidence: 99%
“…Most users deploy Accumulo on a sizable cluster. Graphs as large as 70 trillion edges (1.1 PB) have been analyzed by clusters as large as 1200 machines and 57.6 TB collective memory [25]. This work evaluates single-node performance as a proof of concept and an indicator for how multi-node performance might scale.…”
Section: Performancementioning
confidence: 99%
“…The Graph500 generator is scalable, can be run in parallel without requiring communication between processors, and has been used to generate some of the largest graphs in the world [Burkhardt & Waring 2015, Kepner et al 2014. Each edge in the graph is defined by a pair of numbers representing the start and end vertices of the edge.…”
Section: A Kernel 0: Generate Graphmentioning
confidence: 99%
“…We refer to such analytics as cued analytics, that is, analytics on selected table subsets. Such analytics are ideally suited for databases by quickly accessing subsets of interest (for example, particular communities in a large social media graph) in contrast to whole-table analytics that may be better suited for parallel processing frameworks such as Hadoop Map-Reduce [19].…”
Section: Experiment: Subgraph Extractionmentioning
confidence: 99%