2015 IEEE 31st International Conference on Data Engineering 2015
DOI: 10.1109/icde.2015.7113298
|View full text |Cite
|
Sign up to set email alerts
|

LLAMA: Efficient graph analytics using Large Multiversioned Arrays

Abstract: Abstract-We present LLAMA, a graph storage and analysis system that supports mutability and out-of-memory execution. LLAMA performs comparably to immutable main-memory analysis systems for graphs that fit in memory and significantly outperforms existing out-of-memory analysis systems for graphs that exceed main memory. LLAMA bases its implementation on the compressed sparse row (CSR) representation, which is a read-only representation commonly used for graph analytics. We augment this representation to support… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
57
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 108 publications
(60 citation statements)
references
References 23 publications
0
57
0
Order By: Relevance
“…General data flow systems are able to efficiently run vertex-centric programs, and additionally provide the machinery for conducting relational operations on the graph data. While recent work shows that single-machine solutions easily outperform distributed systems on the dataset sizes used in academia [29,112,119,123], we think that these single machine solutions will stay constrained to academic use cases. In industry scenarios, the value of distributed graph processing on general dataflow systems comes not from the performance of the algorithms, but from the ability to create complex pipelines mixing ETL, machine learning and graph-processing tasks, using a single system.…”
Section: Discussion Of Results and Future Research Directionsmentioning
confidence: 99%
See 1 more Smart Citation
“…General data flow systems are able to efficiently run vertex-centric programs, and additionally provide the machinery for conducting relational operations on the graph data. While recent work shows that single-machine solutions easily outperform distributed systems on the dataset sizes used in academia [29,112,119,123], we think that these single machine solutions will stay constrained to academic use cases. In industry scenarios, the value of distributed graph processing on general dataflow systems comes not from the performance of the algorithms, but from the ability to create complex pipelines mixing ETL, machine learning and graph-processing tasks, using a single system.…”
Section: Discussion Of Results and Future Research Directionsmentioning
confidence: 99%
“…Recently, it has been shown that the large networks commonly used in academia can be efficiently processed on single machines [29,112,119,123].…”
Section: Related Workmentioning
confidence: 99%
“…As a result, it introduces unnecessary random I/Os. To deal with this drawback, one recent work has proposed to remove buffer managers [14]. Besides, there are also alternative approaches which utilize index structures such as log structured merge tree [18] or fractal tree [4] to handle update-intensive workload.…”
Section: Buffer Manager On Databasementioning
confidence: 99%
“…In the figure, the interval [14,14] actually represents an individual data page at the position 14 on disk.…”
Section: Algorithm 1 Trivial Algorithmmentioning
confidence: 99%
“…However, since developing distributed graph algorithm is challenging, some researchers divert their attention to design the graph processing system that handle large scale graphs on a single PC. The research endeavours in this direction have delivered the systems such as GraphChi [17], PathGraph [45], GraphQ [39], LLAMA [27] and GridGraph [51]. However, these systems suffer from the limited degree of parallelism in conventional processors.…”
Section: Introductionmentioning
confidence: 99%