LLAMA: Efficient graph analytics using Large Multiversioned Arrays

Macko, Peter; Marathe, Virendra J.; Margo, Daniel; Seltzer, Margo

doi:10.1109/icde.2015.7113298

Cited by 108 publications

(60 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…General data flow systems are able to efficiently run vertex-centric programs, and additionally provide the machinery for conducting relational operations on the graph data. While recent work shows that single-machine solutions easily outperform distributed systems on the dataset sizes used in academia [29,112,119,123], we think that these single machine solutions will stay constrained to academic use cases. In industry scenarios, the value of distributed graph processing on general dataflow systems comes not from the performance of the algorithms, but from the ability to create complex pipelines mixing ETL, machine learning and graph-processing tasks, using a single system.…”

Section: Discussion Of Results and Future Research Directionsmentioning

confidence: 99%

See 1 more Smart Citation

Scaling data mining in massively parallel dataflow systems

Schelter

2014

Proceedings of the 2014 SIGMOD PhD Symposium

View full text Add to dashboard Cite

Section: Discussion Of Results and Future Research Directionsmentioning

confidence: 99%

“…Recently, it has been shown that the large networks commonly used in academia can be efficiently processed on single machines [29,112,119,123].…”

Section: Related Workmentioning

confidence: 99%

Scaling data mining in massively parallel dataflow systems

Schelter

2014

Proceedings of the 2014 SIGMOD PhD Symposium

View full text Add to dashboard Cite

“…As a result, it introduces unnecessary random I/Os. To deal with this drawback, one recent work has proposed to remove buffer managers [14]. Besides, there are also alternative approaches which utilize index structures such as log structured merge tree [18] or fractal tree [4] to handle update-intensive workload.…”

Section: Buffer Manager On Databasementioning

confidence: 99%

“…In the figure, the interval [14,14] actually represents an individual data page at the position 14 on disk.…”

Section: Algorithm 1 Trivial Algorithmmentioning

confidence: 99%

An I/O-Efficient Buffer Batch Replacement Policy for Update-Intensive Graph Databases

Zhou

Zhang

et al. 2016

Data Sci. Eng.

View full text Add to dashboard Cite

With the proliferation of graph-based applications, such as social network management and Web structure mining, update-intensive graph databases have become an important component of today's data management platforms. Several techniques have been recently proposed to exploit locality on both data organization and computational model in graph databases. However, little investigation has been conducted on buffer management of graph databases. To the best of our knowledge, current buffer managers of graph databases suffer performance loss caused by unnecessary random I/O access. To solve this problem, we develop a novel batch replacement policy for buffer management. This policy enables us to maximally exploit sequential I/O to improve the performance of graph database. However, trivial solution produces impractical maintenance for replacement plan with maximal sequential I/O. To enable the policy, we first devise a segment treebased buffer manager to efficiently maintain a optimal replacement plan. Unfortunately, segment tree-based solution becomes bottleneck in multi-core environment. To remedy this weakness, a B-tree-based buffer manager is further proposed. Extensive experiments on real-world and synthetic datasets demonstrate the superiority of our method.

show abstract

“…However, since developing distributed graph algorithm is challenging, some researchers divert their attention to design the graph processing system that handle large scale graphs on a single PC. The research endeavours in this direction have delivered the systems such as GraphChi [17], PathGraph [45], GraphQ [39], LLAMA [27] and GridGraph [51]. However, these systems suffer from the limited degree of parallelism in conventional processors.…”

Section: Introductionmentioning

confidence: 99%

WolfPath: Accelerating Iterative Traversing-Based Graph Processing Algorithms on GPU

Zhu

et al. 2017

Int J Parallel Prog

View full text Add to dashboard Cite

There is the significant interest nowadays in developing the frameworks of parallelizing the processing for the large graphs such as social networks, Web graphs, etc. Most parallel graph processing frameworks employ iterative processing model. However, by benchmarking the state-of-art GPU-based graph processing frameworks, we observed that the performance of iterative traversing-based graph algorithms (such as Bread First Search, Single Source Shortest Path and so on) on GPU is limited by the frequent data exchange between host and GPU. In order to tackle the problem, we develop a GPU-based graph framework called WolfPath to accelerate the processing of iterative traversing-based graph processing algorithms. In WolfPath, the iterative process is guided by the graph diameter to eliminate the frequent data exchange between host and GPU. To accomplish this goal, WolfPath proposes a data structure called Layered Edge list to represent the graph, from which the graph diameter is known before the start of graph processing. In order to enhance the applicability of our WolfPath 123Int J Parallel Prog framework, a graph preprocessing algorithm is also developed in this work to convert any graph into the format of the Layered Edge list. We conducted extensive experiments to verify the effectiveness of WolfPath. The experimental results show that WolfPath achieves significant speedup over the state-of-art GPU-based in-memory and out-of-memory graph processing frameworks.

show abstract

LLAMA: Efficient graph analytics using Large Multiversioned Arrays

Cited by 108 publications

References 23 publications

Scaling data mining in massively parallel dataflow systems

Scaling data mining in massively parallel dataflow systems

An I/O-Efficient Buffer Batch Replacement Policy for Update-Intensive Graph Databases

WolfPath: Accelerating Iterative Traversing-Based Graph Processing Algorithms on GPU

Contact Info

Product

Resources

About