Proceedings of the 2012 SIAM International Conference on Data Mining 2012
DOI: 10.1137/1.9781611972825.80
|View full text |Cite
|
Sign up to set email alerts
|

A Flexible Open-Source Toolbox for Scalable Complex Graph Analysis

Abstract: The Knowledge Discovery Toolbox (KDT) enables domain experts to perform complex analyses of huge datasets on supercomputers using a high-level language without grappling with the difficulties of writing parallel code, calling parallel libraries, or becoming a graph expert. KDT provides a flexible Python interface to a small set of high-level graph operations; composing a few of these operations is often sufficient for a specific analysis. Scalability and performance are delivered by linking to a state-of-the-a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
35
0
1

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 63 publications
(36 citation statements)
references
References 23 publications
(18 reference statements)
0
35
0
1
Order By: Relevance
“…For example, some distributed graph processing systems, such as KDT [34], PowerGraph [18], and GPS [42], use graph partitioning techniques that can reduce the communication cost significantly when executing some algorithms on input graphs with skewed degree distributions. In this paper we focus on optimizations that are algorithmic and do not appear to have any system-level equivalents.…”
Section: Name Description Section Strongly Connected Components (Scc)mentioning
confidence: 99%
See 1 more Smart Citation
“…For example, some distributed graph processing systems, such as KDT [34], PowerGraph [18], and GPS [42], use graph partitioning techniques that can reduce the communication cost significantly when executing some algorithms on input graphs with skewed degree distributions. In this paper we focus on optimizations that are algorithmic and do not appear to have any system-level equivalents.…”
Section: Name Description Section Strongly Connected Components (Scc)mentioning
confidence: 99%
“…References [34,18,42] describe graph partitioning techniques for assigning graph vertices to machines, with the goal of reducing the communication cost of algorithms. These techniques are effective when vertices send the same message to all of their neighbors in each phase of the algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…The Knowledge Discovery Toolbox [23] is a flexible Python-based open-source toolbox for implementing complex graph algorithms and executing them on high-performance parallel computers. KDT achieves high performance by invoking linear-algebraic computational primitives supplied by a parallel C++/MPI backend, the Combinatorial BLAS [7].…”
Section: Sejits For Performancementioning
confidence: 99%
“…The Knowledge Discovery Toolbox [23], [24] is a flexible open-source toolkit for complex graph algorithms on highperformance parallel computers. KDT targets two classes of users: domain-expert analysts who are not graph experts, who use KDT by invoking existing routines from Python, and graph-algorithm developers who write Python code that invokes and composes KDT computational primitives.…”
Section: B Kdt Filters In Pythonmentioning
confidence: 99%
“…The Knowledge Discovery Toolbox [32] is a flexible, Python-based, open-source toolbox for implementing complex graph algorithms and executing them on high-performance parallel computers. KDT achieves high performance by invoking linear-algebraic computational primitives supplied by a parallel C++/MPI backend -the Combinatorial BLAS [13].…”
Section: Introductionmentioning
confidence: 99%