2012
DOI: 10.14778/2212351.2212354
|View full text |Cite
|
Sign up to set email alerts
|

Distributed GraphLab

Abstract: While high-level data parallel frameworks, like MapReduce, simplify the design and implementation of large-scale data processing systems, they do not naturally or efficiently support many important data mining and machine learning algorithms and can lead to inefficient learning systems. To help fill this critical void, we introduced the GraphLab abstraction which naturally expresses asynchronous, dynamic, graph-parallel computation while ensuring data consistency and achieving a high de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
67
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 1,333 publications
(93 citation statements)
references
References 21 publications
0
67
0
Order By: Relevance
“…It is also worth noting that this workflow is essentially a materials science adaptation of existing similar workflows of data-driven analytics in other domains, as most of the advanced techniques for big data management and informatics come from the field of computer science and more specifically high-performance data mining, [42][43][44][45][46][47][48][49][50] via applications in many different [64][65][66][67][68] and social media analytics, 69-71 among many others.…”
Section: Knowledge Discovery Workflow For Materials Informaticsmentioning
confidence: 99%
“…It is also worth noting that this workflow is essentially a materials science adaptation of existing similar workflows of data-driven analytics in other domains, as most of the advanced techniques for big data management and informatics come from the field of computer science and more specifically high-performance data mining, [42][43][44][45][46][47][48][49][50] via applications in many different [64][65][66][67][68] and social media analytics, 69-71 among many others.…”
Section: Knowledge Discovery Workflow For Materials Informaticsmentioning
confidence: 99%
“…However, writing new or customizing existing algorithms is too costly because all algorithms need to be implemented (or modified) by following fixed distributed runtime plans and underlying data-parallel framework. To address this limitation, researchers proposed several approaches of fast implementing approaches as SystemML [10], NIMBLE [11], MLbase [12], Distributed GraphLab [13], and Tupleware [14]. These approaches are classified as "Declarative ML" [54].…”
Section: Utilization Of Mlas In Cloud Computingmentioning
confidence: 99%
“…Therefore, many popular computational algorithms cannot be directly utilized in cloud computing architecture [8]. To address this limitation, researchers have started designing scalable performant machine learning applications that run in the cloud computing environment [9][10][11][12][13][14]. In addition, many researchers emphasized the importance of machine learning algorithms (MLAs) to intrusion detection analysis using cloud computing technology [15][16][17][18].…”
mentioning
confidence: 99%
“…It is inspired by probabilistic graphical models [Jordan et al 1999] that have been popular for representing variable dependence in multi-variate optimization problems. These problems can be parallelized [Low et al 2010] according to the associated graph structure.…”
Section: Related Workmentioning
confidence: 99%