2011 IEEE 27th International Conference on Data Engineering 2011
DOI: 10.1109/icde.2011.5767930
|View full text |Cite
|
Sign up to set email alerts
|

SystemML: Declarative machine learning on MapReduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
202
0
2

Year Published

2011
2011
2019
2019

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 259 publications
(207 citation statements)
references
References 12 publications
3
202
0
2
Order By: Relevance
“…From a performance perspective, Stonebraker et al [6] propose the reuse of carefully optimized external C++ libraries as user defined functions for linear algebra calculations, but they leave the problem with resource management and suitable data structures in this "hybrid" world yet unsolved. Another approach based on Hadoop is SystemML [1], where basic linear algebra primitives are addressable via a subset of the R language with a MapReduce backend. Few commercial data warehouse vendors already offer minor support for linear algebra operations integrated in the database engine, but to the best of our knowledge there is no solution which integrates transparent optimization based on topological features of the matrix (e.g., sparsity).…”
Section: Linear Algebra In Databasesmentioning
confidence: 99%
See 1 more Smart Citation
“…From a performance perspective, Stonebraker et al [6] propose the reuse of carefully optimized external C++ libraries as user defined functions for linear algebra calculations, but they leave the problem with resource management and suitable data structures in this "hybrid" world yet unsolved. Another approach based on Hadoop is SystemML [1], where basic linear algebra primitives are addressable via a subset of the R language with a MapReduce backend. Few commercial data warehouse vendors already offer minor support for linear algebra operations integrated in the database engine, but to the best of our knowledge there is no solution which integrates transparent optimization based on topological features of the matrix (e.g., sparsity).…”
Section: Linear Algebra In Databasesmentioning
confidence: 99%
“…Analytical algorithms in those fields are often composed of linear algebra operations, including matrix-matrix, matrix-vector and elementwise multiplications. Moreover, linear algebra operations form the building blocks of machine learning algorithms [1] used in data warehousing environments, which is a common domain for commercial databases.…”
Section: Introductionmentioning
confidence: 99%
“…RHIPE [23] and SystemML [7] also aim to extend R for largescale distributed computations. All the cited efforts to merge R with MapReduce suffer from two issues: (1) R is based on C and hence not native to Java-based frameworks such as Hadoop and Spark.…”
Section: Related Workmentioning
confidence: 99%
“…Data analysts aim to extract knowledge and to better understand their data; they do not wish to learn complex programming paradigms and new languages. Recent implementations attempt to provide highlevel interfaces for data mining and associated algorithms which are compiled to low-level primitives [6], [7]. Such developments tend to require knowledge of the underlying distributed system effectively shifting the focus from data mining to individual algorithm implementation.…”
Section: Introductionmentioning
confidence: 99%
“…Large scale graph mining using HADOOP has attracted significant interests [4,5,6] due to its simplicity, fault tolerance, and low maintenance costs, compared to graph mining based on MPI [7] and Bulk Synchronous Parallel model [8].…”
Section: Related Workmentioning
confidence: 99%