2018
DOI: 10.21105/joss.00726
|View full text |Cite
|
Sign up to set email alerts
|

mlpack 3: a fast, flexible machine learning library

Abstract: MLPACK is a state-of-the-art, scalable, multi-platform C++ machine learning library released in late 2011 offering both a simple, consistent API accessible to novice users and high performance and flexibility to expert users by leveraging modern features of C++. ML-PACK provides cutting-edge algorithms whose benchmarks exhibit far better performance than other leading machine learning libraries. MLPACK version 1.0.3, licensed under the LGPL, is available at http://www.mlpack.org.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
115
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 119 publications
(116 citation statements)
references
References 6 publications
0
115
1
Order By: Relevance
“…We use the publicly available mlpack kmeans program in mlpack [12]; we run it as $ mlpack_kmeans -i dataset.csv -I centroids.csv -c $k -v -e -a $algorithm where $k is the number of clusters and $algorithm is the algorithm to be used. [28,35,30].…”
Section: Methodsmentioning
confidence: 99%
“…We use the publicly available mlpack kmeans program in mlpack [12]; we run it as $ mlpack_kmeans -i dataset.csv -I centroids.csv -c $k -v -e -a $algorithm where $k is the number of clusters and $algorithm is the algorithm to be used. [28,35,30].…”
Section: Methodsmentioning
confidence: 99%
“…Besides exploiting the structure of the input data and the learning task, the problem of learning models over databases can also benefit tremendously from database system techniques. Recent work [50] showed non-trivial speedups (several orders of magnitude) brought by code optimization for machine learning workloads over state-of-the-art systems such as TensorFlow [1], R [46], Scikit-learn [44], and mlpack [13]. Prime examples of code optimizations leading to such performance improvements include:…”
Section: Database Systems Considerationsmentioning
confidence: 99%
“…Although this step serves only for coarsening the data representation, it dominates the computation cost of the first four steps. Two state-of-the-art K-means++ implementations were tested, K-MeansRex [26], and scalable mlpack package [27]. For a test run with the data points in the range of 1 to 100K (d = 2, n c = 100), K-means++ from mlpack was 1.82 times faster on average in execution than K-MeansRex's implementation.…”
Section: Clustering Stepmentioning
confidence: 99%