Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2020
DOI: 10.48550/arxiv.2002.04803
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 0 publications
0
14
0
Order By: Relevance
“…We perform k-Means clustering and learn 1000 cluster centroids (as many as the concepts in each level) on the training sets. We use the cuML k-Means implementation [47], repeat clustering with 3 seeds and compute cluster assignments for test samples using the centroids that gave the lowest inertia across the 3 runs. We report clustering metrics, including cluster purity and adjusted [60] and normalized mutual information scores between the cluster assignments and the labels of the test samples.…”
Section: Ig-1bmentioning
confidence: 99%
“…We perform k-Means clustering and learn 1000 cluster centroids (as many as the concepts in each level) on the training sets. We use the cuML k-Means implementation [47], repeat clustering with 3 seeds and compute cluster assignments for test samples using the centroids that gave the lowest inertia across the 3 runs. We report clustering metrics, including cluster purity and adjusted [60] and normalized mutual information scores between the cluster assignments and the labels of the test samples.…”
Section: Ig-1bmentioning
confidence: 99%
“…An excellent general overview that digs deeper into the mathematical background than this review is the "high-bias, low variance introduction to Machine Learning" by Mehta et al, 7 recent applications of ML to materials science are covered by Schmidt et al 30 But also many textbooks cover the fundamentals of machine learning; e.g., Tibshirani and Friedman, 31 Shalev-Shwartz and Ben-David 32 as well as Bishop (from a more Bayesian point of view) 33 focus more on the theoretical background of statistical learning, whereas Géron provides a "how-to" for the actual implementation, also of neural network (NN) architectures, using popular Python frameworks, 34 which were recently reviewed by Rascka et al 35…”
Section: Discussionmentioning
confidence: 99%
“…To accelerate the experiments, we implement the entire evaluation pipeline on GPU utilizing the RAPIDS GPU data science framework. Data loading and preprocessing are boosted by cuDF [21] while scikit-learn models and scorers are replaced with their GPU counterparts in cuML library [11].…”
Section: Gpu Accelerated Exhaustive Searchmentioning
confidence: 99%
“…• A suite of GPU-optimized cuML [11] models including scikit-learn counterparts, MLPs and Xgboost [12] are added to the Bayesmark toolkit to accelerate single model evaluation.…”
Section: Introductionmentioning
confidence: 99%