2019
DOI: 10.1016/j.knosys.2018.09.019
|View full text |Cite
|
Sign up to set email alerts
|

AMIDST: A Java toolbox for scalable probabilistic machine learning

Abstract: The AMIDST Toolbox is a software for scalable probabilistic machine learning with a special focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementations of Bayesian learning algorithms for either streaming or batch data. These algorithms are based on a flexible variational message passing scheme, wh… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 9 publications
0
13
0
Order By: Relevance
“…Lastly, we discuss the proposed modeling framework in a more general setting, linking model validation and concept drift analysis. The proposed methods are released as part of an open-source toolbox for scalable probabilistic machine learning (http://www.amidsttoolbox.com) [6,7,8].…”
Section: Introductionmentioning
confidence: 99%
“…Lastly, we discuss the proposed modeling framework in a more general setting, linking model validation and concept drift analysis. The proposed methods are released as part of an open-source toolbox for scalable probabilistic machine learning (http://www.amidsttoolbox.com) [6,7,8].…”
Section: Introductionmentioning
confidence: 99%
“…The R package ramidst (Salmeron et al 2016) links to the AMIDST (Masegosa et al 2019) toolbox for scalable, but only approximate inference algorithms for BNs written in Java (Gosling, Joy, Steele, and Bracha 2000). The package was recently removed from the CRAN repository for check issues, but the Java toolbox remains functional.…”
Section: Bayesnetbpmentioning
confidence: 99%
“…Edward can exploit GPUs for parallelism. AMIDST [44] is a Java toolbox for scalable probabilistic machine learning and allows a user to build probabilistic graphical models and perform scalable inference. To process large data streams and large-scale datasets, AMIDST employs Apache Flink [23] and Apache Spark [58].…”
Section: Distributed Machine Learning Frameworkmentioning
confidence: 99%
“…Because gossiping can be stopped on the cluster nodes after a period of time and started again, DiSC can eciently recompute the family scores as new data are produced. Unlike AMIDST [44] that is designed for a streaming scenario where new data arrive continuously, we focus on stored datasets that may be updated over time but not in real-time. Suppose a new data block with some number of data instances is added to a cluster node.…”
Section: Recomputing Family Scores On New Datamentioning
confidence: 99%