Proceedings of the 2017 ACM International Conference on Management of Data 2017
DOI: 10.1145/3035918.3035937
|View full text |Cite
|
Sign up to set email alerts
|

The BUDS Language for Distributed Bayesian Machine Learning

Abstract: We describe BUDS, a declarative language for succinctly and simply specifying the implementation of large-scale machine learning algorithms on a distributed computing platform. The types supported in BUDS-vectors, arrays, etc.-are simply logical abstractions useful for programming, and do not correspond to the actual implementation. In fact, BUDS automatically chooses the physical realization of these abstractions in a distributed system, by taking into account the characteristics of the data. Likewise, there … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(8 citation statements)
references
References 27 publications
(25 reference statements)
0
8
0
Order By: Relevance
“…Similarly, Sys-temML [9] expresses machine learning algorithms by a simplified R and python like-language, and automatically translates the program into execution plan on top of Spark. BUDS [26] is a language for Bayesian machine learning, specifically, Markov chain simulation, allowing distributed computation on types of sets, maps, vectors and matrices. In addition, Datalog and its extensions [10,22,36] are also used to integrate statistical and machine learning into data management systems.…”
Section: Related Workmentioning
confidence: 99%
“…Similarly, Sys-temML [9] expresses machine learning algorithms by a simplified R and python like-language, and automatically translates the program into execution plan on top of Spark. BUDS [26] is a language for Bayesian machine learning, specifically, Markov chain simulation, allowing distributed computation on types of sets, maps, vectors and matrices. In addition, Datalog and its extensions [10,22,36] are also used to integrate statistical and machine learning into data management systems.…”
Section: Related Workmentioning
confidence: 99%
“…Another related research thread includes the implementation of linear algebra systems on data management systems [1], [24]. There is increasing interest in building systems with the aim to achieve closer integration of ML with data management [1], [6], [9], [14], [20], [19], [13]. In the context of nonlinear models, [28] is concerned with Bayesian Markov Chain Monte Carlo as applied to Factorization Machine (FM) models and [33] present the algorithms for Support Vector Machine (SVM).…”
Section: Related Workmentioning
confidence: 99%
“…Machine learning frameworks SystemML [15], Ten-sorFlow [12], PyTorch [32], Mahout Samsara [1] and BUDS [22] provide domain specific languages (DSLs) and APIs that support linear algebra operations and data structures, probability distribution and/or deep learning functions, as well as useful ML-centric features, such as automatic differentation. These systems are more than ML libraries as they apply algebraic rewrites and operator fusion [20], [16] to optimize users' code.…”
Section: Related Workmentioning
confidence: 99%