2010
DOI: 10.14778/1920841.1920942
|View full text |Cite
|
Sign up to set email alerts
|

Scalable probabilistic databases with factor graphs and MCMC

Abstract: Probabilistic databases play a crucial role in the management and understanding of uncertain data. However, incorporating probabilities into the semantics of incomplete databases has posed many challenges, forcing systems to sacrifice modeling power, scalability, or restrict the class of relational algebra formula under which they are closed. We propose an alternative approach where the underlying relational database always represents a single world, and an external factor graph encodes a distribution over pos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
45
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 55 publications
(45 citation statements)
references
References 23 publications
0
45
0
Order By: Relevance
“…Our effort is inspired by these approaches, but the goal of this work is to understand the extent to which we can handle these analytics tasks with a single unified architecture. Of these approaches, MCDB [27] and Wick et al [51] are the most related in that they propose a single unified interface for uncertain data based on sampling and graphical models respectively. In contrast, we consider data analytics techniques that are modeled as convex programming problems.…”
mentioning
confidence: 99%
“…Our effort is inspired by these approaches, but the goal of this work is to understand the extent to which we can handle these analytics tasks with a single unified architecture. Of these approaches, MCDB [27] and Wick et al [51] are the most related in that they propose a single unified interface for uncertain data based on sampling and graphical models respectively. In contrast, we consider data analytics techniques that are modeled as convex programming problems.…”
mentioning
confidence: 99%
“…Nath and Domingos [29] studied how to extend belief propagation on factor graphs with new evidence, but without any modification to the structure of the graph. Wick and McCallum [41] proposed a "query-aware MCMC" method. They designed a proposal scheme so that query variables tend to be sampled more frequently than other variables.…”
Section: Incremental Maintenance Of Statistical Inference and Learningmentioning
confidence: 99%
“…Instead, we use a general and flexible representation of a probabilistic database, proposed by Sen et al [36] and also used by Wick et al [42], that can capture complex correlations among the tuples or the attributes in the database through use of factor graphs, a class of graphical models that generalizes both directed Bayesian networks and undirected Markov networks. More formally we have:…”
Section: Pr(w)mentioning
confidence: 99%
“…These include applications such as information extraction, data integration, computer vision applications, sensor networks, etc., where heavy use of machine learning techniques naturally results in complex correlations among the data. Hence, over the last few years, several probabilistic database systems have been proposed that can manage such correlated databases [37,41,42], with correlations typically captured using graphical models such as factor graphs or Bayesian networks [15]. However, intensional approaches that can process queries over such correlated databases and handle more general classes of queries, are typically much slower than extensional approaches and have poor scalability, leading to a significant efficiency gap between the two approaches [21].…”
mentioning
confidence: 99%
See 1 more Smart Citation