2008
DOI: 10.14778/1453856.1453944
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting shared correlations in probabilistic databases

Abstract: There has been a recent surge in work in probabilistic databases, propelled in large part by the huge increase in noisy data sourcesfrom sensor data, experimental data, data from uncurated sources, and many others. There is a growing need for database management systems that can efficiently represent and query such data. In this work, we show how data characteristics can be leveraged to make the query evaluation process more efficient. In particular, we exploit what we refer to as shared correlations where the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
45
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(45 citation statements)
references
References 24 publications
0
45
0
Order By: Relevance
“…Recently, there has been an escalating arms race to add sophisticated analytics into the RDBMS with each iteration bringing more sophisticated tools into the RDBMS. So far, this arms race has centered around bringing individual statistical data mining techniques into an RDBMS, notably Support Vector Machines [35], Monte Carlo sampling [27,51], Conditional Random Fields [25,49], and Graphical Models [43,50]. Our effort is inspired by these approaches, but the goal of this work is to understand the extent to which we can handle these analytics tasks with a single unified architecture.…”
mentioning
confidence: 99%
“…Recently, there has been an escalating arms race to add sophisticated analytics into the RDBMS with each iteration bringing more sophisticated tools into the RDBMS. So far, this arms race has centered around bringing individual statistical data mining techniques into an RDBMS, notably Support Vector Machines [35], Monte Carlo sampling [27,51], Conditional Random Fields [25,49], and Graphical Models [43,50]. Our effort is inspired by these approaches, but the goal of this work is to understand the extent to which we can handle these analytics tasks with a single unified architecture.…”
mentioning
confidence: 99%
“…The work of [19] represents dependencies resulting from queries with a tree of RV assignments. We are also investigating the shared correlations work of [20].…”
Section: Knowledge Application Is Conditioningmentioning
confidence: 99%
“…Note that, our model is generic enough to cover the previously proposed model that assumes either object independence [6,25] or global correlations [28,29,31,16]. Specifically, they, respectively, correspond to two special cases: 1) the database has multiple LCPs, each containing only one object, and 2) the database has only one LCP, which contains all objects correlated with each other.…”
Section: Data Modelmentioning
confidence: 99%
“…Furthermore, the probabilistic graphical model (PGM) [14] has been adopted in some works [28,29,31] to represent probabilistic relational table(s) with globally correlated data (i.e., every pair of uncertain objects are assumed to be correlated). These works usually consider relational queries on these tables, including select, project, and join.…”
Section: Introductionmentioning
confidence: 99%