2008
DOI: 10.14778/1453856.1453935
|View full text |Cite
|
Sign up to set email alerts
|

Cleaning uncertain data with quality guarantees

Abstract: Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to "clean" the database (e.g., by probing some sensor data values to get their latest values), we address the problem of choosing the set of uncertai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
57
0

Year Published

2010
2010
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 78 publications
(59 citation statements)
references
References 35 publications
(80 reference statements)
0
57
0
Order By: Relevance
“…e.g. data collected from sensor networks [15], information extraction from the web [16,17], data integration [18,19], data cleaning [20][21][22][23][24][25], social networks [26,27], radio frequency identification RFID [7]. Due to various reasons that differ from one application to another, the uncertainty is inherent in such applications.…”
Section: Related Workmentioning
confidence: 99%
“…e.g. data collected from sensor networks [15], information extraction from the web [16,17], data integration [18,19], data cleaning [20][21][22][23][24][25], social networks [26,27], radio frequency identification RFID [7]. Due to various reasons that differ from one application to another, the uncertainty is inherent in such applications.…”
Section: Related Workmentioning
confidence: 99%
“…The probabilistic models of the uncertain data fall into two categories: one is the possible world model [13,14] [20], and the other is the probability function model [21], in which the existence of a record is represented by a probability density function. Till now, the research for query processing over uncertain data mainly focuses on nearest neighbor (NN) problem [21], K-nearest neighbor (K-NN) problem [22], join operation [23], ranking operation [20], and top-K queries [24].…”
Section: Related Workmentioning
confidence: 99%
“…According to the possible world model [13][14][15], the skyline over uncertain data is not determinate, and any possible skyline has an existence probability. In such a case, people may ask that "what are the skylines of the data with existence probabilities greater than a given constant p?"…”
Section: Introductionmentioning
confidence: 99%
“…The mobile trajectory data precision is not high, which is unable to accurately describe the user's position every time when the mobile phone accesses the base station. The literature (Cheng, Chen and Xie, 2008)puts forward the precision evaluation method based on the semantic data, and gives a data cleaning solution. But its work is not for trajectory data.…”
Section: Introductionmentioning
confidence: 99%