Scalable Probabilistic Similarity Ranking in Uncertain Databases

Bernecker, Thomas; Kriegel, Hans‐Peter; Mamoulis, Nikos; Renz, Matthias; Zuefle, Andreas

doi:10.1109/tkde.2010.78

Cited by 34 publications

(24 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For the verification step, we perform, for each remaining candidate B, a probabilistic nearest neighbor query using the algorithm proposed in [6] for probabilistic ranking queries (and setting k = 1). This algorithm takes Q, B and D \ B (in particular this set can be reduced, as shown in Appendix D) as input and returns P (N NB(Q)) which is equivalent to P (RN NQ(B)).…”

Section: Verificationmentioning

confidence: 99%

Efficient probabilistic reverse nearest neighbor query processing on uncertain data

et al. 2011

Self Cite

View full text Add to dashboard Cite

Given a query object q, a reverse nearest neighbor (RNN) query in a common certain database returns the objects having q as their nearest neighbor. A new challenge for databases is dealing with uncertain objects. In this paper we consider probabilistic reverse nearest neighbor (PRNN) queries, which return the uncertain objects having the query object as nearest neighbor with a sufficiently high probability. We propose an algorithm for efficiently answering PRNN queries using new pruning mechanisms taking distance dependencies into account. We compare our algorithm to state-ofthe-art approaches recently proposed. Our experimental evaluation shows that our approach is able to significantly outperform previous approaches. In addition, we show how our approach can easily be extended to PRkNN (where k > 1) query processing for which there is currently no efficient solution.

show abstract

Section: Verificationmentioning

confidence: 99%

Efficient probabilistic reverse nearest neighbor query processing on uncertain data

et al. 2011

Self Cite

View full text Add to dashboard Cite

show abstract

“…These probabilities can be computed in a single database scan. We can process the p t j successively by means of the Poisson binomial recurrence [17], as proposed in [18]. Therefore, let P t i,j be the probability that, out of the j objects processed so far, exactly i objects have a higher score than q.…”

Section: Initial Computationmentioning

confidence: 99%

Continuous Inverse Ranking Queries in Uncertain Streams

Bernecker

Kriegel

Mamoulis

et al. 2011

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

Abstract. This paper introduces a scalable approach for continuous inverse ranking on uncertain streams. An uncertain stream is a stream of object instances with confidences, e.g. observed positions of moving objects derived from a sensor. The confidence value assigned to each instance reflects the likelihood that the instance conforms with the current true object state. The inverse ranking query retrieves the rank of a given query object according to a given score function. In this paper we present a framework that is able to update the query result very efficiently, as the stream provides new observations of the objects. We will theoretically and experimentally show that the query update can be performed in linear time complexity. We conduct an experimental evaluation on synthetic and real-world data, which demonstrates the efficiency of our approach.

show abstract

“…At the same time, uncertainty is inherent in many datasets due to various factors like noise [1], privacy protection strategy [2], incompleteness of data and delay or loss in data transfer [3]. In this paper, we connect top k query and uncertain data model, and propose a novel top (k 1 , k 2 ) query in uncertain datasets.…”

Section: Introductionmentioning

confidence: 99%

Top (k1,k2) Query in Uncertain Datasets

Liu

Lin

Jia

2015

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

SUMMARYIn this letter, we propose a novel kind of uncertain query, top (k 1 , k 2 ) query. The x-tuple model and the possible world semantics are used to describe data objects in uncertain datasets. The top (k 1 , k 2 ) query is going to find k 2 x-tuples with largest probabilities to be the result of top k 1 query in a possible world. Firstly, we design a basic algorithm for top (k 1 , k 2 ) query based on dynamic programming. And then some pruning strategies are designed to improve its efficiency. An improved initialization method is proposed for further acceleration. Experiments in real and synthetic datasets prove the performance of our methods. key words: uncertain query, top k, x-tuple, possible world

show abstract

Scalable Probabilistic Similarity Ranking in Uncertain Databases

Cited by 34 publications

References 28 publications

Efficient probabilistic reverse nearest neighbor query processing on uncertain data

Efficient probabilistic reverse nearest neighbor query processing on uncertain data

Continuous Inverse Ranking Queries in Uncertain Streams

Top (<i>k</i><sub>1</sub>,<i>k</i><sub>2</sub>) Query in Uncertain Datasets

Contact Info

Product

Resources

About