2010
DOI: 10.1007/s10619-010-7067-2
|View full text |Cite
|
Sign up to set email alerts
|

Cardinality estimation and dynamic length adaptation for Bloom filters

Abstract: Bloom filters are extensively used in distributed applications, especially in distributed databases and distributed information systems, to reduce network requirements and to increase performance. In this work, we propose two novel Bloom filter features that are important for distributed databases and information systems. First, we present a new approach to encode a Bloom filter such that its length can be adapted to the cardinality of the set it represents, with negligible overhead with respect to computation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
49
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 49 publications
(51 citation statements)
references
References 45 publications
0
49
0
Order By: Relevance
“…The resulting Bloom filter has no false negative, which means the query result of any element y ∈ S 1 ∩ S 2 against BF S 1 ∩S 2 is always true. The false positive probability of the resulting Bloom filter is no higher than either of the constituent Bloom filter [38]. Note that due to collisions, it is possible that the jth bit is set in BF S 1 by an element in S 1 − S 1 ∩ S 2 and jth bit is set in BF S 2 by an element in S 2 − S 1 ∩ S 2 .…”
Section: Bloom Filtersmentioning
confidence: 99%
“…The resulting Bloom filter has no false negative, which means the query result of any element y ∈ S 1 ∩ S 2 against BF S 1 ∩S 2 is always true. The false positive probability of the resulting Bloom filter is no higher than either of the constituent Bloom filter [38]. Note that due to collisions, it is possible that the jth bit is set in BF S 1 by an element in S 1 − S 1 ∩ S 2 and jth bit is set in BF S 2 by an element in S 2 − S 1 ∩ S 2 .…”
Section: Bloom Filtersmentioning
confidence: 99%
“…These bit arrays are similar to those employed in traditional Bloom filters and is supported by a sufficiently large body of research work [14], [16], [17] that allows us to estimate number of documents reachable for a multi-concept query solely based on these bit arrays. Similar to level 1, level 2(TSBF 2,P ) also contains multiple bit arrays each representing different multi-concept queries that whose concepts have C as the least common ancestor in the ontology hierarchy for which P has at least one qualified document in its local document collection (TSBF 2,P (C)).…”
Section: Two-level Semantic Bloom Filter (Tsbf)mentioning
confidence: 99%
“…Research community has proposed many works to estimate the cardinality(i.e. number of elements) of an original set solely based on its Bloom filter bit array [14], [16], [17]. For our work we used the work presented by authors of [16].…”
Section: ) Estimating Set Intersection Based Cardinality From Bloom mentioning
confidence: 99%
See 1 more Smart Citation
“…A single Bloom filter is used for all grams generated by a single string S. Papapetrou et al [22] conclude, that the optimal number of hash functions to do cardinality estimation using Bloom filters is 1. Based on this we fix k = 1 and only use a single hash function to build and query Bloom filters throughout the rest of the paper.…”
Section: String Matching Using Bloom Filtersmentioning
confidence: 99%