Proceedings of the Tenth International Workshop on Data Management on New Hardware 2014
DOI: 10.1145/2619228.2619234
|View full text |Cite
|
Sign up to set email alerts
|

Vectorized Bloom filters for advanced SIMD processors

Abstract: Analytics are at the core of many business intelligence tasks. Efficient query execution is facilitated by advanced hardware features, such as multi-core parallelism, shared-nothing low-latency caches, and SIMD vector instructions. Only recently, the SIMD capabilities of mainstream hardware have been augmented with wider vectors and non-contiguous loads termed gathers. While analytical DBMSs minimize the use of indexes in favor of scans based on sequential memory accesses, some data structures remain crucial. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
34
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 48 publications
(34 citation statements)
references
References 22 publications
0
34
0
Order By: Relevance
“…For example, our implementation of SIMD GALLOPING relies on a scalar binary search like the conventional galloping. However, recent Intel processors have introduced gather instructions (vpgatherdd) that can retrieve at once several integers from various locations . Such instructions could accelerate binary search.…”
Section: Fast Intersectionsmentioning
confidence: 99%
“…For example, our implementation of SIMD GALLOPING relies on a scalar binary search like the conventional galloping. However, recent Intel processors have introduced gather instructions (vpgatherdd) that can retrieve at once several integers from various locations . Such instructions could accelerate binary search.…”
Section: Fast Intersectionsmentioning
confidence: 99%
“…With ViViD Cuckoo we could address the two main issues that caused the build to become a bottleneck for the Cuckoo Join: the chained unpredictable control dependencies and the data dependencies between cuckoo paths. We extended the methods described by [Polychroniou et al 2015] and [Polychroniou and Ross 2014] to use new capabilities of AVX-512 and to take advantage of logical operations and the dispersion provided by the hash functions.…”
Section: Vivid Cuckoo Hash Implementationmentioning
confidence: 99%
“…The left side receives all the keys that will be replaced for new ones from the relation, in the example the keys 675 and 76. This process is based on [Polychroniou and Ross 2014] approach using a permutation table kept in memory to speed the process.…”
Section: Vivid Cuckoo Hash Implementationmentioning
confidence: 99%
“…In databases, however, regular expressions are used as a boolean filter and the matching can skip large portions of each input string, making lockstep processing wasteful. Vectorization using data-parallel processing of multiple input instances was used to accelerate database operators on CPUs and Xeon Phi co-processors [8,9]. Our design not only traverses the DFA for multiple strings in parallel, but also accesses the strings at arbitrary offsets rather than in lockstep, buffers multiple bytes per access, and replaces strings as soon as they are accepted or rejected by the DFA, in order to fully utilize the SIMD lanes.…”
Section: Related Workmentioning
confidence: 99%