Proceedings of the 2022 International Conference on Management of Data 2022
DOI: 10.1145/3514221.3517911
|View full text |Cite
|
Sign up to set email alerts
|

Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 46 publications
1
2
0
Order By: Relevance
“…Several works have shown the benefits of adapting applications to the underlying platform, e.g., by using SIMD [41,44,45,57]. IBM Power systems are used in previous work, as they integrate well with various accelerators [49][50][51]. Bari et al [12] find that A64FX single-thread performance is low, in line with our findings.…”
Section: Related Worksupporting
confidence: 89%
“…Several works have shown the benefits of adapting applications to the underlying platform, e.g., by using SIMD [41,44,45,57]. IBM Power systems are used in previous work, as they integrate well with various accelerators [49][50][51]. Bari et al [12] find that A64FX single-thread performance is low, in line with our findings.…”
Section: Related Worksupporting
confidence: 89%
“…This approach facilitated the rapid integration of FPGA kernels into existing software and communication patterns. However, it is important to note that the OpenCL programming model was initially designed to leverage the acceleration characteristics of GPUs, which by essence involve processing large volumes of data [37,70]. In contrast, FPGAs do not necessarily follow the same principle, and can operate efficiently as fine-grained data-flow units, handling smaller data sets at a time.…”
Section: Discussionmentioning
confidence: 99%
“…In heterogeneous compute architectures, the overhead of transferring data (e.g., between host and graphics processing unit (GPU) memory) can still have a major impact on the overall performance, even when the latest state-of-the-art interconnection technologies are used such as NVLink-2 on the intra-node level 1,2 and InfiniBand EDR on the inter-node level. 1 For many data-intensive applications, scaling out to multiple nodes is the most feasible strategy to satisfy their resource demands.…”
Section: Introductionmentioning
confidence: 99%