2020
DOI: 10.1186/s40537-020-00376-9
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of sort algorithms in Hadoop and PCJ

Abstract: Sorting algorithms are among the most commonly used algorithms in computer science and modern software. Having efficient implementation of sorting is necessary for a wide spectrum of scientific applications. This paper describes the sorting algorithm written using the partitioned global address space (PGAS) model, implemented using the Parallel Computing in Java (PCJ) library. The iterative implementation description is used to outline the possible performance issues and provide means to resolve them. The key … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 36 publications
0
2
0
Order By: Relevance
“…In this implementation only the single SealedObject is created for each PCJ thread and the object is reused for all iterations of the main loop. Full source code is available on GitHub 49 …”
Section: Performance Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this implementation only the single SealedObject is created for each PCJ thread and the object is reused for all iterations of the main loop. Full source code is available on GitHub 49 …”
Section: Performance Resultsmentioning
confidence: 99%
“…37,38 However, one could argue, that the implemented problem was not well-suited for Hadoop processing. The paper 39 presents the TeraSort benchmark used for measuring the performance of Hadoop clusters. For that problem, the PCJ implementation shows better or at least the same performance as the Apache Hadoop implementation, even with the data placement agnostic approach for reading data from the HDFS system.…”
Section: Use Casesmentioning
confidence: 99%