A Loosely Coordinated Model for Heap-Based Priority Queues in Multicore Environments

Laccetti, Giuliano; Lapegna, Marco; Mele, Valeria

doi:10.1007/s10766-015-0398-x

Cited by 14 publications

(10 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Such results confirm our expectation of acceptable overhead of the redistribution procedure on the efficiency of the algorithm, because there are not global synchronization among the threads. Only families F (3) , F (5) , and F (6) show values for the efficiency E N < 0.7 with N = 16 threads, because the presence of features (the peak in the corner or the discontinuity) needing a more frequent redistribution of sub-domains. A last set of experiment is aimed to measure the effectiveness of the GPU as floating point accelerator in the hybrid Algorithm 2 as described in the previous section.…”

Section: Test Resultsmentioning

confidence: 99%

“…More precisely, the families F (2) and F (3) show a performance gain of about 5× because their analytic expressions are based only on floating point operations, without the use of trigonometric or exponential functions as for the families F (1) , F (4) , and F (5) . The worst case is represented by the family F (6) with a performance gain of about 1.8× because the thread divergence due to the presence of the selection structure in its analytic expression, that greatly limits the GPU performance when threads follow different paths in the control flow.…”

Section: Test Resultsmentioning

confidence: 99%

“…In a previous work, we have shown that, in the parallelization process of Algorithm 1 in a shared memory environment, two strategies are available: a first approach is based on a centralized data structure for storing the sub‐domains, where several global synchronizations are needed among all threads in order to process only sub‐domains with large error, while in the second approach the sub‐domains are distributed in separate data structures managed by different threads, without synchronizations but with a significant risk that some of them process items with small error. Let us recall shortly the strategy on the basis of an effective trade off between the previous two requirements.…”

Section: Adaptive Algorithms For Hybrid Nodesmentioning

confidence: 99%

“…For example, our previous work introduced an algorithm based on two adaptive strategies on MPP systems with multi‐core CPU, combining together two different programming models for distributed memory and shared memory environments.…”

Section: Related Workmentioning

confidence: 99%

“…The work presented in the paper is the continuation of our investigations on the development of adaptive algorithm on modern hardware platforms (see, eg, our previous works), where we focused our attention mainly on computing systems based on multi‐core CPUs only. Some preliminary results on a hybrid system are reported in another previous work .…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

An adaptive algorithm for high‐dimensional integrals on heterogeneous CPU‐GPU systems

Laccetti

Lapegna

Mele

et al. 2018

Concurrency and Computation

Self Cite

View full text Add to dashboard Cite

Summary In this paper, we introduce an adaptive procedure for the numerical computation of a high‐dimensional integrals on HPC systems with heterogeneous nodes composed of multi‐core CPU and GPU devices. To this aim, we have integrated together two different approaches: a first one is in charge of a fair workload among the threads running on the multi‐core CPU, while a second one is in charge of an efficient execution of the computational kernels on the GPU. We tested the resulting algorithm on several test functions on a system where the nodes are provided with two Intel ten‐core CPU and one NVIDIA GPU device.

show abstract

Section: Test Resultsmentioning

confidence: 99%

Section: Test Resultsmentioning

confidence: 99%

Section: Adaptive Algorithms For Hybrid Nodesmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

An adaptive algorithm for high‐dimensional integrals on heterogeneous CPU‐GPU systems

Laccetti

Lapegna

Mele

et al. 2018

Concurrency and Computation

Self Cite

View full text Add to dashboard Cite

show abstract

Vessel to shore data movement through the Internet of Floating Things: A microservice platform at the edge

Luccio

Kosta

Castiglione

et al. 2020

Concurrency and Computation

View full text Add to dashboard Cite

Summary The rise of the Internet of Things has generated high expectations about the improvement in people's lifestyles. In the last decade, we saw several examples of instrumented cities where different types of data were gathered, processed, and made available to inspire the next generation of scientists and engineers. In this framework, sensors and actuators became leading actors of technologically pervasive urban environments. However, in coastal areas, marine data crowdsourcing is difficult to apply due to the challenging operational conditions, extremely unstable network connectivity, and security issues in data movement. To fill this gap, we present a novel version of our DYNAMO transfer protocol (DTP), a platform‐independent data mover framework where data collected on board of vessels are stored locally and then moved from the edge to the cloud when the operating conditions are favorable. We evaluate the performance of DTP in a controlled environment with a private cloud by measuring the time it takes for the clouds ide to process and store a fixed amount of data while varying the number of microservice instances. We show that the time decreases exponentially when the number of microservice instances goes from 1 to 16 and it remains constant above that number.

show abstract

Performance Evaluation for a PETSc Parallel-in-Time Solver Based on the MGRIT Algorithm

Mele

Romano

Constantinescu

et al. 2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

A Loosely Coordinated Model for Heap-Based Priority Queues in Multicore Environments

Cited by 14 publications

References 30 publications

An adaptive algorithm for high‐dimensional integrals on heterogeneous CPU‐GPU systems

An adaptive algorithm for high‐dimensional integrals on heterogeneous CPU‐GPU systems

Vessel to shore data movement through the Internet of Floating Things: A microservice platform at the edge

Performance Evaluation for a PETSc Parallel-in-Time Solver Based on the MGRIT Algorithm

Contact Info

Product

Resources

About