2015
DOI: 10.1142/s0129626415500048

A Framework for Efficient Execution of Data Parallel Irregular Applications on Heterogeneous Systems

Abstract: Exploiting the computing power of the diverse resources available on heterogeneous systems is mandatory, but it is a very challenging task. The diversity of architectures, execution models and programming tools, together with disjoint address spaces and different computing capabilities, raises a number of challenges that severely impact application performance and programming productivity. This problem is further compounded in the presence of data parallel irregular applications. This paper presents a framework…

Cited by 3 publications (3 citation statements); references 13 publications (24 reference statements).

“…Barbosa et al. [5,18] proposed allowing the developer to define the partition method for the data and to rely on a performance model and scheduler to dice tasks into smaller ones when needed. This mechanism can be leveraged to enable the interweaving of simulation and visualization tasks, primarily because they operate in different time intervals, t and t − 1, respectively.…”
Section: Related Work
confidence: 99%
“…The solution gives rise to a granularity of τ_v: if the workload of τ_v, W(τ_v), is too large, it will delay the execution of the subsequently scheduled τ_s; on the other hand, if W(τ_v) is too small, it will increase the scheduling overhead. To address this problem, we follow the approach proposed by Barbosa et al. [5,18], referred to as dicing. The dicing strategy enables the developer to define a generic workload applied to a partition created by the scheduler using a developer-defined partitioning method called dice.…”
Section: Our Approach
confidence: 99%
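
The dicing mechanism described in these excerpts can be illustrated with a minimal sketch: a developer-supplied dice (partition) function, a performance-model estimate W(τ), and a scheduler that keeps dicing a task while its estimated workload exceeds a threshold. The names and types below (Task, dice, workload_estimate, dice_threshold) are illustrative assumptions, not the framework's actual API.

// Minimal sketch of the dicing idea: the developer defines how a partition is
// split, and the scheduler decides when to split based on an estimated workload.
#include <cstddef>
#include <deque>
#include <iostream>
#include <vector>

struct Task {
    std::size_t begin; // first index of the data partition
    std::size_t end;   // one past the last index
};

// Developer-defined dice: split a partition into two halves.
std::vector<Task> dice(const Task& t) {
    std::size_t mid = t.begin + (t.end - t.begin) / 2;
    return { {t.begin, mid}, {mid, t.end} };
}

// Stand-in for a performance-model estimate W(task); here simply proportional to size.
double workload_estimate(const Task& t) {
    return static_cast<double>(t.end - t.begin);
}

// Scheduler loop: dice tasks until every task fits the threshold, then "execute" them.
void schedule(Task root, double dice_threshold) {
    std::deque<Task> queue{root};
    while (!queue.empty()) {
        Task t = queue.front();
        queue.pop_front();
        if (workload_estimate(t) > dice_threshold && t.end - t.begin > 1) {
            for (const Task& sub : dice(t)) queue.push_back(sub);
        } else {
            std::cout << "run partition [" << t.begin << ", " << t.end << ")\n";
        }
    }
}

int main() {
    schedule({0, 1000}, /*dice_threshold=*/300.0);
}

With these assumed numbers, the root task of 1000 items is diced twice, producing four partitions of 250 items each, which is the kind of granularity control the citing authors rely on to keep τ_v from delaying τ_s.
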
“…Furthermore, different architectures usually exhibit different execution and programming models and are deployed with different programming languages and development tools, severely impacting both code and performance portability. Additionally, the application's workload has to be distributed and balanced among the multiple devices and, within each device, among its multiple computing units; this leads to multilevel scheduling, which must be handled effectively in order to achieve acceptable performance levels [1]. To efficiently use the available resources in these and future systems, algorithms and software packages have to be revisited and re-evaluated to assess their adequacy for these environments.…”
Section: Introduction
confidence: 99%
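
The multilevel scheduling mentioned in this excerpt can be sketched as a two-step split: the total workload is first divided across devices, here in proportion to an assumed relative performance, and each device's share is then divided across its compute units. The Device struct and the proportional heuristic are illustrative assumptions, not the policy of the cited framework.

// Two-level workload split: across devices, then across each device's compute units.
#include <cstddef>
#include <iostream>
#include <vector>

struct Device {
    const char* name;
    double relative_perf;      // assumed throughput ratio between devices
    std::size_t compute_units; // number of units inside the device
};

int main() {
    const std::size_t total_items = 1'000'000;
    std::vector<Device> devices = { {"CPU", 1.0, 8}, {"GPU", 4.0, 32} };

    double perf_sum = 0.0;
    for (const Device& d : devices) perf_sum += d.relative_perf;

    for (const Device& d : devices) {
        // First level: device share proportional to its relative performance.
        std::size_t device_share =
            static_cast<std::size_t>(total_items * d.relative_perf / perf_sum);
        // Second level: split the device share across its compute units.
        std::size_t per_unit = device_share / d.compute_units;
        std::cout << d.name << ": " << device_share << " items total, ~"
                  << per_unit << " per compute unit\n";
    }
}

Under these assumed ratios, the GPU would receive 800,000 items and the CPU 200,000, each then subdivided among that device's compute units, which is the balancing problem the citing authors refer to as multilevel scheduling.
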