2012 IEEE 10th International Symposium on Parallel and Distributed Processing With Applications 2012
DOI: 10.1109/ispa.2012.96
|View full text |Cite
|
Sign up to set email alerts
|

Towards the Dynamic Load Balancing on Heterogeneous Multi-GPU Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(14 citation statements)
references
References 11 publications
0
14
0
Order By: Relevance
“…Most powerful platforms for a wide class of Many parallel programs embrace the concept of many-core processors, which can be accomplished via Compiler based, Application Programming Interface (API) such as OpenMP parallel programming. In the other hand multi-GPUs can be interfaced to on-chip multicores as efficient accelerators to increase system-level computational performance and improve system responsiveness [1,2]. For instance, the NVIDIA Tesla S1070 GPU [20] consists of (30×4) multiprocessors, where each multiprocessor contains 8 cores, resulting in an aggregate number of 960 cores inside a single GPU.…”
Section: Introductionmentioning
confidence: 99%
“…Most powerful platforms for a wide class of Many parallel programs embrace the concept of many-core processors, which can be accomplished via Compiler based, Application Programming Interface (API) such as OpenMP parallel programming. In the other hand multi-GPUs can be interfaced to on-chip multicores as efficient accelerators to increase system-level computational performance and improve system responsiveness [1,2]. For instance, the NVIDIA Tesla S1070 GPU [20] consists of (30×4) multiprocessors, where each multiprocessor contains 8 cores, resulting in an aggregate number of 960 cores inside a single GPU.…”
Section: Introductionmentioning
confidence: 99%
“…Most powerful platforms for a wide class of many parallel programs embrace the concept of manycore processors, which can be accomplished via Compiler based, Application. Multi-GPUs can be interfaced to on-chip multicores as efficient accelerators to increase system-level computational performance and improve system responsiveness [1,2]. For instance, the NVIDIA Tesla S1070 GPU [20] consists of (30×4) multiprocessors, where each multiprocessor contains 8 cores, resulting in an aggregate number of 960 cores inside a single GPU.…”
Section: Introductionmentioning
confidence: 99%
“…They hid the other details of the possible differences between processors. In [1], Acosta et al worked in CPU-GPU. They also measured the processing time.…”
Section: Introductionmentioning
confidence: 99%
“…It is not clear whether this technique is employed in [1] and [7], but it is used in all the other ones mentioned above. Another technique is asymmetry load balancing, which means tasks are better distributed to the processors according to their capabilities, instead of distributing to them evenly.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation