2016
DOI: 10.1002/cpe.4019
|View full text |Cite
|
Sign up to set email alerts
|

IVM‐based parallel branch‐and‐bound using hierarchical work stealing on multi‐GPU systems

Abstract: International audienceTree-based exploratory methods, like Branch-and-Bound (B&B) algorithms, are highly irregular applications which makes their design and implementation on graphics processing unit (GPU) challenging. In this paper, we present a multi-GPU B&B algorithm for solving large permutation-based combinatorial optimization problems. To tackle the problem of the irregular workload, we propose a hierarchical work stealing (WS) strategy that balances the workload inside the GPU and between different GPUs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
20
0

Year Published

2017
2017
2025
2025

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(20 citation statements)
references
References 26 publications
0
20
0
Order By: Relevance
“…As these algorithms are compute-intensive, diverse strategies have been used for improving performance, such as instruction-level parallelism, architecturespecific code optimizations and problem-specific data structures [6,12,14,17]. Thus, parallel tree-based search algorithms are frequently written in C/C++, due to their low-level features and supported parallel computing libraries [5].…”
Section: Tree-based Search Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…As these algorithms are compute-intensive, diverse strategies have been used for improving performance, such as instruction-level parallelism, architecturespecific code optimizations and problem-specific data structures [6,12,14,17]. Thus, parallel tree-based search algorithms are frequently written in C/C++, due to their low-level features and supported parallel computing libraries [5].…”
Section: Tree-based Search Algorithmsmentioning
confidence: 99%
“…Tree-based search algorithms are compute-intensive and highly irregular, which demands hand-optimized data structures for efficient search and load balancing [6,14,17]. Thus, high-productivity languages are not often employed within the scope of tree search, as they historically suffer from severe performance penalties [11].…”
Section: Introductionmentioning
confidence: 99%
“…For instance, a performance comparison is available in [12] between multi-core and many-core systems by solving big optimization problems with a Branch-andBound algorithm. Another branch-and-bound implementation is described in [8] using multi-GPU systems. While the previous papers are related to multi-CPU systems on one hand and to multi-GPU systems on another hand, [13] implements a Branch-and-Bound for heterogeneous architectures (both multi-CPU systems with GPU accelerators).…”
Section: Multithreaded Branch-and-boundmentioning
confidence: 99%
“…[6]), and several authors investigated sophisticated approaches to take advantage of various architectures (e.g. [7], [8]). Other authors proposed approaches to provide a more general framework to design such parallel Branch-and-Bound algorithms (e.g.…”
mentioning
confidence: 99%
“…Paper [4] presents a novel B&B algorithm for solving large permutation-based combinatorial optimization problems. A hierarchical work stealing strategy was proposed, in order to tackle the problems of the irregular workload, as well as to balance the workload inside the GPU and between different GPUs and CPU cores.…”
mentioning
confidence: 99%