2012 20th Euromicro International Conference on Parallel, Distributed and Network-Based Processing 2012
DOI: 10.1109/pdp.2012.23
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Branch and Bound on a CPU-GPU System

Abstract: International audienceHybrid implementation via CUDA of a branch and bound method for knapsack problems is proposed. Branch and bound computations can be carried out either on the CPU or on the GPU according to the size of the branch and bound list, i.e. the number of nodes. Tests are carried out on a Tesla C2050 GPU. A first series of computational results showing a substantial speedup is displayed and analyzed

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 22 publications
(32 citation statements)
references
References 14 publications
0
32
0
Order By: Relevance
“…We shall see in Section IV that the improvements we propose in this paper have permitted us to increase substantially speedup as compared with the one obtained in our previous work (see [13]). …”
Section: Implementation On a Cpu-gpu Systemmentioning
confidence: 60%
See 3 more Smart Citations
“…We shall see in Section IV that the improvements we propose in this paper have permitted us to increase substantially speedup as compared with the one obtained in our previous work (see [13]). …”
Section: Implementation On a Cpu-gpu Systemmentioning
confidence: 60%
“…The speedup obtained with this implementation is generally twice as much as the one in [13], where noncoalesced global memory accesses may occur in conditional part of codes, leading to poor efficiency.…”
Section: Computational Resultsmentioning
confidence: 94%
See 2 more Smart Citations
“…However, because of the irregularity of the B&B tree search, scheduling the load inside a GPU device is not fully compatible with the underlying SIMD programming model. In [10,16,39,7,34,11], implementations on a single GPU device are presented with respect to specific optimization problems, e.g., Flowshop, Knapsack and TSP (Traveling salesman). With respect to this paper, these studies are rather of limited interest since they are problem specific and do not consider a heterogeneous and large scale setting.…”
Section: Parallel Bandb With Gpusmentioning
confidence: 99%