2015
DOI: 10.1007/s00366-014-0393-7
|View full text |Cite
|
Sign up to set email alerts
|

Efficient parallel optimization of volume meshes on heterogeneous computing systems

Abstract: being very popular. The shape of mesh elements significantly impacts the efficiency and accuracy of simulation codes. The problem of improving element quality in unstructured tetrahedral meshes and triangular surface meshes is classically framed as an optimization problem. In this framework, the positions of the mesh vertices are adjusted to optimize element quality. In this paper, we explore how modern computer hardware, meaning heterogeneous many-core systems, accelerates these optimization computations. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…Simplex in general is used for that purpose because it is such has a minimum number of points (n+1). One of these points in each iteration is dropped and a new point is added, thus a new simplex is defined [17]. Finally, replacing SSA with cooling simulated live search by annealing (DSSA) method rate is received.…”
Section: Nelder-mead Simplicial Heuristicmentioning
confidence: 99%
“…Simplex in general is used for that purpose because it is such has a minimum number of points (n+1). One of these points in each iteration is dropped and a new point is added, thus a new simplex is defined [17]. Finally, replacing SSA with cooling simulated live search by annealing (DSSA) method rate is received.…”
Section: Nelder-mead Simplicial Heuristicmentioning
confidence: 99%
“…end if (11) dtrsm( , right( )) or cublasDtrsm( , right( )) (12) dgemm( → ( +1) , right( ), rest( )) or cublasDgemm( → ( +1) , right( ), rest( )) (13) end for Algorithm 1: Heterogeneous parallel LU factorization algorithm based on a basic column block uniform allocation strategy for a multiple CPU/GPU system. Assume that all the basic column blocks of matrix A have been distributed to the corresponding processes and form submatrix S in process .…”
Section: Heterogeneous Parallel Lu Factorization Algorithmmentioning
confidence: 99%
“…However, heterogeneous computing systems have introduced new challenges to algorithm design and system software development because of the significantly different architectures and programming models of CPUs and GPUs. Conventional optimization techniques for CPUs may not work well in a heterogeneous system with multiple CPUs and GPUs [11,12]. Hence, it is necessary to present novel techniques to exploit the computing potentiality of heterogeneous computing.…”
Section: Introductionmentioning
confidence: 99%