2019
DOI: 10.31577/cai_2019_4_817
|View full text |Cite
|
Sign up to set email alerts
|

Multilevel Algebraic Approach for Performance Analysis of Parallel Algorithms

Abstract: We provide a multilevel approach for analysing performances of parallel algorithms. The main outcome of such approach is that the algorithm is described by using a set of operators which are related to each other according to the problem decomposition. Decomposition level determines the granularity of the algorithm. A set of block matrices (decomposition and execution) highlights fundamental characteristics of the algorithm, such as inherent parallelism and sources of overheads.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 13 publications
(21 citation statements)
references
References 16 publications
0
21
0
Order By: Relevance
“…Our future works intends: to invest some effort to develop new algorithms able to improve granularity (e.g., algorithms for the matrix‐power kernel), to elaborate further on the issue related to the evaluation of the performance portability of KM block‐based algorithms on the computing systems which will respond to the new requirements of the incoming exascale era when computing environments will include very large MIMD systems, heterogeneous CPU‐GPU systems and combinations of both all equipped with standard scientific libraries as PETSc 43 and MAGMA 12 and to investigate the performance portability evaluation in a more general performance evaluation framework we developed and already applied in a few other works 18,19,44‐46 and we intend to continue to evolve. …”
Section: Discussionmentioning
confidence: 99%
“…Our future works intends: to invest some effort to develop new algorithms able to improve granularity (e.g., algorithms for the matrix‐power kernel), to elaborate further on the issue related to the evaluation of the performance portability of KM block‐based algorithms on the computing systems which will respond to the new requirements of the incoming exascale era when computing environments will include very large MIMD systems, heterogeneous CPU‐GPU systems and combinations of both all equipped with standard scientific libraries as PETSc 43 and MAGMA 12 and to investigate the performance portability evaluation in a more general performance evaluation framework we developed and already applied in a few other works 18,19,44‐46 and we intend to continue to evolve. …”
Section: Discussionmentioning
confidence: 99%
“…In this context, there is a need for a more formal analysis of the expected performances, that can help to understand the choices, identify bottlenecks, and drive directions of optimizations. Examples of relevant references concerning the performance analysis and prediction to support the design process can be found in articles 29‐32 …”
Section: Related Workmentioning
confidence: 99%
“…The authors define the upper limit as the maximum number of independent sub‐algorithms that can be executed simultaneously in the architecture and the lower limit is provided by the dependency degree. The performance prediction model proposed in Reference 13 was employed for the MGRIT (MUlti‐Grid In Time) algorithm 14 and a simplified version of the model was applied for a matrix‐multiply algorithm 15 …”
Section: Longest Common Subsequencementioning
confidence: 99%
“…We used the approach of Reference 13 (section 2.2) partially, with a focus on expressing the dependencies of the LCS problem and determining the complexity of the algorithm we employed to solve it. The LCS dependencies are shown in Equation ) and illustrated for the 4 × 4 case in Figure 3A,B.…”
Section: Proposed Architecturementioning
confidence: 99%
See 1 more Smart Citation