2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2020
DOI: 10.1109/ipdpsw50202.2020.00124

Improving MPI Application Communication Time with an Introspection Monitoring Library

Abstract: In this report we describe how to improve the communication time of MPI parallel applications using a library that monitors MPI applications and allows for introspection (the program itself can query the state of the monitoring system). Based on previous work, this library can see how collective communications are decomposed into point-to-point messages. It also features monitoring sessions that allow suspending and restarting the monitoring, limiting it to specific portions of the code.…

Cited by 6 publications (3 citation statements)
References 18 publications
“…There have been many efforts to optimize MPI communication. For example, MPI point-to-point communication routines can be optimized by using more efficient primitives [18], or through the use of a library for monitoring MPI applications [19]. MPI collective communications can be optimized over wide-area networks by considering network details [20], or through a library like HPC-X [21] for offloading.…”
Section: Tuning of MPI Applications (mentioning)
confidence: 99%
“…Therefore, it is useful to enforce the mapping at runtime. For instance, if we see that in an application rank i and rank j communicate a lot, it is better to reorder the ranks such that the processes of rank i and j are close in the topology [30]. This might require to exchange some data.…”
Section: Rank Reordering (mentioning)
confidence: 99%
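
The rank reordering idea in the statement above can be illustrated with a minimal sketch. It is not the method of the cited paper or of the monitoring library: the function name `greedy_reorder`, the communication matrix `comm` (bytes exchanged between rank pairs, as monitoring might report them), and the parameter `cores_per_node` are all hypothetical, and the greedy grouping is just one simple heuristic chosen for illustration.

```python
def greedy_reorder(comm, cores_per_node):
    """Return new_rank, where new_rank[old] is the new rank of process `old`.

    Assumption: new ranks are laid out node by node, so new ranks
    k*cores_per_node .. (k+1)*cores_per_node - 1 share node k.
    `comm[i][j]` is the volume sent from rank i to rank j (hypothetical input).
    """
    n = len(comm)
    unplaced = set(range(n))
    order = []  # old ranks listed in their new order

    def traffic(r, others):
        # Total volume exchanged between rank r and the ranks in `others`.
        return sum(comm[r][p] + comm[p][r] for p in others)

    while unplaced:
        # Seed a node with the unplaced rank that has the most remaining traffic.
        seed = max(sorted(unplaced), key=lambda r: traffic(r, unplaced))
        node = [seed]
        unplaced.remove(seed)
        # Fill the node with the ranks that talk most to those already on it.
        while len(node) < cores_per_node and unplaced:
            best = max(sorted(unplaced), key=lambda r: traffic(r, node))
            node.append(best)
            unplaced.remove(best)
        order.extend(node)

    new_rank = [0] * n
    for new, old in enumerate(order):
        new_rank[old] = new
    return new_rank


# Ranks 0 and 3 exchange a lot of data, as do ranks 1 and 2.
comm = [
    [0, 1, 1, 100],
    [1, 0, 50, 1],
    [1, 50, 0, 1],
    [100, 1, 1, 0],
]
print(greedy_reorder(comm, cores_per_node=2))  # [0, 2, 3, 1]
```

In an MPI program, one way to apply such a permutation is to call `MPI_Comm_split` with the same color on every process and the new rank as the key, which yields a communicator whose ranks follow the key order. As the quoted statement notes, the application may then have to exchange data so that each process holds the data belonging to its new rank.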
“…Both proposals show improvements over default forms of mapping, however require profiling information. In turn, the work of [22] proposes online monitoring and rank remapping that provide improvements and does not need prior executions, however it still requires active modification of application code. Sparbit could be potentially coupled with these techniques, however its main advantage in comparison is that it works out of the box, providing significant improvements on communication time for theoretically any hierarchical network, and without need for topology information, additional communication or computation.…”
Section: Related Work (mentioning)
confidence: 99%