Abstract-To exploit the capabilities of current and future systems, developers must understand the interplay between onnode performance, domain decomposition, and an application's intrinsic communication patterns. While tools exist to gather and analyze data for each of these components individually, the resulting information is generally processed in isolation and presented in an abstract, categorical fashion unintuitive to most users. In this paper we present the HAC model, in which we identify the three domains of performance data most familiar to the user: (i) the application domain containing the application's working set, (ii) the hardware domain of the compute and network devices, and (iii) the communication domain of logical data transfers.We show that taking data from each of these domains and projecting, visualizing, and correlating it to the other domains can give valuable insights into the behavior of parallel application codes. The HAC abstraction opens the door for a new generation of tools that can help users more easily and intuitively associate performance data with root causes in the hardware system, the application's structure, and in its communication behavior, and by doing so leads to an improved understanding of the performance of their codes.
Keywords-Performance Analysis and Visualization
I. MOTIVATIONBy the end of this decade, exascale machines are expected to support 500 million to 4 billion concurrent tasks. At the same time the complexity of both architectures and applications will increase significantly, leading to often unexpected and unpredictable interactions between applications, middleware, and underlying hardware. In order to exploit such machines fully, programmers must be able to measure, analyze, and understand the behavior of their applications and its corresponding impact on the overall performance in great detail.Most current tools gather and present performance data in the native hardware or MPI rank domain, relying on the user to infer causal relationships and interdependencies. However, neither of these two domains is intuitive or familiar to the users. As a result, data appear as abstract or highly discrete, and domain and performance experts must expect