Vitus J. Leung scite author profile

Abstract-The cost of data movement has always been an important concern in high performance computing (HPC) systems. It has now become the dominant factor in terms of both energy consumption and performance. Support for expression of data locality has been explored in the past, but those efforts have had only modest success in being adopted in HPC applications for various reasons. However, with the increasing complexity of the memory hierarchy and higher parallelism in emerging HPC systems, locality management has acquired a new urgency. Developers can no longer limit themselves to low-level solutions and ignore the potential for productivity and performance portability obtained by using locality abstractions. Fortunately, the trend emerging in recent literature on the topic alleviates many of the concerns that got in the way of their adoption by application developers. Data locality abstractions are available in the forms of libraries, data structures, languages and runtime systems; a common theme is increasing productivity without sacrificing performance. This paper examines these trends and identifies commonalities that can combine various locality concepts to develop a comprehensive approach to expressing and managing data locality on future large-scale high-performance computing systems.

show abstract

Designing Contamination Warning Systems for Municipal Water Networks Using Imperfect Sensors

Berry

Carr

Hart

et al. 2009

J. Water Resour. Plann. Manage.

View full text Add to dashboard Cite

Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning

Tuncer

Ateş

Zhang

et al. 2019

IEEE Trans. Parallel Distrib. Syst.

View full text Add to dashboard Cite

Exploiting Geometric Partitioning in Task Mapping for Parallel Computers

Deveci

Rajamanickam

Leung

et al. 2014

View full text Add to dashboard Cite

Abstract-We present a new method for mapping applications' MPI tasks to cores of a parallel computer such that communication and execution time are reduced. We consider the case of sparse node allocation within a parallel machine, where the nodes assigned to a job are not necessarily located within a contiguous block nor within close proximity to each other in the network. The goal is to assign tasks to cores so that interdependent tasks are performed by "nearby" cores, thus lowering the distance messages must travel, the amount of congestion in the network, and the overall cost of communication.Our new method applies a geometric partitioning algorithm to both the tasks and the processors, and assigns task parts to the corresponding processor parts. We show that, for the structured finite difference mini-app MiniGhost, our mapping method reduced execution time 34% on average on 65,536 cores of a Cray XE6. In a molecular dynamics mini-app, MiniMD, our mapping method reduced communication time by 26% on average on 6144 cores. We also compare our mapping with graph-based mappings from the LibTopoMap library and show that our mappings reduced the communication time on average by 15% in MiniGhost and 10% in MiniMD.

show abstract

Diagnosing Performance Variations in HPC Applications Using Machine Learning

Tuncer

Ateş

Zhang

et al. 2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Vitus J. Leung

Trends in Data Locality Abstractions for HPC Systems

Designing Contamination Warning Systems for Municipal Water Networks Using Imperfect Sensors

Online Diagnosis of Performance Variation in HPC Systems Using Machine Learning

Exploiting Geometric Partitioning in Task Mapping for Parallel Computers

Diagnosing Performance Variations in HPC Applications Using Machine Learning

Contact Info

Product

Resources

About