Numerical analysis of heat source/sink on peristalsis of MHD carbon-water nanofluid in symmetric channel with permeable space

Summary: The 3‐P challenge of high‐performance programming (performance, portability and productivity) has become more difficult than ever in the age of heterogeneous computing. It would be naïve to think that the performance portability problem can be completely solved, but it can certainly be reduced and made tolerable. However, first and foremost, an agreement is needed on what it means for an application to be performance portable. Unfortunately, there is still no consensus in the scientific community on a workable definition of the term performance portability. Several years ago, a comprehensive effort was made to formulate a novel definition of performance portability and an associated metric. Since the new metric was first introduced, it has been widely adopted by the scientific community, and many advanced studies have used it. Unfortunately, the definition of the new metric has flaws. This article presents a proof of the theoretical flaws in the definition of the new metric, considers the practical implications of these flaws as reflected in many studies that have used it in recent years, and proposes a revised metric that addresses the flaws and provides guidelines on how to use it correctly.


Parallel computing on any desktop

Marowka

2007

Commun. ACM


Python accelerators for high-performance computing

Marowka

2017

J Supercomput


Extending Amdahl's Law for Heterogeneous Computing

Marowka

2012


Back to Thin-Core Massively Parallel Processors

Marowka

2011

Computer


Think Parallel: Teaching Parallel Programming Today

Marowka

2008

IEEE Distrib. Syst. Online


On parallel software engineering education using python

Marowka

2017

Educ Inf Technol


OpenMP‐oriented applications for distributed shared memory architectures

Marowka

Liu

Chapman

2004

Concurrency and Computation


Summary: The rapid rise of OpenMP as the preferred parallel programming paradigm for small-to-medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model-of-choice for large scale high-performance parallel computing in the coming decade. The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc-NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance. This paper presents a simple software programming approach called copy-inside-copy-back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automatic and implicit approaches. Moreover, the CC approach improves on the OpenMP-SPMD style of programming that makes the development process of an OpenMP application more structured and simpler. The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse-grained parallelism, although a fast copy mechanism is essential.


Ami Marowka

Reformulation of the performance portability metric
