Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis 2009
DOI: 10.1145/1654059.1654129

Scalable implicit finite element solver for massively parallel processing with demonstration to 160K cores

Abstract: Implicit methods for partial differential equations using unstructured meshes allow for an efficient solution strategy for many real-world problems (e.g., simulation-based virtual surgical planning). Scalable solvers employing these methods not only enable solution of extremely large practical problems but also lead to dramatic compression in time-to-solution. We present a parallelization paradigm and associated procedures that enable our implicit, unstructured flow-solver to achieve strong scalability. We consi…
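The abstract's central claim is strong scalability: for a fixed problem size, time-to-solution shrinks in proportion to the core count. A minimal sketch of how that is usually quantified is below; the core counts and wall-clock times are hypothetical placeholders, not results reported in the paper.

```c
#include <stdio.h>

/* Strong-scaling figures of merit for a fixed-size problem:
 *   speedup(p)    = T(p_ref) / T(p)           (relative to the reference run)
 *   efficiency(p) = speedup(p) * p_ref / p    (1.0 = ideal strong scaling)
 * The (cores, seconds) pairs below are hypothetical placeholders. */
int main(void) {
    const int    cores[]   = { 4096, 32768, 163840 };  /* hypothetical core counts */
    const double seconds[] = { 512.0, 66.0, 14.5 };    /* hypothetical wall-clock times */
    const int    n = 3;

    const int    p_ref = cores[0];
    const double t_ref = seconds[0];

    for (int i = 0; i < n; ++i) {
        double speedup    = t_ref / seconds[i];
        double efficiency = speedup * (double)p_ref / (double)cores[i];
        printf("%7d cores: %7.1f s  speedup %7.2fx  efficiency %5.1f%%\n",
               cores[i], seconds[i], speedup, 100.0 * efficiency);
    }
    return 0;
}
```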

Cited by 50 publications (32 citation statements)
References 17 publications
“…Since there is little computation performed during mesh adaptation relative to the substantial increase in communication required as the given mesh is distributed to more processors, the scaling decreases at high core counts (note that a strong scaling study is performed and, therefore, the problem size is fixed). However, the analysis has been shown to scale strongly when a similar amount of workload is provided [21,41]. The fact that the mesh modification routines are able to scale on larger core counts, with more entities involved in communication, supports the statement that they are likely to provide at a minimum the equivalent scaling with more workload on the same number of parts.…”
Section: Parallel Anisotropic Boundary Layer Adaptivity on a Heat Tra…
confidence: 62%
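The scaling argument in the excerpt above rests on how quickly per-part computation shrinks relative to inter-part communication as a fixed mesh is split across more parts. A rough back-of-the-envelope model (my own illustration, not taken from [21,41]) treats per-part work as proportional to the elements owned and communication as proportional to the part surface, which for a roughly isotropic 3D partition grows like (N/p)^(2/3):

```c
#include <stdio.h>
#include <math.h>

/* Rough surface-to-volume model of strong scaling on a fixed 3D mesh:
 *   work per part          ~ N / p           (owned elements)
 *   communication per part ~ (N / p)^(2/3)   (shared boundary entities)
 *   comm/work ratio        ~ (p / N)^(1/3)   (grows as parts get smaller)
 * N and the part counts are hypothetical; the exponents assume a roughly
 * isotropic partition of a 3D mesh. */
int main(void) {
    const double N = 1.0e9;                        /* hypothetical element count */
    const long   parts[] = { 4096, 32768, 163840 };

    for (int i = 0; i < 3; ++i) {
        double work = N / (double)parts[i];
        double comm = pow(work, 2.0 / 3.0);
        printf("%7ld parts: %10.0f elems/part, ~%8.0f boundary entities, comm/work %.4f\n",
               parts[i], work, comm, comm / work);
    }
    return 0;
}
```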
“…Both of these work types can be equally divided among the processors by partitioning the aggregate mesh into equal-load parts [12,13]. So far, PHASTA is a pure MPI-based code, and each process executes a copy of the analysis code to handle the computation work and interactions corresponding to its mesh part.…”
Section: Parallelization
confidence: 99%
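As a concrete illustration of the one-mesh-part-per-MPI-process organization described in the excerpt above, the sketch below shows a generic pure-MPI skeleton: each rank handles its own part's work and joins a collective reduction for a global residual norm. The workload and naming are hypothetical stand-ins, not PHASTA's actual interfaces.

```c
#include <mpi.h>
#include <stdio.h>
#include <math.h>

/* Generic pure-MPI skeleton: one mesh part per rank (hypothetical stand-in,
 * not PHASTA's actual code). Each rank computes a local residual contribution
 * and all ranks combine it with a collective reduction. */
int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank, nparts;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nparts);

    /* Each rank would read its own pre-partitioned mesh part, e.g. "part.<rank>"
     * (hypothetical naming scheme). Here a per-part workload is faked instead. */
    long owned_elements = 1000000;            /* hypothetical equal-load parts */
    double local_sq = 0.0;
    for (long e = 0; e < owned_elements; ++e)
        local_sq += 1.0e-12;                  /* stand-in for element residual work */

    /* Global residual norm: the only coupling in this sketch is a collective. */
    double global_sq = 0.0;
    MPI_Allreduce(&local_sq, &global_sq, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

    if (rank == 0)
        printf("parts = %d, global residual norm = %.6e\n", nparts, sqrt(global_sq));

    MPI_Finalize();
    return 0;
}
```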
“…The more complex message-passing parallel model employs peer-to-peer (p2p) message exchanges among processors and takes advantage of overlapping communication and computation. Following [4], the point-to-point (p2p) communication strategy is based on a master-slave relationship between processors. This relationship is established by creating a hierarchy based on host partition numbers.…”
Section: EdgeCFD: The Benchmark Software
confidence: 99%
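The overlap-friendly point-to-point pattern the excerpt describes is commonly realized with nonblocking sends and receives: post the exchanges for shared (master/slave) values, advance the communication-independent interior work, then complete the messages before touching boundary data. The sketch below is a generic illustration of that pattern, not EdgeCFD's implementation; the neighbor lists, buffer sizes, and callback functions are hypothetical, and the routine is meant to be called between MPI_Init and MPI_Finalize.

```c
#include <mpi.h>
#include <stdlib.h>

/* Generic nonblocking point-to-point exchange with communication/computation
 * overlap (illustrative only; neighbor lists and counts are hypothetical). */
void exchange_and_compute(int nneigh, const int *neighbors, int count,
                          double **send_buf, double **recv_buf,
                          void (*compute_interior)(void),
                          void (*compute_boundary)(void)) {
    MPI_Request *reqs = malloc(2 * nneigh * sizeof(MPI_Request));

    /* 1. Post receives for copies owned by neighboring parts. */
    for (int i = 0; i < nneigh; ++i)
        MPI_Irecv(recv_buf[i], count, MPI_DOUBLE, neighbors[i], 0,
                  MPI_COMM_WORLD, &reqs[i]);

    /* 2. Post sends of the values this part owns (the "master" side). */
    for (int i = 0; i < nneigh; ++i)
        MPI_Isend(send_buf[i], count, MPI_DOUBLE, neighbors[i], 0,
                  MPI_COMM_WORLD, &reqs[nneigh + i]);

    /* 3. Overlap: interior work needs no off-part data. */
    compute_interior();

    /* 4. Complete all messages, then finish the boundary work. */
    MPI_Waitall(2 * nneigh, reqs, MPI_STATUSES_IGNORE);
    compute_boundary();

    free(reqs);
}
```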