2020
DOI: 10.1177/1094342020943652
|View full text |Cite
|
Sign up to set email alerts
|

Implementation and performance of Barnes-hut n-body algorithm on extreme-scale heterogeneous many-core architectures

Abstract: In this paper, we report the implementation and measured performance of our extreme-scale whole planetary ring simulation code on Sunway TaihuLight and two PEZY-SC2 systems: Shoubu System B and Gyoukou. The numerical algorithm is the parallel Barnes-Hut tree algorithm, which has been used in many large-scale astrophysical particle-based simulations. Our implementation is based on our FDPS framework. However, the extremely large numbers of cores of the systems used (10 M on TaihuLight and 16 M on Gyoukou) and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 20 publications
(33 reference statements)
0
2
0
Order By: Relevance
“…This means that grains which are far away from the point of interest can be clustered without a big lack in accuracy. A similar approximation is successfully applied by the Barns-Hut tree method which is used to solve N-body problems in astrophysics or electrodynamics [17,18]. The tree code reduces the theoretical run time to the order O(nlog(n)) and there are libraries available which implement this method.…”
Section: Demagnetization Fieldmentioning
confidence: 99%
“…This means that grains which are far away from the point of interest can be clustered without a big lack in accuracy. A similar approximation is successfully applied by the Barns-Hut tree method which is used to solve N-body problems in astrophysics or electrodynamics [17,18]. The tree code reduces the theoretical run time to the order O(nlog(n)) and there are libraries available which implement this method.…”
Section: Demagnetization Fieldmentioning
confidence: 99%
“…Multi‐core processor refers to the use of multiple processor cores on a chip, and each core performs the same or similar tasks, the whole chip acts as a unified structure to provide external services, output performance, to meet the needs of improving performance and achieving load balance at a minimum cost [3]. The future development of high‐performance computing will come from the development of multi‐core processor technology [4]. The system tends to be composed of multi‐core, and multi‐core forms a cluster hierarchically [5].…”
Section: Introductionmentioning
confidence: 99%