Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '94 1994
DOI: 10.1145/181014.181081
|View full text |Cite
|
Sign up to set email alerts
|

Experiences with parallel N-body simulation

Abstract: AbstractÐThis paper describes our experiences developing high-performance code for astrophysical N-body simulations. Recent N-body methods are based on an adaptive tree structure. The tree must be built and maintained across physically distributed memory; moreover, the communication requirements are irregular and adaptive. Together with the need to balance the computational work-load among processors, these issues pose interesting challenges and tradeoffs for high-performance implementation. Our implementation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

1996
1996
2007
2007

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(26 citation statements)
references
References 25 publications
(42 reference statements)
0
26
0
Order By: Relevance
“…We also implemented the ORB version in a manner similar to that reported in [5,12], and found no significant performance differences with our costzones approach. Instead, using costzones allowed us to make easier comparisons with the CC-SAS implementation of the N-Body problem.…”
Section: N-body Problemmentioning
confidence: 89%
“…We also implemented the ORB version in a manner similar to that reported in [5,12], and found no significant performance differences with our costzones approach. Instead, using costzones allowed us to make easier comparisons with the CC-SAS implementation of the N-Body problem.…”
Section: N-body Problemmentioning
confidence: 89%
“…In order to address computationally intensive problems, NOWs have been used in a wide range of fields including large-scale databases, scientific computations, computer graphics, multimedia, wave propagation predictions, and telecommunication systems simulations [18,62,54,56,2,39,71,59]. …”
Section: Related Workmentioning
confidence: 99%
“…Here, the movement of a set of particles is simulated under the influence of gravitational, electrostatic and Vander Waals attractions [69,10,36]. Two prevalent forms of the N -body problem known as the Barnes-Hut and the Fast multi-pole (FMM) methods have been implemented using message passing and shared-memory architectures [11,31,54,74,47,57,37]. In this context, computation-partitioning and workload-balancing scheduling approaches have been proposed in [69].…”
Section: Related Workmentioning
confidence: 99%
“…Historically, the first decomposition proposed for Nbody algorithms is the Orthogonal Recursive Bisection (ORB) for the Barnes-Hut algorithm [8,14] (which can be extended to the FMM): the computation space is recursively halved along one dimension. Other decompositions have then been introduced: they rely directly on the octree space decomposition thanks to space-filling curves and cost functions.…”
Section: Introductionmentioning
confidence: 99%
“…The costzones decomposition has been shown to be more efficient that ORB on common address space architecture in [13] (multi-thread mode) and similar decompositions, based on Morton or Hilbert ordering, have also been prefered to ORB on distributed memory architecture with message passing [12,15] (multi-process mode). Moreover, in order to obtain efficient parallelization in message passing environment, communications have to be overlaped with computation, small messages have to be aggregated into bigger ones (large-grain communications), communications have to ordered so as to avoid contention, and sender-driven communications have to be prefered to receiver-initiated communications [7,8,12,15].…”
Section: Introductionmentioning
confidence: 99%