2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis 2008
DOI: 10.1109/sc.2008.5215501

High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors

Abstract: SPECFEM3D_GLOBE is a spectral-element application enabling the simulation of global seismic wave propagation in 3D anelastic, anisotropic, rotating and self-gravitating Earth models at unprecedented resolution. A fundamental challenge in global seismology is to model the propagation of waves with periods between 1 and 2 seconds, the highest frequency signals that can propagate clear across the Earth. These waves help reveal the 3D structure of the Earth's deep interior and can be compared to seismographic reco…

Cited by 45 publications (49 citation statements)
References 14 publications (22 reference statements)

“…The insight provided by these two simple experiments is that, while a read-only access pattern can be easily scaled across multiple sockets relying on the native memory pipelining units, more sophisticated patterns that put pressure on the cache-coherency protocol require an innovative algorithmic solution. [Footnote 2: lockb is the basic instruction to implement atomic operations on x86 processors, so the results can be directly generalized to other primitives such as read-and-or, that we use in the graph exploration.]…”
Section: System Architecture and Experimental Platforms
Citation type: mentioning
confidence: 99%
“…This is a complex task because there are different types of locality: hierarchical networks, distributed, shared and local caches, local and non-local memory, and various cache-coherence effects. In an ideal spectrum of parallel applications, those that have high locality, rely on limited communication and closely match the cache hierarchy, can readily take advantage of existing multicore processors and accelerators [1], [2]. At the other end of the spectrum there are problems, such as graph exploration, with very little data reuse and a random access pattern.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…Table 1 provides an overview of investigated application classes, their test cases, and a short description of the respective data access patterns. In detail, we analyzed the complex applications WRF [17], SPECFEM3D_GLOBE [7], MILC [4] and LAMMPS [15], representing the fields of weather simulation, seismic wave propagation, quantum chromodynamics and molecular dynamics. We also included existing parallel computing benchmarks and mini-apps, such as the NAS [19], the Sequoia benchmarks as well as the Mantevo mini apps [10].…”
Section: Representative Communication Data Access Patterns
Citation type: mentioning
confidence: 99%
“…It is used on some of the biggest HPC systems available [7]. Grid points that lie on the sides, edges or corners of an element are shared between neighboring elements.…”
Section: Exchange Of Unstructured Elements
Citation type: mentioning
confidence: 99%
“…On the global scale, high-frequency body waves are routinely observed at propagation distances of 1000-2000 wavelengths and more, a regime that is hardly accessible with current global 3-D solvers and computers (Carrington et al 2008). While the methods we propose for parameter optimization are completely general and the coarse-grained memory variable approach applicable to all high-order finite-element methods, we use the axisymmetric SEM AxiSEM introduced by Nissen-Meyer et al (2007a,b, 2008), further developed to include anisotropy by and published open source by Nissen-Meyer et al (2014) as an example implementation to test our theoretical arguments.…”
Section: Introduction
Citation type: mentioning
confidence: 99%