2011
DOI: 10.15669/pnst.2.643
|View full text |Cite
|
Sign up to set email alerts
|

Development of a High-Performance Eigensolver on a Peta-Scale Next-Generation Supercomputer System

Abstract: For current supercomputer systems, multicore and multisocket processors are required in order to build a system, and choice of interconnection is essential. In addition, for effective development of new code, high-performance, scalable, and reliable numerical software is key. ScaLAPACK and PETSc are software developed for distributed memory parallel computer systems. Real computation requires software that is highly tuned for implementation on new architectures, such as many-core processors.In the present stud… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 49 publications
(37 citation statements)
references
References 8 publications
(8 reference statements)
0
37
0
Order By: Relevance
“…A narrow-band reduction algorithm coupled with a band divide and conquer approach was also described and successfully implemented in the new EigenExa library. [21,25] If orthogonality is not an issue then the eigenvectors can also be computed as the null spaces of shifted banded matrices [56,57]. Finally, even the tridiagonalization of the banded matrix (steps (IIIb) and (Va)) may be subdivided further [43,44,58].…”
Section: The Eigenvalue Problemmentioning
confidence: 99%
See 1 more Smart Citation
“…A narrow-band reduction algorithm coupled with a band divide and conquer approach was also described and successfully implemented in the new EigenExa library. [21,25] If orthogonality is not an issue then the eigenvectors can also be computed as the null spaces of shifted banded matrices [56,57]. Finally, even the tridiagonalization of the banded matrix (steps (IIIb) and (Va)) may be subdivided further [43,44,58].…”
Section: The Eigenvalue Problemmentioning
confidence: 99%
“…[25,21] As mentioned above, this library incorporates a narrow-band reduction and a band divide and conquer method, thus modifying steps (III)-(V) above. Another very recent benchmark [74] for the K computer, a 640,000 core, SPARC64 processor based distributed-memory computer, shows very promising results for this new approach.…”
Section: Recent Developments In Parallel Eigenvalue Solversmentioning
confidence: 99%
“…The parallelization of the transformation was optimized for massively parallel computing on the K computer . We also implemented the eigen_sx routine of EigenExa and the locally optimal block preconditioned conjugated gradient (LOBPCG) algorithm into the Platypus‐QM unit, to accelerate the computation of eigenvalue problems.…”
Section: Methodsmentioning
confidence: 99%
“…Peta-scale systems are defined as systems which are able to provide peta-FLOPS, millions of billions of FLoating OPerations per Second, computational power [1,2]. They can be described as the increasingly massive and dynamic networks of interconnected diverse processors and components (i.e.…”
Section: Simulation and Peta-scale Systemsmentioning
confidence: 99%