2020
DOI: 10.1098/rsta.2019.0066
|View full text |Cite
|
Sign up to set email alerts
|

Numerical algorithms for high-performance computational science

Abstract: A number of features of today’s high-performance computers make it challenging to exploit these machines fully for computational science. These include increasing core counts but stagnant clock frequencies; the high cost of data movement; use of accelerators (GPUs, FPGAs, coprocessors), making architectures increasingly heterogeneous; and multi- ple precisions of floating-point arithmetic, including half-precision. Moreover, as well as maximizing speed and accuracy, minimizing energy consumption is an … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
5
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(7 citation statements)
references
References 85 publications
0
5
0
1
Order By: Relevance
“…Linear equations and eigenvalue problems with large dimensionality, on the order of >10 6 degrees of freedom, and no special structure or sparsity arise in many scientific and engineering applications, for example, quantum chemical and materials science simulations, partial differential equation solvers, signal reconstruction, and machine learning applications. [1][2][3] Density functional calculations of light-induced processes in organic and semiconductor nanostructures have pushed the limit of quantum chemical studies to systems with 1000 or more atoms. Molecular and material property calculations in these systems amount to solving eigenvalue problems of dimension one billion or larger.…”
Section: Introductionmentioning
confidence: 99%
“…Linear equations and eigenvalue problems with large dimensionality, on the order of >10 6 degrees of freedom, and no special structure or sparsity arise in many scientific and engineering applications, for example, quantum chemical and materials science simulations, partial differential equation solvers, signal reconstruction, and machine learning applications. [1][2][3] Density functional calculations of light-induced processes in organic and semiconductor nanostructures have pushed the limit of quantum chemical studies to systems with 1000 or more atoms. Molecular and material property calculations in these systems amount to solving eigenvalue problems of dimension one billion or larger.…”
Section: Introductionmentioning
confidence: 99%
“…In [9], for example, an approach to an automated CFD-based optimization is shown. With the rapid increase in performance of processor and memory technologies and current research in the HPC domain [10], further increases in efficiency and cost reductions are highly probable. Combined with progressive development of computational methodologies [11,12], this opens new potential for product development [13].…”
Section: Introductionmentioning
confidence: 99%
“…With the slowing down of Moore's Law [51], future computer systems will need to resort to domain-specific accelerators for continuous performance scaling [27], [63] under the same power envelope. This is especially the case for HPC and data centers, as we are quickly entering an era of extreme heterogeneity [71], characterized by cluster nodes integrating a multitude of cooperating accelerators [33], [38], [43], [61].…”
Section: Introductionmentioning
confidence: 99%
“…(2) If data is not locally available, during the long-time remote data fetching, the compute units can be idle; (2) Even worse, these idle units or idle nodes cannot be reclaimed for other tasks despite the node may hold their desired data. As emerging workloads become more dynamic and data-driven, decentralized asynchronous task management is highly desired, while data locality becomes a crucial factor for the system design [18], [27], [63]. This is largely due to the observation that the energy cost of data-movement significantly overweights the energy cost of computing them [34], [40], [50], which is particularly the case when migrating arXiv:2011.04931v1 [cs.DC] 10 Nov 2020 data through the interconnect network (e.g., the power budget is ∼5.5 watts per full bi-directional NVLink port [1]).…”
Section: Introductionmentioning
confidence: 99%