2017
DOI: 10.1016/j.procs.2017.05.145
|View full text |Cite
|
Sign up to set email alerts
|

cuHinesBatch: Solving Multiple Hines systems on GPUs Human Brain Project * *This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 720270 (HBP SGA1), from the Spanish Ministry of Economy and Competitiveness under the project Computación de Altas Prestaciones VII (TIN2015-65316-P) and the Departament d'Innovació, Universitats i Empresa de la Generalitat de Catalunya, under project MPEXPAR: Models de Programació i Entorns d'Execució

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 15 publications
(12 citation statements)
references
References 7 publications
0
12
0
Order By: Relevance
“…But, with few cells per morphological type, Gaussian elimination suffers from non-contiguous layout of parents relative to a group of nodes. This results in irregular, strided memory accesses and hence poor performance (Valero-Lara et al, 2017). To address this, two alternative node orderings schemes, Interleaved layout and Constant Depth layout, are implemented as illustrated in Figures 6D,E.…”
Section: Optimizationsmentioning
confidence: 99%
“…But, with few cells per morphological type, Gaussian elimination suffers from non-contiguous layout of parents relative to a group of nodes. This results in irregular, strided memory accesses and hence poor performance (Valero-Lara et al, 2017). To address this, two alternative node orderings schemes, Interleaved layout and Constant Depth layout, are implemented as illustrated in Figures 6D,E.…”
Section: Optimizationsmentioning
confidence: 99%
“…New HPC architectures such as the addition of ubiquitous GPU resources have been a new challenge, requiring new code adaptation with codes such as CoreNEURON and GeNN for single-node GPU neuronal networks. Developing performant algorithms for computing the Hines matrix on GPUs and other vectorize hardware has been an additional hurdle [8], [9]. The development of Arbor [10] has focused on tackling issues of vectorization and emerging hardware architectures by using modern C++ and automated code generation, within an opensource and open-development model.…”
Section: Introductionmentioning
confidence: 99%
“…Big efforts have been carried out by the scientific community in order to increase SpMV performance. An important part of the optimization of scientific codes consists of using the appropriate format to represent matrices in memory [21,20,8,25]. Following different approaches, cache performance, data locality and, consequently, the overall performance of SpMV, has been proven to be affected substantially.…”
Section: State Of the Artmentioning
confidence: 99%