2016
DOI: 10.1016/j.cpc.2016.05.024
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical parallelisation of functional renormalisation group calculations — hp-fRG

Abstract: The functional renormalisation group (fRG) has evolved into a versatile tool in condensed matter theory for studying important aspects of correlated electron systems. Practical applications of the method often involve a high numerical effort, motivating the question in how far High Performance Computing (HPC) can leverage the approach.In this work we report on a multi-level parallelisation of the underlying computational machinery and show that this can speed up the code by several orders of magnitude. This… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 28 publications
0
6
0
Order By: Relevance
“…Minor deviations between the fRG and PA (on one side) and DQMC (on the other side) are instead observed, as expected, by increasing the interaction values. Further optimization and parallelization of the code [54] will allow us to overcome the present restriction to essentially a single s-wave form factor. This is necessary to explore broader parameter regions.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Minor deviations between the fRG and PA (on one side) and DQMC (on the other side) are instead observed, as expected, by increasing the interaction values. Further optimization and parallelization of the code [54] will allow us to overcome the present restriction to essentially a single s-wave form factor. This is necessary to explore broader parameter regions.…”
Section: Discussionmentioning
confidence: 99%
“…Strongly inspired by earlier channel-decomposition schemes [48] and the singular-mode fRG by Wang et al [41], the so-called truncated unity fRG was set up [53]. This formalism combines various technical improvements and allows for the development of a highly parallelizable and fast-performing code, mainly involving 2D (or 3D) integrations and matrix multiplications [54]. The name "truncated unity" comes from the insertion of unity into loop diagrams.…”
Section: Introductionmentioning
confidence: 99%
“…1 We consider the coarseness of the momentum discretisation as a parameter, albeit not in a fully comprehensive manner due to compute time limits. Thanks to a highly scalable parallel algorithm [38] we can treat flow equations for up to about 40 million couplings and simultaneously the two-loop contributions for the self-energy at a sufficiently accurate level. 2 Figure 4 summarises various fRG treatments we used, together with results from other methods available in the literature [14,[39][40][41][42][43].…”
Section: Results From Functional Rgmentioning
confidence: 99%
“…In fact on the formal level we can write (iω − ξ 0 k − gΣ g ) −1 ∂ g (g Σ g ) = −∂ g ln(iω − ξ 0 k − gΣ g ) = ∂ g ln((iω − ξ 0 k − gΣ g ) −1 ) ≈ ∂ g ln(Z g (iω − ξ 0 k ) −1 ) = ∂ g (ln(Z g ) − ln(iω − ξ 0 k )) = ∂ g ln(Z g ) = (∂ g Z g )Z −1 g q.e.d. , (38) where we have inserted equation (15) in the third step. This provides a consistency check to some extent.…”
Section: A4 Step 2: Inclusion Of Self-energy Feedback Via the Quasi-mentioning
confidence: 99%
“…Examination of Scalasca execution traces was key to determining optimal load-balancing of MPI and OpenMP computations, and associated workload distribution and loop scheduling strategies, to allow hp-fRG to scale effectively to use all of the JUQUEEN compute nodes with 1.8 million threads [31].…”
Section: Parallel Performance Analysismentioning
confidence: 99%