2014
DOI: 10.1016/j.amc.2014.02.051

Towards an efficient use of the BLAS library for multilinear tensor contractions

Abstract: Mathematical operators whose transformation rules constitute the building blocks of a multi-linear algebra are widely used in physics and engineering applications where they are very often represented as tensors. In the last century, thanks to the advances in tensor calculus, it was possible to uncover new research fields and make remarkable progress in the existing ones, from electromagnetism to the dynamics of fluids and from the mechanics of rigid bodies to quantum mechanics of many atoms. By now, the forma…

Cited by 33 publications (33 citation statements)
References 19 publications
“…BLAS procedure) the number of efficient algorithms for tensor contractions is rather limited. In practice, due to the high computational complexity of tensor contractions, especially for tensor networks with loops, this operation is often performed approximately [66,107,138,167]. A variant of Tensor Trace [128] for the case of the partial tensor self-contraction considers a tensor $\mathbf{A} \in \mathbb{R}^{R \times I_1 \times I_2 \times \cdots \times I_N \times R}$ and yields a reduced-order tensor r…”
mentioning
confidence: 99%
“…1: $C_{abc} = A_{ai} B_{ibc}$: 9 exemplary algorithms out of 36. As already mentioned, given a contraction, there is no obvious a-priori choice of kernel and slicings to attain the highest performance. We therefore generate all possible combinations.…”
Section: Algorithm Generation
mentioning
confidence: 99%
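
The contraction quoted above, $C_{abc} = A_{ai} B_{ibc}$, can be evaluated in more than one way with a matrix-multiplication kernel. The following is a minimal sketch of two such variants, not the algorithms of the citing paper: one slices B along the index c and performs one matrix product per slice, the other fuses b and c into a single index so that one matrix product suffices. The helper `matmul` is a hypothetical pure-Python stand-in for a BLAS gemm call, and all function names and the sample data are assumptions made for illustration.

```python
# Illustrative sketch only: `matmul` is a placeholder for a BLAS gemm call,
# and tensors are stored as nested Python lists.

def matmul(X, Y):
    """Naive matrix product of two nested lists (stand-in for gemm)."""
    inner, cols = len(Y), len(Y[0])
    return [[sum(row[k] * Y[k][c] for k in range(inner)) for c in range(cols)]
            for row in X]

def contract_reference(A, B):
    """Direct evaluation of C[a][b][c] = sum_i A[a][i] * B[i][b][c]."""
    na, ni, nb, nc = len(A), len(B), len(B[0]), len(B[0][0])
    return [[[sum(A[a][i] * B[i][b][c] for i in range(ni))
              for c in range(nc)] for b in range(nb)] for a in range(na)]

def contract_sliced_over_c(A, B):
    """Slice B along c: each slice is an (i x b) matrix, one product per c."""
    na, ni, nb, nc = len(A), len(B), len(B[0]), len(B[0][0])
    C = [[[0] * nc for _ in range(nb)] for _ in range(na)]
    for c in range(nc):
        B_slice = [[B[i][b][c] for b in range(nb)] for i in range(ni)]
        C_slice = matmul(A, B_slice)  # shape (a x b)
        for a in range(na):
            for b in range(nb):
                C[a][b][c] = C_slice[a][b]
    return C

def contract_flattened(A, B):
    """Fuse (b, c) into one index so the contraction is a single product."""
    ni, nb, nc = len(B), len(B[0]), len(B[0][0])
    B_flat = [[B[i][b][c] for b in range(nb) for c in range(nc)]
              for i in range(ni)]
    C_flat = matmul(A, B_flat)  # shape (a x (b*c))
    return [[[row[b * nc + c] for c in range(nc)] for b in range(nb)]
            for row in C_flat]

# Small sample data: A has indices (a, i), B has indices (i, b, c).
A = [[1, 2], [3, 4], [5, 6]]
B = [[[1, 0], [2, 1]], [[0, 1], [1, 3]]]

C_ref = contract_reference(A, B)
C_sliced = contract_sliced_over_c(A, B)
C_flat = contract_flattened(A, B)
```

Both variants return the same tensor as the direct triple loop. Which one runs faster in practice depends on the sizes and memory layout, which is consistent with the quoted remark that there is no obvious a-priori choice.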
“…The most prominent project targeting the efficient computation of tensor contractions is probably the Tensor Contraction Engine, a compiler built specifically for multi-tensor multi-index contractions to be executed within memory constraints [4]; in light of the wide diffusion and nearly optimal efficiency of the BLAS library, an extension to TCE was proposed to compute contractions via BLAS operations [5]. In the same spirit, we provided simple rules to build a taxonomy for all contractions between two tensors, identifying which BLAS routines are usable and how to best exploit them [6].…”
Section: Introduction
mentioning
confidence: 99%
“…This explains extra parameters in gemm (matrix multiplication) and gemv (matrix-vector) multiplication that allow block-strided access (which is equivalent to allowing matrices within larger matrices). Recently it was observed that the gemm interface is not sufficient when multiplying matrices in higher-dimensional tensors [13].…”
Section: Related Work
mentioning
confidence: 99%
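
The "matrices within larger matrices" remark in the statement above refers to the leading-dimension parameters of gemm, which give the row (or column) stride of the parent array rather than of the submatrix itself. The sketch below imitates that idea in pure Python on flat, row-major buffers. The function `gemm_strided`, its argument order, and the offsets are assumptions chosen for illustration and do not reproduce the actual BLAS signature.

```python
# Illustrative sketch only: a gemm-like routine with leading dimensions,
# operating on flat row-major lists. Not the real BLAS interface.

def gemm_strided(m, n, k, a, a_off, lda, b, b_off, ldb, c, c_off, ldc):
    """Compute C += A @ B for an (m x k) block A and a (k x n) block B.

    Each block starts at `*_off` inside its flat buffer, and `ld*` is the
    row stride of the parent matrix that contains the block.
    """
    for i in range(m):
        for j in range(n):
            acc = 0
            for p in range(k):
                acc += a[a_off + i * lda + p] * b[b_off + p * ldb + j]
            c[c_off + i * ldc + j] += acc

# Parent matrix P is 3 x 4, stored row by row:
#   [[0, 1,  2,  3],
#    [4, 5,  6,  7],
#    [8, 9, 10, 11]]
P = list(range(12))

# Select the 2 x 2 block [[5, 6], [9, 10]]: it starts at row 1, column 1,
# so the offset is 1 * 4 + 1 = 5 and the leading dimension is 4.
B_small = [1, 2, 3, 4]      # dense 2 x 2 matrix, leading dimension 2
C_out = [0, 0, 0, 0]        # dense 2 x 2 result, leading dimension 2

gemm_strided(2, 2, 2, P, 5, 4, B_small, 0, 2, C_out, 0, 2)
```

No copy of the block is made; the offset and leading dimension alone describe it. This only covers a single fixed stride between rows, which hints at why such an interface can fall short for slices of higher-dimensional tensors, as the quoted statement notes.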