Proceedings of the 21st Annual International Conference on Supercomputing 2007
DOI: 10.1145/1274971.1274989

Representation-transparent matrix algorithms with scalable performance

Abstract: Positive results from new object-oriented tools for scientific programming are reported. Using template classes, abstractions of matrix representations are available that subsume conventional row-major, column-major, either Z- or I-Morton-order, as well as block-wise combinations of these. Moreover, the design of the Matrix Template Library (MTL) has been independently extended to provide recursators, to support block-recursive algorithms, supplementing MTL's iterators. Data types modeling both concepts enable t…
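
To make the layout terms in the abstract concrete, here is a minimal C++ sketch of how an element position (row, col) could map to a storage offset under row-major, column-major, and Z-Morton ordering. It is not MTL's interface: the function names (`row_major_index`, `col_major_index`, `spread_bits`, `z_morton_index`) are hypothetical, and the Z-Morton variant assumes indices that fit in 16 bits, with row bits interleaved above column bits.

```cpp
#include <cstddef>
#include <cstdint>

// Row-major: elements of one row are contiguous.
// `cols` is the number of columns in the matrix.
inline std::size_t row_major_index(std::size_t row, std::size_t col, std::size_t cols) {
    return row * cols + col;
}

// Column-major: elements of one column are contiguous.
// `rows` is the number of rows in the matrix.
inline std::size_t col_major_index(std::size_t row, std::size_t col, std::size_t rows) {
    return col * rows + row;
}

// Spread the low 16 bits of x so that bit i moves to bit 2*i
// (the even bit positions), leaving the odd positions zero.
inline std::uint32_t spread_bits(std::uint32_t x) {
    x &= 0x0000FFFFu;
    x = (x | (x << 8)) & 0x00FF00FFu;
    x = (x | (x << 4)) & 0x0F0F0F0Fu;
    x = (x | (x << 2)) & 0x33333333u;
    x = (x | (x << 1)) & 0x55555555u;
    return x;
}

// Z-Morton order (assumed convention): interleave the bits of row and col,
// with row bits in the odd positions and column bits in the even positions.
// Quadrants are then visited as NW, NE, SW, SE at every level of recursion.
inline std::uint32_t z_morton_index(std::uint32_t row, std::uint32_t col) {
    return (spread_bits(row) << 1) | spread_bits(col);
}
```

Because each 2x2 block, then each 4x4 block, and so on occupies a contiguous index range under this ordering, a block-recursive algorithm can address a quadrant as a contiguous sub-array, which is the kind of locality that recursive traversals aim to exploit.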

Cited by 32 publications (18 citation statements)
References 19 publications (22 reference statements)

“…First, methods for program transformation have been explored to improve the cache hit rate in uniprocessor systems, or to improve the data locality in distributed-memory parallel computers [9], [10]. Gottschling [11] proposed a representation-transparent matrix algorithm for multicore chip and developed matrix template library (MTL) for matrix application, such as matrix multiplication. Ruetsch [12] and Podlozhnyuk [13] combine features of the GPU warp access memory and the shared memory structure to enhance matrix transpose performance.…”
Section: Related Work (mentioning)
confidence: 99%
“…Sequential libraries such as STL [19], BGL [10], and MTL [9], provide data structures such as arrays, vectors, lists, maps, matrices, and graphs. A parallel container is an object oriented implementation of a data structure designed to be used efficiently in a parallel environment.…”
Section: The STAPL Parallel Container (mentioning)
confidence: 99%
“…pContainers can be constructed from any base container, sequential or parallel, so long as it can support the required interface. The pContainers currently provided in STAPL use the corresponding STL containers (e.g., the STAPL pVector uses the STL vector), containers from other sequential libraries (e.g., MTL [9] for matrices), containers available in libraries developed for multicore (e.g., TBB [14] concurrent containers), or other pContainers. This flexibility allows for code reuse and supports interoperability with other libraries.…”
Section: pContainer Definition (mentioning)
confidence: 99%