1996
DOI: 10.1006/jpdc.1996.0143
|View full text |Cite
|
Sign up to set email alerts
|

Optimal Compilation of HPF Remappings

Abstract: Applications with varying array access patterns require to dynamically change array mappings on distributed-memory parallel machines. Hpf (High Performance Fortran) provides such remappings, on data that can be replicated, explicitly through the realign and redistribute directives and implicitly at procedure calls and returns. However such features are left out of the hpf subset or of the currently discussed hpf kernel for e ciency reasons. This paper presents a new compilation technique to handle hpf remap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
8
0

Year Published

1996
1996
2006
2006

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 25 publications
0
8
0
Order By: Relevance
“…Further, the approach used for multidimensional array redistribution involves a series of one-dimensional redistributions, which can be costly. Reference [18] proposes a general theoretical framework for data redistribution; this framework is currently being implemented.…”
Section: Related Workmentioning
confidence: 99%
“…Further, the approach used for multidimensional array redistribution involves a series of one-dimensional redistributions, which can be costly. Reference [18] proposes a general theoretical framework for data redistribution; this framework is currently being implemented.…”
Section: Related Workmentioning
confidence: 99%
“…A different approach has been proposed by Coelho and Ancourt. In [7], they describe compilation techniques which optimize remappings by removing useless ones and taking advantage of replications to shorten individual remappings. Optimality in their sense means that for a given remapping, a minimal number of messages is sent over the network.…”
Section: Related Workmentioning
confidence: 99%
“…That is, all the global indices of array elements have to subtract 9 from their original indices in the special case of a twolevel data-processor mapping to get their real indices in the general case. Thus, the compressed local array of processor p 0 becomes A p0 =(&9, &8, &2, 4,5,11,12,18,24,25). In addition, there are three pseudo active elements on processor p 0 (i.e., T (1, 4, 22)).…”
Section: Fig 12 a General Two-level Data-processor Mapping Array Amentioning
confidence: 99%
“…Actually, the compressed local array of processor p 0 for the general two-level dataprocessor mapping shown in Fig. 12 is A p0 = (4,5,11,12,18,24,25). However, if the number of compressed local array elements on processor p is known in advance, the compressed local array of processor p can be generated more efficiently.…”
Section: Fig 12 a General Two-level Data-processor Mapping Array Amentioning
confidence: 99%
See 1 more Smart Citation