Proceedings of the ACM SIGPLAN 1999 Conference on Programming Language Design and Implementation 1999
DOI: 10.1145/301618.301670
|View full text |Cite
|
Sign up to set email alerts
|

Improving cache performance in dynamic applications through data and computation reorganization at run time

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
98
0

Year Published

2004
2004
2015
2015

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 169 publications
(98 citation statements)
references
References 16 publications
0
98
0
Order By: Relevance
“…This approach claims to deliver execution time reductions averaging 20% in a multiprocessor and 30% in a uniprocessor due to significant increases in memory parallelism. A runtime approach to improve computation and data locality in irregular programs based on the inspector-executor method used by Saltz has been proposed in [13]. This work improves computation and data locality, and also eliminates most of the runtime overhead.…”
Section: Code Transformation and Compiler-based Solutionsmentioning
confidence: 99%
“…This approach claims to deliver execution time reductions averaging 20% in a multiprocessor and 30% in a uniprocessor due to significant increases in memory parallelism. A runtime approach to improve computation and data locality in irregular programs based on the inspector-executor method used by Saltz has been proposed in [13]. This work improves computation and data locality, and also eliminates most of the runtime overhead.…”
Section: Code Transformation and Compiler-based Solutionsmentioning
confidence: 99%
“…For N-body simulation, irregular mesh, and sparse matrix computations, a number of studies have showed that run-time reordering can improve sequential running time despite its cost. Ding and Kennedy used lexicographic grouping [6]. Han and Tseng used lexicographic sorting [7].…”
Section: Related Workmentioning
confidence: 99%
“…Section 4 discusses the inexact Newton method implemented and the computational techniques used to improve performance. One technique is to reorder the input data to obtain better locality of reference and cache performance [12,21]. This modification results in a 40% reduction in the overall solution time for the largest meshes tested.…”
Section: Introductionmentioning
confidence: 99%