2011
DOI: 10.1016/j.jpdc.2010.08.005
|View full text |Cite
|
Sign up to set email alerts
|

Data locality and parallelism optimization using a constraint-based approach

Abstract: Cataloged from PDF version of article.Embedded applications are becoming increasingly complex and processing ever-increasing datasets. In\ud the context of data-intensive embedded applications, there have been two complementary approaches to\ud enhancing application behavior, namely, data locality optimizations and improving loop-level parallelism.\ud Data locality needs to be enhanced to maximize the number of data accesses satisfied from the higher\ud levels of the memory hierarchy. On the other hand, compil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 58 publications
(46 reference statements)
0
4
0
Order By: Relevance
“…There are also several approaches to the automatic computation of a suitable data layout transformation, either directly [20,17,21,22,16] or in combination with a loop transformation [23,24,25]. We have not done so, but it would be possible to add one or more of these techniques to our code generator.…”
Section: Related Workmentioning
confidence: 99%
“…There are also several approaches to the automatic computation of a suitable data layout transformation, either directly [20,17,21,22,16] or in combination with a loop transformation [23,24,25]. We have not done so, but it would be possible to add one or more of these techniques to our code generator.…”
Section: Related Workmentioning
confidence: 99%
“…Naci in 2007 [20] tried to improve data locality of loops with loop transformations. Ozturk in 2011 [21] presented a method based on constraint satisfaction problem (CSP) to handle data locality and parallelizing of loop nests. Parsa et al [22] proposed a method to expose coarse-grain parallelism by reusing data to execute loops on multicore processors.…”
Section: Relatedmentioning
confidence: 99%
“…Also, the order of the scheduling coefficients in the objective function of ILP formulation impacts the obtained solution, and it is a time consuming task to examine all the ordering possibilities. Also, in [13], a unified approach that integrates locality and parallelization optimizations for chip multiprocessors was proposed. In that work, the problem was formulated in a constraint network and used a search algorithm to solve it.…”
Section: Related Workmentioning
confidence: 99%