2014
DOI: 10.1145/2579561
|View full text |Cite
|
Sign up to set email alerts
|

Integrating profile-driven parallelism detection and machine-learning-based mapping

Abstract: Compiler-based auto-parallelization is a much-studied area but has yet to find widespread application. This is largely due to the poor identification and exploitation of application parallelism, resulting in disappointing performance far below that which a skilled expert programmer could achieve. We have identified two weaknesses in traditional parallelizing compilers and propose a novel, integrated approach resulting in significant performance improvements of the generated parallel code. Using profile-driven … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
7
2
1

Relationship

5
5

Authors

Journals

citations
Cited by 59 publications
(37 citation statements)
references
References 68 publications
0
37
0
Order By: Relevance
“…Predictive Modeling Machine learning based predictive modeling is emerging as a powerful technique for optimizing parallel programs [28,29,33,37,[39][40][41]. Its great advantage is its ability to adapt to changing platforms as it has no a prior assumptions about their behavior.…”
Section: Related Workmentioning
confidence: 99%
“…Predictive Modeling Machine learning based predictive modeling is emerging as a powerful technique for optimizing parallel programs [28,29,33,37,[39][40][41]. Its great advantage is its ability to adapt to changing platforms as it has no a prior assumptions about their behavior.…”
Section: Related Workmentioning
confidence: 99%
“…The supervised methods usually distinguish and label the code regions which are suitable for parallelization or not, and then use the information to train the model [78]. While unsupervised learning methods are mainly based on clustering techniques and employ the historical data or profile data [79] to find program fragments with similar patterns [80].…”
Section: Intelligent Algorithmsmentioning
confidence: 99%
“…HELIX [10] uses a dynamic loop nesting graph to select a set of loops to parallelize. [30] uses dynamic analysis of control and data dependencies as input to a trained predictor to autoparallelize loops, relying on the user to check correctness.…”
Section: Generated Halide Code For Photoshop Filtersmentioning
confidence: 99%