2007
DOI: 10.1145/1273442.1250759
|View full text |Cite
|
Sign up to set email alerts
|

Optimistic parallelism requires abstractions

Abstract: Irregular applications, which manipulate large, pointer-based data structures like graphs, are difficult to parallelize manually. Automatic tools and techniques such as restructuring compilers and runtime speculative execution have failed to uncover much parallelism in these applications, in spite of a lot of effort by the research community. These difficulties have even led some researchers to wonder if there is any coarse-grain parallelism worth exploiting in irregular applications.In this paper, we describe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
143
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 139 publications
(146 citation statements)
references
References 45 publications
1
143
0
Order By: Relevance
“…A few larger workloads that target TM systems have been designed to address the disparity between microbenchmarks and full applications: Delaunay mesh generation [35], database management [14], BerkeleyDB [12], maze routing [40], and Delaunay mesh refinement and agglomerative clustering [24]. These applications represent realistic workloads and avoid the pitfalls of microbenchmarks.…”
Section: B Transactional Memory Benchmarksmentioning
confidence: 99%
See 1 more Smart Citation
“…A few larger workloads that target TM systems have been designed to address the disparity between microbenchmarks and full applications: Delaunay mesh generation [35], database management [14], BerkeleyDB [12], maze routing [40], and Delaunay mesh refinement and agglomerative clustering [24]. These applications represent realistic workloads and avoid the pitfalls of microbenchmarks.…”
Section: B Transactional Memory Benchmarksmentioning
confidence: 99%
“…The usage of transactions in yada is similar to that in [24], but it is applied to a different algorithm in this benchmark. Accesses to the work queue are enclosed by a transaction as is the entire refinement of a skinny triangle.…”
Section: B Applicationsmentioning
confidence: 99%
“…The experiments were performed in a system with 2 Intel Xeon E5-2660 Sandy Bridge-EP CPUs (8 cores/CPU) at 2.2 GHz and 64 GB of RAM, using g++ 4.7.2 with optimization flag −O3. The graph, node and edge classes used were taken from the Galois system [3], as they were found to be more efficient than the locally developed ones used in [5] and the skeleton transparently supports any classes. The inputs were a road map of the USA with 24 million nodes and 58 million edges for Boruvka, IS and ST, a road map of New York City with 264 thousand nodes and 733 thousand edges for SSSP -both maps taken from [22]-and a mesh with 1 million triangles taken from the Galois project for DMR.…”
Section: Discussionmentioning
confidence: 99%
“…RELATED WORK While we have not found any other skeleton-based approach oriented to the parallelization of this kind of applications, there are proposals with this aim. The Galois system [3] is a framework for this kind of algorithms that relies on user annotations that describe the properties of the operations. Its interface can be simplified though, if only cautious and unordered algorithms are considered.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation