Performance of CAP-specified linear algebra algorithms

Mazzariol, Marc; Gennart, Benoit A.; Messerli, Vincent; Hersch, Roger D.

doi:10.1007/3-540-63697-8_104

Cited by 4 publications

(1 citation statement)

References 2 publications

(4 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Besides the Visible Human Slice Server, CAP has been applied successfully to a number of applications, both in the field of image processing 4 and in the field scientific computing 3,7 . We have shown 3 that the overhead specific to CAP is very low: each token incorporates in addition to its user-defined structure a 24 bytes header.…”

Section: Discussionmentioning

confidence: 99%

<title>Computer-aided synthesis of parallel image processing applications</title>

Gennart

Hersch

1999

SPIE Proceedings

Self Cite

View full text Add to dashboard Cite

We present a tutorial description of the CAP Computer-Aided Parallelization tool. CAP has been designed with the goal of letting the parallel application programmer having the complete control about how his application is parallelized, and at the same time freeing him from the burden of managing explicitly a large number of threads and associated synchronization and communication primitives. The CAP tool, a precompiler generating C++ source code, enables application programmers to specify at a high level of abstraction the set of threads present in the application, the processing operations offered by these threads, and the parallel constructs specifying the flow of data and parameters between operations. A configuration map specifies the mapping between CAP threads and operating system processes, possibly located on different computers. The generated program may run on various parallel configurations without recompilation. We discuss the issues of flow control and load balancing and show the solutions offered by CAP. We also show how CAP can be used to generate relatively complex parallel programs incorporating neighbourhood dependent operations. Finally, we briefly describe a real 3D image processing application: the Visible Human Slice Server (http://visiblehuman.epfl.ch), its implementation according to the previously defined concepts and its performances.

show abstract

Section: Discussionmentioning

confidence: 99%

<title>Computer-aided synthesis of parallel image processing applications</title>

Gennart

Hersch

1999

SPIE Proceedings

Self Cite

View full text Add to dashboard Cite

show abstract

Parallelizing I/O-intensive image access and processing applications

et al. 1999

View full text Add to dashboard Cite

Abstract. We propose a new approach for developing parallel I/O-and computeintensive applications on distributed memory PC. Using the CAP Computer-Aided Parallelization tool, application programmers create separately the serial program parts and express the parallel behavior of the program at a high level of abstraction. This highlevel parallel program description (CAP) is preprocessed into a compilable and executable C++ source parallel program. Low-level parallel file system components can, thanks to the CAP formalism, be combined with processing operations in order to yield efficient pipelined parallel I/O and compute intensive programs. These programs may run on multiple PC servers offering their access and processing services to clients located over the network. The applicability of the CAP tools on a real application is demonstrated with a parallel 3D tomographic image server application enabling clients to specify and access in parallel image slices having any desired position and orientation. The image slices are extracted from a 14 GByte color 3D tomographic image striped over the available set of disks. On a 5 Bi-Pentium Pro PC server comprising 60 disks, the system is able to extract in parallel, resample and visualize 4.8 512x512 colour image slices per second. At the highest load and when file caching is disabled, an aggregate I/O disk bandwidth of 104 MBytes/s has been obtained. When caching is enabled, an I/O throughput of up to 240 MBytes/s is obtained.

show abstract

Tools for parallel I/O and compute intensive applications

Messerli¹

2005

View full text Add to dashboard Cite

Performance of CAP-specified linear algebra algorithms

Cited by 4 publications

References 2 publications

<title>Computer-aided synthesis of parallel image processing applications</title>

<title>Computer-aided synthesis of parallel image processing applications</title>

Parallelizing I/O-intensive image access and processing applications

Tools for parallel I/O and compute intensive applications

Contact Info

Product

Resources

About