ACM SIGGRAPH ASIA 2008 Courses on - SIGGRAPH Asia '08 2008
DOI: 10.1145/1508044.1508084
|View full text |Cite
|
Sign up to set email alerts
|

Massively parallel volume rendering using 2--3 swap image compositing

Abstract: The ever-increasing amounts of simulation data produced by scientists demand high-end parallel visualization capability. However, image compositing, which requires interprocessor communication, is often the bottleneck stage for parallel rendering of large volume data sets. Existing image compositing solutions either incur a large number of messages exchanged among processors (such as the direct send method), or limit the number of processors that can be effectively utilized (such as the binary swap method). We… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
39
0

Year Published

2010
2010
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(40 citation statements)
references
References 15 publications
1
39
0
Order By: Relevance
“…This approach has inefficiencies because processes have to sit idle during most of the computation. The 2-3 swap algorithm [34] takes a different approach. It relaxes binary swap such that processes can be grouped into pairs of two (like binary swap) or sets of three (unlike binary swap).…”
Section: Basic Parallel Compositing Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…This approach has inefficiencies because processes have to sit idle during most of the computation. The 2-3 swap algorithm [34] takes a different approach. It relaxes binary swap such that processes can be grouped into pairs of two (like binary swap) or sets of three (unlike binary swap).…”
Section: Basic Parallel Compositing Algorithmsmentioning
confidence: 99%
“…These increased demands on sort-last rendering have spawned a resurgence in image compositing research. Recent studies led to the creation of new image compositing algorithms [20,34], and new compositing enhancements [8]. Although each of these studies improve the state of the art in image compositing, all involve locally built algorithm implementations that contain some isolated subset of enhancements.…”
Section: Introductionmentioning
confidence: 99%
“…However, it will also generate p × (p − 1) messages to be exchanged among all participating processes. In a communication network where each of the participating processes are connected by network links, this will likely generate link contention as multiple processes will simultaneously be sending messages to the same process [3]. The execution time for direct-send is:…”
Section: Related Workmentioning
confidence: 99%
“…This lack of locality can result in "cache thrashing," which is a relatively low level of cache reuse. They report that object-parallel partitionings scale well, and this form of partitioning has been adopted as the basis for parallel work decomposition in many subsequent works (e.g., [4,35,40,10,20]). In contrast, our work here is a more comprehensive, systematic exploration of the relationship between algorithmic optimization and tunable algorithmic parameters -image tile size, work assignment strategy, and alternative memory layouts for the source data, and algorithmic optimizations -and their impact on algorithm performance in terms of runtime and cache utilization measured via hardware performance counters.…”
Section: Previous Workmentioning
confidence: 99%