2018
DOI: 10.1117/1.jbo.23.1.010504
|View full text |Cite
|
Sign up to set email alerts
|

Scalable and massively parallel Monte Carlo photon transport simulations for heterogeneous computing platforms

Abstract: . We present a highly scalable Monte Carlo (MC) three-dimensional photon transport simulation platform designed for heterogeneous computing systems. Through the development of a massively parallel MC algorithm using the Open Computing Language framework, this research extends our existing graphics processing unit (GPU)-accelerated MC technique to a highly scalable vendor-independent heterogeneous computing environment, achieving significantly improved performance and software portability. A number o… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
94
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 146 publications
(96 citation statements)
references
References 9 publications
2
94
0
Order By: Relevance
“…From our tests, MMCL has shown about 2 orders of magnitude speedup compared to our highly optimized single-thread CPU implementation. Although we recognize the 2-to 5-fold speed disadvantage for OpenCL compared to CUDA on NVIDIA hardware, as shown in our previous study, 14 in this work, our decision of prioritizing the development of OpenCL MMC is largely motivated by 1) the open-source ecosystem of OpenCL allowing the developed software to be rapidly and widely disseminated through public software repositories, and 2) the upcoming highperformance GPUs from Intel and AMD being expected to attract more development attention to OpenCL libraries and drivers, likely resulting in boosts in performance.…”
Section: Introductionsupporting
confidence: 44%
See 2 more Smart Citations
“…From our tests, MMCL has shown about 2 orders of magnitude speedup compared to our highly optimized single-thread CPU implementation. Although we recognize the 2-to 5-fold speed disadvantage for OpenCL compared to CUDA on NVIDIA hardware, as shown in our previous study, 14 in this work, our decision of prioritizing the development of OpenCL MMC is largely motivated by 1) the open-source ecosystem of OpenCL allowing the developed software to be rapidly and widely disseminated through public software repositories, and 2) the upcoming highperformance GPUs from Intel and AMD being expected to attract more development attention to OpenCL libraries and drivers, likely resulting in boosts in performance.…”
Section: Introductionsupporting
confidence: 44%
“…Moreover, based on our previous observations in voxel-based MC using OpenCL and CUDA on NVIDIA devices, 14 we noticed that CUDA-based MC simulation is about 2-to-5-fold faster than the OpenCL implementation due to driver support differences. As a result, we anticipate that if we further port MMCL to the CUDA programming language, one may achieve further speed improvement, with the resulting software limited to NVIDIA GPUs only.…”
Section: Discussionmentioning
confidence: 68%
See 1 more Smart Citation
“…We used an open source, GPU-accelerated Monte Carlo program (Monte Carlo eXtreme) to compute the detection sensitivity functions for DiFC (25). We modeled the tail as a homogenous 4 mm diameter, 4 cm long cylinder, with voxel size of 250 µm 3 .…”
Section: Monte Carlo Simulationsmentioning
confidence: 99%
“…32 This tool can shorten the simulation runtime by several hundreds fold compared to conventional CPU based simulations. 32,33 The remainder of the paper is organized as follows. In the Materials and Methods section, we detail the preprocessing steps to create 4-layer head segmentations from the neurodevelopmental MRI brain atlas library.…”
Section: Introductionmentioning
confidence: 99%