2015 Conference on Design and Architectures for Signal and Image Processing (DASIP) 2015
DOI: 10.1109/dasip.2015.7367257
|View full text |Cite
|
Sign up to set email alerts
|

BM3D image denoising using heterogeneous computing platforms

Abstract: Noise reduction is one of the most fundamental digital image processing problems, and is often designed to be solved at an early stage of the image processing path. Noise appears on the images in many different ways, and it is inevitable. In general, various image processing algorithms perform better if their input is as error-free as possible. In order to keep the processing delays small in different computing platforms, it is important that the noise reduction is performed swiftly.The recent progress in the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(17 citation statements)
references
References 19 publications
0
16
0
Order By: Relevance
“…The block-matching procedure was accelerated by an efficient data reuse method applied to the whole image, while the 3D transformation was accelerated by utilizing a fast software FFTW library. Later on, the work of [8] proposed the first open-source GPU-based implementation of the BM3D algorithm by using both the OpenCL and CUDA frameworks. The final design achieved 7.5× speed up compared to the CPU-based design of [7].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The block-matching procedure was accelerated by an efficient data reuse method applied to the whole image, while the 3D transformation was accelerated by utilizing a fast software FFTW library. Later on, the work of [8] proposed the first open-source GPU-based implementation of the BM3D algorithm by using both the OpenCL and CUDA frameworks. The final design achieved 7.5× speed up compared to the CPU-based design of [7].…”
Section: Introductionmentioning
confidence: 99%
“…Very recently, the study of [9] presented a highly efficient GPU-based software accelerator implemented in CUDA. The authors proposed a more fine-grained partition of the algorithm compared to [8] and optimized data caching and sharing schemes for block-matching, which removed many redundant computations and thus significantly improved the image processing speed. In [10], the authors targeted an GPU implementation for VBM3D and focused on reducing the use of external memory bandwidth, which is realized by regrouping all filtering operations, including fetching the patch data, the data transpositions, the 1D filtering and the thresholding, into one kernel without requiring intermediate bufffer for patches caching.…”
Section: Introductionmentioning
confidence: 99%
“…For a typically sized volume, the filtering speed requires further improvement in order for it to be useful in most typical MC simulations (10 6 to 10 8 photons). Although the GPU-BM3D filters 39,40 reported excellent speed, they are designed for filtering two-dimensional (2-D) images and are not suited for 3-D denoising. As far as we know, there is no publication on GPU-BM4D filters.…”
Section: Introductionmentioning
confidence: 99%
“…For all details of this algorithm realization, interested readers can refer to [16,21,22], while software realization is available at (http://www.cs.tut.fi/~foi/GCF-BM3D/).…”
Section: Algorithm Bm3d Filtermentioning
confidence: 99%
“…The BM3D algorithm consists of the coarse and fine algorithm runs [16,21,22] (http://www.cs.tut.fi/~foi/ GCF-BM3D/). Here, the BM3D algorithm is briefly summarized.…”
Section: Bm3dmentioning
confidence: 99%