2008
DOI: 10.1109/tpds.2007.70716

Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting

Abstract: The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer systems. Several studies have compared the performance of the most popular schemes, known as Filter Bank Scheme (FBS) and Lifting Scheme (LS), and have always concluded that LS is the most efficient option. However, there is no such study on streaming processors such as modern Graphics Processing Units (GPUs). Current trends have transformed these …

Cited by 92 publications (67 citation statements)
References 41 publications
“…A reasonable speed-up (13) has been obtained with high video resolutions. However, we can achieve better performance if we compute the filtering steps from the shared memory.…”
Section: Memory Access Optimization (mentioning)
confidence: 88%
“…In [12], a Single Instruction Multiple Data (SIMD) algorithm runs the 2D-DWT on a GeForce 7800 GTX using Cg and OpenGL, with a remarkable speed-up. A similar effort has been performed in [13] combining Cg and the 7800 GTX to report a 1.2-3.4 speed-up versus a CPU counterpart.…”
Section: Introduction (mentioning)
confidence: 99%
“…The row-column methods process all of the horizontal filtering steps prior to the vertical ones. The row-column method applied on the entire 2-D image was used for instance in [10][11][12][13][14][15][16]. In some papers, the transition between the horizontal and vertical stage is accompanied with data transposition.…”
Section: Sweldens Scheme (mentioning)
confidence: 99%
“…For instance, Wong et al implement a two-dimension DWT with Cg and OpenGL on a GeForce GTX 7800 [31]. Similarly, in [28] authors also explore the implementation of a fast 2D-DWT with Filter Bank Scheme (FBS) and Lifting Scheme (LS) using Cg on the same GPU. With NVIDIA's CUDA library [6], people have implemented 2D-DWT variants [10,18] and a 3D-DWT on GPUs [11].…”
Section: Related Work (mentioning)
confidence: 99%