2005
DOI: 10.1007/s11265-005-6651-6
|View full text |Cite
|
Sign up to set email alerts
|

Reducing 3D Fast Wavelet Transform Execution Time Using Blocking and the Streaming SIMD Extensions

Abstract: The video compression algorithms based on the 3D wavelet transform obtain excellent compression rates at the expense of huge memory requirements, that drastically affects the execution time of such applications. Its objective is to allow the real-time video compression based on the 3D fast wavelet transform. We show the hardware and software interaction for this multimedia application on a general-purpose processor. First, we mitigate the memory problem by exploiting the memory hierarchy of the processor using… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2008
2008
2016
2016

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 32 publications
(33 reference statements)
0
4
0
Order By: Relevance
“…Today, the standard MPEG-4 [23][24] supports an ad-hoc tool for encoding textures and still images, based on a wavelet algorithm. In a previous work [25], we have presented the implementation of a lossy encoder for medical video based on the 3D fast wavelet transform. This encoder achieves both high compression ratios and excellent quality, so that medical doctors can not find longer differences between the original and the reconstructed video.…”
Section: Previous Workmentioning
confidence: 99%
“…Today, the standard MPEG-4 [23][24] supports an ad-hoc tool for encoding textures and still images, based on a wavelet algorithm. In a previous work [25], we have presented the implementation of a lossy encoder for medical video based on the 3D fast wavelet transform. This encoder achieves both high compression ratios and excellent quality, so that medical doctors can not find longer differences between the original and the reconstructed video.…”
Section: Previous Workmentioning
confidence: 99%
“…Meerwald et al [12], Bernabe et al [13], Chrysafis and Ortega [14] and Lafruit et al [15] present different memory-optimized execution orders or localizations of the WT, offering various methods to avoid off-chip misses: [12] reduces conflict misses in the vertical WT filtering by modifying the data layout and improves the spatial locality by modifying the execution order. Bernabe et al [13] reduces the cache misses during vertical filtering by computing tiles of merged horizontal and vertical filtering, [14] further avoids misses during the higher WT levels by merging lines of computation over all the WT levels, while [15] offers the same advantages, but by merging in a block-based manner, which corresponds well to further processing blocks.…”
Section: Related Workmentioning
confidence: 99%
“…Bernabe et al [13] reduces the cache misses during vertical filtering by computing tiles of merged horizontal and vertical filtering, [14] further avoids misses during the higher WT levels by merging lines of computation over all the WT levels, while [15] offers the same advantages, but by merging in a block-based manner, which corresponds well to further processing blocks. Chaver et al [16] finally realizes a trade-off between the in-placing freedom and spatial locality present in certain implementation styles of the WT.…”
Section: Related Workmentioning
confidence: 99%
“…SIMD extensions of the instruction sets are successfully used in 3D as well. For example, [41] describes the SIMD optimization of 3D wavelet transform. Tomographic reconstruction can be sped up by a factor of 3 using SSE, as described in [42].…”
mentioning
confidence: 99%