Low-Cost Hardware-Sharing Architecture of Fast 1-D Inverse Transforms for H.264/AVC and AVS Applications

Su, Guo-An; Fan, Chih‐Peng

doi:10.1109/tcsii.2008.2008058

Cited by 20 publications

(28 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The proposed architecture efficiently integrates H.264/AVC, VC-1 and AVS 1-D inverse transforms. The number of gates for the proposed architecture is close to those in [23] and [24], but the proposed architecture can execute all 1-D inverse transforms required in H.264/AVC, VC-1 and AVS decoders.…”

Section: Simulation Results and Comparisonmentioning

confidence: 92%

“…The architecture can achieve high throughput with many more logic gates. Two distinct 1-D inverse transforms can be integrated using matrix decompositions with sparse matrices and matrix offset computations [23,24]. However, this method is not suitable for three distinct transforms.…”

Section: Simulation Results and Comparisonmentioning

confidence: 99%

“…It uses matrix decompositions with sparse matrices and matrix offset computations. Similarly, a low-cost hardwaresharing architecture for fast 1-D inverse transforms for H.264/AVC and AVS that adds offset computations was designed in [24]. Designs with a hardware-sharing architecture generally have lower hardware costs than those with separate architectures.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Efficient inverse transform architectures for multi-standard video coding applications

Chao

Kao

Liu

et al. 2012

IET Image Processing

View full text Add to dashboard Cite

Hardware designs that can support multiple standards are required for versatile media players. The study proposes a unified inverse transform architecture that can be efficiently used in Moving Picture Expert Group and ITU International Telecommunication Standardisation Sector (ITU-T) H.264/advanced video coding (AVC), Microsoft video codec 1 (VC-1) and Chinese Audio Video Coding Standard (AVS) decoders. For H.264/AVC 8-, 4-and 2-point inverse transforms, the computational complexity in the proposed architecture is similar to that defined in the H.264/AVC standard. By using the symmetry of the transform matrices, the matrix product operations of the inverse transforms in VC-1 and AVS are efficiently decomposed to use only shifters, adders and subtractors. All the computations are verified and designed using a hardware unit to achieve a low-cost hardware kernel. The proposed multiple-transform architecture contains fast 1-D transforms and rounding operations for the computation of H.264/AVC, VC-1 and AVS 8-and 4-point inverse transforms. Simulation results show that the total number of gates for the proposed architecture is 8983, which is much lower than that required for architectures without hardware sharing. Compared with individual designs, the proposed shared architecture reduces the number of logic gates by a factor of two with a penalty of 20% in data throughput.

show abstract

Section: Simulation Results and Comparisonmentioning

confidence: 92%

Section: Simulation Results and Comparisonmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Efficient inverse transform architectures for multi-standard video coding applications

Chao

Kao

Liu

et al. 2012

IET Image Processing

View full text Add to dashboard Cite

show abstract

“…Other 8 Â 8 transform and quantization designs implemented on FPGAs have been proposed such as the configurable forward and inverse architecture in [15], the simplified forward 8Â 8 transform and quantization architecture, which is capable of processing 64 data/cycle in [16] and its corresponding IP-block in [17], the reduced hardware architecture in [18], which processes pixel by pixel and where the quantization is done without a real multiplier, and the integrated solution in FPGA to perform all transforms and the quantization, which supports luma and chroma for intra-or inter-configurations in [19]. It is worth mentioning other implementations such as the one based on a VLIW+SIMD architecture [20], the unified video CODEC for standards JPEG, MPEG-1/2/4, H.264 and VC-1 [21] or the hardware-sharing designs for the standards H.264 and AVS (developed in China) [22].…”

Section: Introductionmentioning

confidence: 99%

A high-throughput ASIC processor for 8×8 transform coding in H.264/AVC

Michell

Solana

Ruiz

2011

Signal Processing: Image Communication

View full text Add to dashboard Cite

“…For decoder use only, [4] [7] implemented a transform processor for 8x8, 4x4 inverse integer transforms, and 4x4 Hadamard transform. 2x2 Hadamard transform even has been embedded into a common architecture in [3] [10]. In [8], an unique kernel for multi-standard video encoder transforms is presented.…”

Section: Introductionmentioning

confidence: 99%

An unified architecture of all transforms for H.264/AVC codec

Chen

Shie

et al. 2010

2010 International Symposium on Computer, Communication, Control and Automation (3CA)

View full text Add to dashboard Cite

Abstract²In this paper, an unified hardware architecture for the complete set of transforms in H.264/AVC codec is presented. This architecture has been mapped into 2-D 4x4 forward/inverse transforms, 2-D 4x4/2x2 Hadamard transforms, and 1-D 8x8 forward/inverse transforms resulting in 31 sub/adders, 7 adders, 6 subtractors, 34 shifter, 4 multiplexer, and 16 registers. The architecture calculates 16 inputs and 8 outputs in parallel for 4x4 integer forward/inverse transforms, and 8 inputs and 8 outputs in parallel for 8x8 integer forward/inverse transforms by our proposed fast 4-step process. The register array is not necessary for transpose operations of 4x4 forward/inverse and 4x4/2x2 Hadamard transforms. With 8 pixels/cycle throughput, the proposed design can complete the computation in 50 clock cycles with 8x8 and 4x4 transforms for one macroblock in 4:2:0 format.

show abstract

Low-Cost Hardware-Sharing Architecture of Fast 1-D Inverse Transforms for H.264/AVC and AVS Applications

Cited by 20 publications

References 7 publications

Efficient inverse transform architectures for multi-standard video coding applications

Efficient inverse transform architectures for multi-standard video coding applications

A high-throughput ASIC processor for 8×8 transform coding in H.264/AVC

An unified architecture of all transforms for H.264/AVC codec

Contact Info

Product

Resources

About