Proceedings of the 2011 Conference on Design &Amp; Architectures for Signal &Amp; Image Processing (DASIP) 2011
DOI: 10.1109/dasip.2011.6136860
|View full text |Cite
|
Sign up to set email alerts
|

An efficient parallel motion estimation algorithm and X264 parallelization in CUDA

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 5 publications
0
7
0
Order By: Relevance
“…Such works clearly highlight that, when compared to the fully accelerated version of x264 or WebM VP8, the known collaborative GPU-based techniques do not exhibit significant compute performance improvements [21]. For instance, [20] reports a speed improvement of only 20% with respect to x264.…”
Section: Introductionmentioning
confidence: 95%
See 3 more Smart Citations
“…Such works clearly highlight that, when compared to the fully accelerated version of x264 or WebM VP8, the known collaborative GPU-based techniques do not exhibit significant compute performance improvements [21]. For instance, [20] reports a speed improvement of only 20% with respect to x264.…”
Section: Introductionmentioning
confidence: 95%
“…In particular, the open source X264 [8] and WebM [2] projects, implementing the widely used H.264 and VP8/VP9 schemes, respectively, can obtain an impressive compute performance. For instance, X264 can achieve up to a ×1000 speed-up compared to the JM H.264 standard reference encoder [20,23]. Such results have been obtained by thoroughly exploiting the optimized SSE/MMX/AVX assembly instruction set, which enables instruction-level parallelism (ILP) on register vectors up to 512-bit-wide.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…There are also some approaches that accelerate codec execution by using multicore CPUs or both CPUs and GPUs [14]. They effectively run most codec functions, but not the EC which is known to be hard to parallelize.…”
Section: Acceleration Techniquesmentioning
confidence: 99%