Proceedings of the 24th ACM International Conference on Supercomputing 2010
DOI: 10.1145/1810085.1810102
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of parallel H.264 decoding strategies for the Cell Broadband Engine

Abstract: How to develop efficient and scalable parallel applications is the key challenge for emerging many-core architectures. We investigate this question by implementing and comparing two parallel H.264 decoders on the Cell architecture. It is expected that future many-cores will use a Cell-like local store memory hierarchy, rather than a non-scalable shared memory. The two implemented parallel algorithms, the Task Pool (TP) and the novel Ring-Line (RL) approach, both exploit macroblock-level parallelism. The TP imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 15 publications
0
8
0
Order By: Relevance
“…This approach is promising due to the abundant parallelism and low synchronization overhead. In our previous work [7] a variant of the "single row" approach with distributed control was implemented on the Cell processor. By exploiting the Cell memory hierarchy a scalability was achieved that approached the theoretical limit.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…This approach is promising due to the abundant parallelism and low synchronization overhead. In our previous work [7] a variant of the "single row" approach with distributed control was implemented on the Cell processor. By exploiting the Cell memory hierarchy a scalability was achieved that approached the theoretical limit.…”
Section: Related Workmentioning
confidence: 99%
“…By exploiting the Cell memory hierarchy a scalability was achieved that approached the theoretical limit. In most of these works (e.g., [1,3,5,7,16,20,23]), the entropy decoding was not considered or mapped on a single core, which causes a scalability bottleneck.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…They use a master-worker programming model similar to ours. Meenderick et al also investigate different scheduling policies for macroblock-level parallelism, including a static scheduling approach to improve locality [8]. In contrast, we cannot exploit locality in many cases due to local store size limitations.…”
Section: F Programming Effortmentioning
confidence: 99%
“…Prior work has extensively analyzed the performance of video decoding on Cell [7], [8]. Video encoding is significantly more challenging than video decoding due to higher memory requirements and dynamic behavior.…”
Section: Introductionmentioning
confidence: 99%