The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2012
DOI: 10.1007/s10766-012-0216-7
|View full text |Cite
|
Sign up to set email alerts
|

Parallelization of Full Search Motion Estimation Algorithm for Parallel and Distributed Platforms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 19 publications
(16 citation statements)
references
References 9 publications
0
16
0
Order By: Relevance
“…Parallelizing DCT with SIMD instructions was one of the targets of [4]. In [5] a combined CPU-GPUs approach for parallel motion estimation is presented, while [6] includes a comparative study between motion estimation parallelism using CUDA cores, MPI and OpenMP. Such works are rather orthogonal to ours, since they tackle parallelism at a level lower than tiles, thus, can be (in principle) incorporated into tile parallelism approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Parallelizing DCT with SIMD instructions was one of the targets of [4]. In [5] a combined CPU-GPUs approach for parallel motion estimation is presented, while [6] includes a comparative study between motion estimation parallelism using CUDA cores, MPI and OpenMP. Such works are rather orthogonal to ours, since they tackle parallelism at a level lower than tiles, thus, can be (in principle) incorporated into tile parallelism approaches.…”
Section: Related Workmentioning
confidence: 99%
“…This scheme enabled concurrent deblocking filtering with limited synchronization effort, independently of slice configuration. Several works have focused on the use of GPU to accelerate the ME process for H.264/AVC [9][10][11][12][13][14]. Most GPUbased ME algorithms employ the full-search method because it is suitable for the SIMD (single instruction and multiple data) architecture of GPU.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, the frame-level parallel encoding technique on multiple CPUs is used to improve the overall throughput. Monteiro et al developed an ME algorithm in three kinds of platforms: multi-core general purposed processor, cluster/grid machines using message passing interface and GPU [14]. Although the GPU-based ME achieves significant speed-ups against the other platforms, only integer-pel ME and no SCP are considered.…”
Section: Introductionmentioning
confidence: 99%
“…As a result, the proposed algorithm provides tremendous speedup if implemented on modern high performance computing (HPC) platforms ranging from multicore/many-core machine architectures to graphics processing units to supercomputers. In the literature, there have been several attempts to parallelize motion estimation [37][38][39][40][41][42]. Several works have proposed applying GPUs for motion estimation.…”
Section: Introductionmentioning
confidence: 99%
“…In [39], implementations of the ES algorithm, the diamond search (DS) algorithm, and the four-step search (4SS) algorithms in CUDA have been proposed. A parallel implementation of the ES algorithm on the GPU using CUDA is also proposed in [40,41] along with a parallel solution for multi-core processors using the Open Message Passing (OpenMP) library and a distributed solution for cluster/grid machines using the Message Passing Interface (MPI) library [41]. GPU-based hierarchical motion estimation in CUDA has been proposed in [42].…”
Section: Introductionmentioning
confidence: 99%