2021
DOI: 10.1007/s11554-021-01187-8
|View full text |Cite
|
Sign up to set email alerts
|

Real-time optical flow processing on embedded GPU: an hardware-aware algorithm to implementation strategy

Abstract: Determining the optical flow of a video is a compute-intensive task essential for computer vision. For achieving this processing in real-time, the whole algorithm deployment chain must be thought of for efficiency first. The development is usually divided into two parts: first, designing an algorithm that meets precision constraints, then, implementing and optimizing its execution on the targeted platform. We argue that unifying those operations enhances performance on the embedded processor.This paper is base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 27 publications
0
0
0
Order By: Relevance
“…A high-performance embedded processing platform based on a graphics processing unit (GPU), DSP, and FPGA has become the potential solution for onboard image processing [31][32][33][34]. As the specialized image processor, the embedded GPU processing platform [35] has been widely used in unmanned driving technology, AI computation, and video image processing. Its parallel processing capability supports it in handling complex data and geometry computing [36].…”
Section: Related Workmentioning
confidence: 99%
“…A high-performance embedded processing platform based on a graphics processing unit (GPU), DSP, and FPGA has become the potential solution for onboard image processing [31][32][33][34]. As the specialized image processor, the embedded GPU processing platform [35] has been widely used in unmanned driving technology, AI computation, and video image processing. Its parallel processing capability supports it in handling complex data and geometry computing [36].…”
Section: Related Workmentioning
confidence: 99%
“…The key to implementing the APSVD is the exact eigendecomposition, which can be achieved using the Symmetric Eigenvalue Divide (SYEVD) function based on QR decomposition or the Symmetric Eigenvalue Jacobi (SYEVJ) function based on Jacobi decomposition [46]. SYEVD employs a divide-and-conquer method to decompose a symmetric matrix into smaller sub-problems and solves them recursively.…”
Section: Apsvd Using Cudamentioning
confidence: 99%
“…To quantitatively analyze both methods, we introduce arithmetic intensity [46] which is a metric used to evaluate the performance of parallel computational tasks. Specifically, the arithmetic intensity I is defined as…”
Section: Apsvd Using Cudamentioning
confidence: 99%
“…Recent works present real-time implementation of HORN & SCHUNCK and other more recent optical flow algorithms on GPU [4] and on FPGA [5], but not on CPU. Indeed, these are more often used as a comparison reference for other architectures and the optimizations details are not described [6].…”
Section: Related Workmentioning
confidence: 99%