2021
DOI: 10.3390/app11041438
|View full text |Cite
|
Sign up to set email alerts
|

GPU-Enabled Serverless Workflows for Efficient Multimedia Processing

Abstract: Serverless computing has introduced scalable event-driven processing in Cloud infrastructures. However, it is not trivial for multimedia processing to benefit from the elastic capabilities featured by serverless applications. To this aim, this paper introduces the evolution of a framework to support the execution of customized runtime environments in AWS Lambda in order to accommodate workloads that do not satisfy its strict computational requirements: increased execution times and the ability to use GPU-based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 15 publications
(16 reference statements)
0
7
0
Order By: Relevance
“…For certain application scenarios, serving all the queries using expensive hardware accelerators may not be economically viable (e.g., ML inference queries). For such scenarios, hybrid approaches to opportunistically serve the incoming requests using a mix of GPU-enabled instances and traditional CPU-only instances, could be explored [138], [103]. Research eforts are also directed towards highlighting the adaptability of quantum computing in the design of serverless systems [54], [55].…”
Section: Runtime Resource Limitationsmentioning
confidence: 99%
“…For certain application scenarios, serving all the queries using expensive hardware accelerators may not be economically viable (e.g., ML inference queries). For such scenarios, hybrid approaches to opportunistically serve the incoming requests using a mix of GPU-enabled instances and traditional CPU-only instances, could be explored [138], [103]. Research eforts are also directed towards highlighting the adaptability of quantum computing in the design of serverless systems [54], [55].…”
Section: Runtime Resource Limitationsmentioning
confidence: 99%
“…It achieves 7.9 times lower latency and 17.2 times cost reduction on average compared to that of serverful alternatives. In [126] GPU processing power is harnessed in a serverless setting for video processing. Zhang et al, [168] present a measurement study to extract contributing factors such as the execution duration and monetary cost of serverless video processing approaches.…”
Section: Video Processing and Streamingmentioning
confidence: 99%
“…For those applications that do not fit within AWS Lambda's computing requirements, SCAR provides a seamless integration with AWS Batch [5] an elastic-cluster as a service offering by AWS which dynamically deploys a cluster in charge of executing jobs packaged as a Docker images and which can grow and shrink depending on the number of jobs queued up at the Local Resource Management System (LRMS). This integration allows to delegate into AWS Batch functions invocations that require longer execution times, larger amount of memory or even GPU resources for accelerated execution, as described in the work by Risco et al [56].…”
Section: Scar: Serverless Scientific Computing In Public Cloudsmentioning
confidence: 99%