GPU-Enabled Serverless Workflows for Efficient Multimedia Processing

Risco, Sebastián; Moltó, Germán

doi:10.3390/app11041438

Cited by 13 publications

(10 citation statements)

References 15 publications

(16 reference statements)

“…For certain application scenarios, serving all the queries using expensive hardware accelerators may not be economically viable (e.g., ML inference queries). For such scenarios, hybrid approaches to opportunistically serve the incoming requests using a mix of GPU-enabled instances and traditional CPU-only instances, could be explored [138], [103]. Research efforts are also directed towards highlighting the adaptability of quantum computing in the design of serverless systems [54], [55].…”

Section: Runtime Resource Limitations (mentioning)

confidence: 99%

A Holistic View on Resource Management in Serverless Computing Environments: Taxonomy and Future Directions

2022

Serverless computing has emerged as an attractive deployment option for cloud applications in recent times. The unique features of this computing model include rapid auto-scaling, strong isolation, fine-grained billing options and access to a massive service ecosystem which autonomously handles resource management decisions. This model is increasingly being explored for deployments in geographically distributed edge and fog computing networks as well, due to these characteristics. Effective management of computing resources has always gained a lot of attention among researchers. The need to automate the entire process of resource provisioning, allocation, scheduling, monitoring and scaling, has resulted in the need for specialized focus on resource management under the serverless model. In this article, we identify the major aspects covering the broader concept of resource management in serverless environments and propose a taxonomy of elements which influence these aspects, encompassing characteristics of system design, workload attributes and stakeholder expectations. We take a holistic view on serverless environments deployed across edge, fog and cloud computing networks. We also analyse existing works discussing aspects of serverless resource management using this taxonomy. This article further identifies gaps in literature and highlights future research directions for improving capabilities of this computing model.

“…It achieves 7.9 times lower latency and 17.2 times cost reduction on average compared to that of serverful alternatives. In [126] GPU processing power is harnessed in a serverless setting for video processing. Zhang et al. [168] present a measurement study to extract contributing factors such as the execution duration and monetary cost of serverless video processing approaches.…”

Section: Video Processing and Streaming (mentioning)

confidence: 99%

Serverless Computing: A Survey of Opportunities, Challenges, and Applications

2022

The emerging serverless computing paradigm has attracted attention from both academia and industry. This paradigm brings benefits such as less operational complexity, a pay-as-you-go pricing model, and an auto-scaling feature. The paradigm opens up new opportunities and challenges for cloud application developers. In this paper, we present a comprehensive overview of the past development as well as the recent advances in research areas related to serverless computing. First, we survey serverless applications introduced in the literature. We categorize applications in eight domains and separately discuss the objectives and the viability of the serverless paradigm along with challenges in each of those domains. We then classify those challenges into nine topics and survey the proposed solutions. Finally, we present the areas that need further attention from the research community and identify open problems.

“…For those applications that do not fit within AWS Lambda's computing requirements, SCAR provides a seamless integration with AWS Batch [5] an elastic-cluster as a service offering by AWS which dynamically deploys a cluster in charge of executing jobs packaged as a Docker images and which can grow and shrink depending on the number of jobs queued up at the Local Resource Management System (LRMS). This integration allows to delegate into AWS Batch functions invocations that require longer execution times, larger amount of memory or even GPU resources for accelerated execution, as described in the work by Risco et al [56].…”

Section: SCAR: Serverless Scientific Computing in Public Clouds (mentioning)

confidence: 99%

Serverless Workflows for Containerised Applications in the Cloud Continuum

et al. 2021

Self Cite

This paper introduces an open-source platform to support serverless computing for scientific data-processing workflow-based applications across the Cloud continuum (i.e. simultaneously involving both on-premises and public Cloud platforms to process data captured at the edge). This is achieved via dynamic resource provisioning for FaaS platforms compatible with scale-to-zero approaches that minimise resource usage and cost for dynamic workloads with different elasticity requirements. The platform combines the usage of dynamically deployed auto-scaled Kubernetes clusters on on-premises Clouds and automated Cloud bursting into AWS Lambda to achieve higher levels of elasticity. A use case in public health for smart cities is used to assess the platform, in charge of detecting people not wearing face masks from captured videos. Faces are blurred for enhanced anonymity in the on-premises Cloud and detection via Deep Learning models is performed in AWS Lambda for this data-driven containerised workflow. The results indicate that hybrid workflows across the Cloud continuum can efficiently perform local data processing for enhanced regulations compliance and perform Cloud bursting for increased levels of elasticity.
