2020
DOI: 10.48550/arxiv.2001.02772
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference

Abstract: Neural personalized recommendation is the cornerstone of a wide collection of cloud services and products, constituting significant compute demand of the cloud infrastructure. Thus, improving the execution efficiency of neural recommendation directly translates into infrastructure capacity saving. In this paper, we devise a novel end-to-end modeling infrastructure, DeepRecInfra, that adopts an algorithm and system co-design methodology to custom-design systems for recommendation use cases. Leveraging the insig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
25
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(26 citation statements)
references
References 56 publications
1
25
0
Order By: Relevance
“…Compared with the first configuration, the second has dual AMD Radeon Vega GPUs, providing 4×, 8×, and 16× more flops, memory bandwidth, and capacity, respectively. It represents a data-center-scale server [76], [77], yielding a 2.6× greater manufacturing carbon footprint.…”
Section: Addressing Carbon Footprint Of Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…Compared with the first configuration, the second has dual AMD Radeon Vega GPUs, providing 4×, 8×, and 16× more flops, memory bandwidth, and capacity, respectively. It represents a data-center-scale server [76], [77], yielding a 2.6× greater manufacturing carbon footprint.…”
Section: Addressing Carbon Footprint Of Systemsmentioning
confidence: 99%
“…Optimizing for capex-related emissions requires reducing hardware resources. Recently proposed schedulers optimize infrastructure efficiency in terms of total power consumption while balancing performance for latency-critical and batchprocessing workloads [76], [77], [86]- [88]. Others have proposed novel scheduler designs to enable energy-efficient, collaborative cloud and edge execution [66], [89].…”
Section: Addressing Carbon Footprint Of Systemsmentioning
confidence: 99%
“…In the past decade, deep learning recommendation models have successfully powered various applications and products. For instance, YouTube adopts deep candidate generation and ranking models to deliver video suggestions [1]; Microsoft utilizes deep learning recommendation models for news feed [2]; Alibaba leverages recommendation models for product suggestions [3]; Facebook uses deep learning recommendation models to support personalization and ranking in widespread products [4]- [6]. Recent years, along with the emergence of big data, recommendation models are rapidly expanding their capacity, especially the industry recommendation systems.…”
Section: Introductionmentioning
confidence: 99%
“…Recommender Systems serve to personalize user experience in a variety of applications including predicting click-through rates for ranking advertisements [34], improving search results [30], suggesting friends and content on social networks [30], suggesting food on Uber Eats [53], helping users find houses on Zillow [54], helping contain information overload by suggesting relevant news articles [55], helping users find videos to watch on YouTube [43] and movies on Netflix [59], and several more real-world use cases [60]. An introduction to recommender system technology can be found in [58] and a set of best practices and examples for building recommender systems in [56].…”
Section: Introductionmentioning
confidence: 99%
“…The focus of this paper is recommendation systems that use neural networks, referred to as Deep Learning RecSys, or simply RecSys 1 . These have been recently applied to a variety of areas with success [34] [30].…”
Section: Introductionmentioning
confidence: 99%