Proceedings of the 49th Annual International Symposium on Computer Architecture 2022
DOI: 10.1145/3470496.3533044
Understanding data storage and ingestion for large-scale deep recommendation model training

Abstract: Datacenter-scale AI training clusters consisting of thousands of domain-specific accelerators (DSA) are used to train increasingly complex deep learning models. These clusters rely on a data storage and ingestion (DSI) pipeline, responsible for storing exabytes of training data and serving it at tens of terabytes per second. As DSAs continue to push training efficiency and throughput, the DSI pipeline is becoming the dominating factor that constrains the overall training performance and capacity. Innovations th…

Cited by 31 publications (7 citation statements)
References 48 publications
“…Given a model, its architecture and algebraic computations are fixed; we also know which LA operators are affected by factorization [37]. To examine the relative speedup of factorization, we mainly need to inspect data redundancy [35], [37], and the interactions between physical data transfers (e.g., network and memory bandwidth) [73]. Existing solutions.…”
Section: B. Cost Estimation Challenge: To Factorize or To Materialize
confidence: 99%
“…The current erbium implementation uses dictionary encoding to reduce both the storage requirement and the online data movement. Therefore, queries must be encoded before being sent to the accelerators, just like data quantisation and normalisation in machine learning pipelines [33]. This process is carried out individually at the worker level in a pipeline manner, while the previous query batch is being executed by the FPGA kernel.…”
Section: Setup
confidence: 99%
“…This imbalance is pervasive and often quite large. For instance, a recent study by Meta of their ML pipelines [33] shows that GPUs used for training ML models are stalled up to 56 % of the time waiting for input data. They also show the increasing amount of compute power, network, and memory bandwidth needed on the CPU side to be able to match the throughput of the accelerator.…”
Section: Introduction
confidence: 99%
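The 56 % stall figure above means that, in the worst case, the accelerator does useful work at most 44 % of the time. A back-of-the-envelope sketch (with illustrative numbers, not figures from the cited study) shows how the stall fraction translates into effective throughput and the ingestion bandwidth needed to avoid stalls:

```python
# Back-of-the-envelope: how input-pipeline stalls cap training throughput.
# The sample rate and sample size below are illustrative assumptions.

def effective_throughput(peak_samples_per_s: float, stall_fraction: float) -> float:
    """Throughput after accounting for time stalled waiting on input data."""
    return peak_samples_per_s * (1.0 - stall_fraction)

def required_bandwidth_gbps(samples_per_s: float, bytes_per_sample: int) -> float:
    """Ingestion bandwidth (GB/s) needed to feed the accelerator without stalls."""
    return samples_per_s * bytes_per_sample / 1e9

peak = 1_000_000  # samples/s the accelerator could consume at full utilization
print(round(effective_throughput(peak, 0.56)))   # 440000 samples/s actually trained
print(required_bandwidth_gbps(peak, 4_000))      # 4.0 GB/s to keep the GPU fed
```

This is the imbalance the excerpt points at: as accelerator throughput grows, the CPU-side compute, network, and memory bandwidth must scale proportionally or the stall fraction rises.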
“…Thus, defined in the field of innovation performance more precisely forecasts the adoption of new products, leading to the development of innovation (domain-specific innovativeness) in specific areas, using innovative specific areas to forecast consumers' specific interest in the field of new products early adoption behavior and attitude [45]. Researchers worldwide have been adapting this model's structure to new fields and products in recent years [46][47][48][49][50][51]. However, many studies look at the product side of things, concentrating on the early degree of customers' adoption of new items while disregarding that some people only pay attention to the information about new products but do not necessarily buy them.…”
Section: Domain-Specific Innovation (DSI)
confidence: 99%