Accelerating recommendation system training by leveraging popular choices

M Adnan, YE Maboud, D Mahajan, PJ Nair - arXiv preprint arXiv …, 2021 - arxiv.org
Recommender models are commonly used to suggest relevant items to a user for e-
commerce and online advertisement-based applications. These models use massive …

Merlin hugeCTR: GPU-accelerated recommender system training and inference

Z Wang, Y Wei, M Lee, M Langer, F Yu, J Liu… - Proceedings of the 16th …, 2022 - dl.acm.org
In this talk, we introduce Merlin HugeCTR. Merlin HugeCTR is an open source, GPU-
accelerated integration framework for click-through rate estimation. It optimizes both training …

PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems

Y Zhang, L Chen, S Yang, M Yuan, H Yi… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
The development of personalized recommendation has significantly improved the accuracy
of information matching and the revenue of e-commerce platforms. Recently, it has two …

A GPU-specialized inference parameter server for large-scale deep recommendation models

Y Wei, M Langer, F Yu, M Lee, J Liu, J Shi… - Proceedings of the 16th …, 2022 - dl.acm.org
Recommendation systems are of crucial importance for a variety of modern apps and web
services, such as news feeds, social networks, e-commerce, search, etc. To achieve peak …

Heterogeneous acceleration pipeline for recommendation system training

M Adnan, YE Maboud, D Mahajan… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Recommendation models rely on deep learning networks and large embedding tables,
resulting in computationally and memory-intensive processes. These models are typically …

cDLRM: Look ahead caching for scalable training of recommendation models

K Balasubramanian, A Alshabanah, JD Choe… - Proceedings of the 15th …, 2021 - dl.acm.org
Deep learning recommendation models (DLRMs) are typically composed of two sets of
parameters: large embedding tables to handle sparse categorical inputs, and neural …

Learnable embedding sizes for recommender systems

S Liu, C Gao, Y Chen, D Jin, Y Li - arXiv preprint arXiv:2101.07577, 2021 - arxiv.org
The embedding-based representation learning is commonly used in deep learning
recommendation models to map the raw sparse features to dense vectors. The traditional …

Persia: An open, hybrid system scaling deep learning-based recommenders up to 100 trillion parameters

X Lian, B Yuan, X Zhu, Y Wang, Y He, H Wu… - Proceedings of the 28th …, 2022 - dl.acm.org
Recent years have witnessed an exponential growth of model scale in deep learning-based
recommender systems---from Google's 2016 model with 1 billion parameters to the latest …

Monolith: real time recommendation system with collisionless embedding table

Z Liu, L Zou, X Zou, C Wang, B Zhang, D Tang… - arXiv preprint arXiv …, 2022 - arxiv.org
Building a scalable and real-time recommendation system is vital for many businesses
driven by time-sensitive customer feedback, such as short-videos ranking or online ads …

EL-Rec: Efficient large-scale recommendation model training via tensor-train embedding table

Z Wang, Y Wang, B Feng, D Mudigere… - … Conference for High …, 2022 - ieeexplore.ieee.org
Deep learning Recommendation Models (DLRMs) plays an important role in various
application domains. However, existing DLRM training systems require a large number of …