The architectural implications of facebook's dnn-based personalized recommendation

U Gupta, CJ Wu, X Wang, M Naumov… - … Symposium on High …, 2020 - ieeexplore.ieee.org
The widespread application of deep learning has changed the landscape of computation in
data centers. In particular, personalized recommendation for content ranking is now largely …

Deep learning training in facebook data centers: Design of scale-up and scale-out systems

M Naumov, J Kim, D Mudigere, S Sridharan… - arXiv preprint arXiv …, 2020 - arxiv.org
Large-scale training is important to ensure high performance and accuracy of machine-
learning models. At Facebook we use many different models, including computer vision …

Recnmp: Accelerating personalized recommendation with near-memory processing

L Ke, U Gupta, BY Cho, D Brooks… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Personalized recommendation systems leverage deep learning models and account for the
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …

Deeprecsys: A system for optimizing end-to-end at-scale neural recommendation inference

U Gupta, S Hsia, V Saraph, X Wang… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Neural personalized recommendation is the cornerstone of a wide collection of cloud
services and products, constituting significant compute demand of cloud infrastructure. Thus …

Understanding data storage and ingestion for large-scale deep recommendation model training: Industrial product

M Zhao, N Agarwal, A Basant, B Gedik, S Pan… - Proceedings of the 49th …, 2022 - dl.acm.org
Datacenter-scale AI training clusters consisting of thousands of domain-specific accelerators
(DSA) are used to train increasingly-complex deep learning models. These clusters rely on a …

Understanding capacity-driven scale-out neural recommendation inference

M Lui, Y Yetim, Ö Özkan, Z Zhao… - … Analysis of Systems …, 2021 - ieeexplore.ieee.org
Deep learning recommendation models have grown to the terabyte scale. Traditional
serving schemes-that load entire models to a single server-are unable to support this scale …

Machine learning at facebook: Understanding inference at the edge

CJ Wu, D Brooks, K Chen, D Chen… - … symposium on high …, 2019 - ieeexplore.ieee.org
At Facebook, machine learning provides a wide range of capabilities that drive many
aspects of user experience including ranking posts, content understanding, object detection …

Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications

J Park, M Naumov, P Basu, S Deng, A Kalaiah… - arXiv preprint arXiv …, 2018 - arxiv.org
The application of deep learning techniques resulted in remarkable improvement of
machine learning models. In this paper provides detailed characterizations of deep learning …

Applied machine learning at facebook: A datacenter infrastructure perspective

K Hazelwood, S Bird, D Brooks… - … symposium on high …, 2018 - ieeexplore.ieee.org
Machine learning sits at the core of many essential products and services at Facebook. This
paper describes the hardware and software infrastructure that supports machine learning at …

Tensor casting: Co-designing algorithm-architecture for personalized recommendation training

Y Kwon, Y Lee, M Rhu - 2021 IEEE International Symposium …, 2021 - ieeexplore.ieee.org
Personalized recommendations are one of the most widely deployed machine learning (ML)
workload serviced from cloud datacenters. As such, architectural solutions for high …