M Lui, Y Yetim, O Ozkan, Z Zhao, SY Tsai… - … Analysis of Systems …, 2021 - computer.org
Deep learning recommendation models have grown to the terabyte scale. Traditional
serving schemes-that load entire models to a single server-are unable to support this scale …