Machine learning inference is becoming a core building block for interactive web applications. As a result, the underlying model serving systems on which these applications …
A Raybuck, T Stamler, W Zhang, M Erez… - Proceedings of the ACM …, 2021 - dl.acm.org
High-capacity non-volatile memory (NVM) is a new main memory tier. Tiered DRAM+ NVM servers increase total memory capacity by up to 8x, but can diminish memory bandwidth by …
This paper has the following ambitious goal: to convince the reader that content caching is an exciting research topic for the future communication systems and networks. Caching has …
In order to meet the growing demands for multimedia service access and release the pressure of the core network, edge caching and device-to-device (D2D) communication …
With the booming development of Internet-of-Things (IoT) and communication technologies such as 5G, our future world is envisioned as an interconnected entity where billions of …
Software switches are emerging as a vital measurement vantage point in many networked systems. Sketching algorithms or sketches, provide high-fidelity approximate measurements …
S Muralidhar, W Lloyd, S Roy, C Hill, E Lin… - … USENIX Symposium on …, 2014 - usenix.org
Facebook's corpus of photos, videos, and other Binary Large OBjects (BLOBs) that need to be reliably stored and quickly accessible is massive and continues to grow. As the footprint …
As a cache eviction algorithm, FIFO has a lot of attractive properties, such as simplicity, speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …
Today's explosively growing Internet video traffics and viewers' ever-increasing quality of experience (QoE) demands for video streaming bring tremendous pressures to the …