A large-scale analysis of hundreds of in-memory key-value cache clusters at twitter

J Yang, Y Yue, KV Rashmi - ACM Transactions on Storage (TOS), 2021 - dl.acm.org
Modern web services use in-memory caching extensively to increase throughput and reduce
latency. There have been several workload analyses of production systems that have fueled …

Serving {DNNs} like clockwork: Performance predictability from the bottom up

A Gujarati, R Karimi, S Alzayat, W Hao… - … USENIX Symposium on …, 2020 - usenix.org
Machine learning inference is becoming a core building block for interactive web
applications. As a result, the underlying model serving systems on which these applications …

Hemem: Scalable tiered memory management for big data applications and real nvm

A Raybuck, T Stamler, W Zhang, M Erez… - Proceedings of the ACM …, 2021 - dl.acm.org
High-capacity non-volatile memory (NVM) is a new main memory tier. Tiered DRAM+ NVM
servers increase total memory capacity by up to 8x, but can diminish memory bandwidth by …

The role of caching in future communication systems and networks

GS Paschos, G Iosifidis, M Tao… - IEEE Journal on …, 2018 - ieeexplore.ieee.org
This paper has the following ambitious goal: to convince the reader that content caching is
an exciting research topic for the future communication systems and networks. Caching has …

Attention-weighted federated deep reinforcement learning for device-to-device assisted heterogeneous collaborative edge caching

X Wang, R Li, C Wang, X Li, T Taleb… - IEEE Journal on …, 2020 - ieeexplore.ieee.org
In order to meet the growing demands for multimedia service access and release the
pressure of the core network, edge caching and device-to-device (D2D) communication …

Deep learning for edge computing applications: A state-of-the-art survey

F Wang, M Zhang, X Wang, X Ma, J Liu - IEEE Access, 2020 - ieeexplore.ieee.org
With the booming development of Internet-of-Things (IoT) and communication technologies
such as 5G, our future world is envisioned as an interconnected entity where billions of …

Nitrosketch: Robust and general sketch-based monitoring in software switches

Z Liu, R Ben-Basat, G Einziger, Y Kassner… - Proceedings of the …, 2019 - dl.acm.org
Software switches are emerging as a vital measurement vantage point in many networked
systems. Sketching algorithms or sketches, provide high-fidelity approximate measurements …

f4: Facebook's warm {BLOB} storage system

S Muralidhar, W Lloyd, S Roy, C Hill, E Lin… - … USENIX Symposium on …, 2014 - usenix.org
Facebook's corpus of photos, videos, and other Binary Large OBjects (BLOBs) that need to
be reliably stored and quickly accessible is massive and continues to grow. As the footprint …

FIFO queues are all you need for cache eviction

J Yang, Y Zhang, Z Qiu, Y Yue, R Vinayak - Proceedings of the 29th …, 2023 - dl.acm.org
As a cache eviction algorithm, FIFO has a lot of attractive properties, such as simplicity,
speed, scalability, and flash-friendliness. The most prominent criticism of FIFO is its low …

Intelligent video caching at network edge: A multi-agent deep reinforcement learning approach

F Wang, F Wang, J Liu, R Shea… - IEEE INFOCOM 2020 …, 2020 - ieeexplore.ieee.org
Today's explosively growing Internet video traffics and viewers' ever-increasing quality of
experience (QoE) demands for video streaming bring tremendous pressures to the …