Electrode: Accelerating Distributed Protocols with {eBPF}

Y Zhou, Z Wang, S Dharanipragada, M Yu - 20th USENIX Symposium …, 2023 - usenix.org
Implementing distributed protocols under a standard Linux kernel networking stack enjoys
the benefits of load-aware CPU scaling, high compatibility, and robust security and isolation …

A cloud-scale characterization of remote procedure calls

K Seemakhupt, BE Stephens, S Khan, S Liu… - Proceedings of the 29th …, 2023 - dl.acm.org
The global scale and challenging requirements of modern cloud applications have led to the
development of complex, widely distributed, service-oriented applications. One enabler of …

[HTML][HTML] Distributed artificial intelligence: Taxonomy, review, framework, and reference architecture

N Janbi, I Katib, R Mehmood - Intelligent Systems with Applications, 2023 - Elsevier
Artificial intelligence (AI) research and market have grown rapidly in the last few years, and
this trend is expected to continue with many potential advancements and innovations in this …

Dilos: Do not trade compatibility for performance in memory disaggregation

W Yoon, J Ok, J Oh, S Moon, Y Kwon - Proceedings of the Eighteenth …, 2023 - dl.acm.org
Memory disaggregation has replaced the landscape of dat-acenters by physically
separating compute and memory nodes, achieving improved utilization. As early efforts …

Technology trends in large-scale high-efficiency network computing

J Su, B Zhao, Y Dai, J Cao, Z Wei, N Zhao… - Frontiers of Information …, 2022 - Springer
Network technology is the basis for large-scale high-efficiency network computing, such as
supercomputing, cloud computing, big data processing, and artificial intelligence computing …

Poseidon: Efficient, Robust, and Practical Datacenter {CC} via Deployable {INT}

W Wang, M Moshref, Y Li, G Kumar, TSE Ng… - … USENIX Symposium on …, 2023 - usenix.org
The difficulty in gaining visibility into the fine-timescale hop-level congestion state of
networks has been a key challenge faced by congestion control (CC) protocols for decades …

Towards a fully disaggregated and programmable data center

Y Shan, W Lin, Z Guo, Y Zhang - Proceedings of the 13th ACM SIGOPS …, 2022 - dl.acm.org
Today, we are seeing two trends in the data center. On the one hand, applications are
becoming more fine-grained, driven by the recent trend of serverless computing and …

Predictable vFabric on informative data plane

S Wang, K Gao, K Qian, D Li, R Miao, B Li… - Proceedings of the …, 2022 - dl.acm.org
In multi-tenant data centers, each tenant desires reassuring predictability from the virtual
network fabric-bandwidth guarantee, work conservation, and bounded tail latency …

Flor: An open high performance {RDMA} framework over heterogeneous {RNICs}

Q Li, Y Gao, X Wang, H Qiu, Y Le, D Liu… - … USENIX Symposium on …, 2023 - usenix.org
Datacenter applications have been increasingly applying RDMA for the ultra-low latency
and low CPU overhead. However, RDMA-capable NICs (RNICs) of different vendors and …

Green With Envy: Unfair Congestion Control Algorithms Can Be More Energy Efficient

S Arslan, S Renganathan, B Spang - … of the 22nd ACM Workshop on Hot …, 2023 - dl.acm.org
Despite 40 years of active research on congestion control, there has been little or no
consideration of how it impacts the energy usage of end-hosts or networking equipment …