Zero-CPU collection with direct telemetry access

J Langlet, R Ben-Basat, S Ramanathan… - Proceedings of the 20th …, 2021 - dl.acm.org
Proceedings of the 20th ACM Workshop on Hot Topics in Networks, 2021dl.acm.org
Programmable switches are driving a massive increase in fine-grained measurements. This
puts significant pressure on telemetry collectors that have to process reports from many
switches. Past research acknowledged this problem by either improving collectors' stack
performance or by limiting the amount of data sent from switches. In this paper, we take a
different and radical approach: switches are responsible for directly inserting queryable
telemetry data into the collectors' memory, bypassing their CPU, and thereby improving their …
Programmable switches are driving a massive increase in fine-grained measurements. This puts significant pressure on telemetry collectors that have to process reports from many switches. Past research acknowledged this problem by either improving collectors' stack performance or by limiting the amount of data sent from switches. In this paper, we take a different and radical approach: switches are responsible for directly inserting queryable telemetry data into the collectors' memory, bypassing their CPU, and thereby improving their collection scalability. We propose to use a method we call direct telemetry access, where switches jointly write telemetry reports directly into the same collector's memory region, without coordination. Our solution, DART, is probabilistic, trading memory redundancy and query success probability for CPU resources at collectors. We prototype DART using commodity hardware such as P4 switches and RDMA NICs and show that we get high query success rates with a reasonable memory overhead. For example, we can collect INT path tracing information on a fat tree topology without a collector's CPU involvement while achieving 99.9% query success probability and using just 300 bytes per flow.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果