Differentially private linear sketches: Efficient implementations and applications

F Zhao, D Qiao, R Redberg… - Advances in …, 2022 - proceedings.neurips.cc
Linear sketches have been widely adopted to process fast data streams, and they can be
used to accurately answer frequency estimation, approximate top K items, and summarize …

Panakos: Chasing the tails for multidimensional data streams

F Zhao, PI Khan, D Agrawal, AE Abbadi… - Proceedings of the …, 2023 - dl.acm.org
System operators are often interested in extracting different feature streams from multi-
dimensional data streams; and reporting their distributions at regular intervals, including the …

Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams

R Shahout, M Mitzenmacher - arXiv preprint arXiv:2406.16270, 2024 - arxiv.org
Identifying heavy hitters and estimating the frequencies of flows are fundamental tasks in
various network domains. Existing approaches to this challenge can broadly be categorized …

TreeSensing: Linearly Compressing Sketches with Flexibility

Z Liu, Y Zhang, Y Zhu, R Zhang, T Yang, K Xie… - Proceedings of the …, 2023 - dl.acm.org
A Sketch is an excellent probabilistic data structure, which records the approximate statistics
of data streams. Linear additivity is an important property of sketches. This paper studies …

Autumn: A Scalable Read Optimized LSM-tree based Key-Value Stores with Fast Point and Range Read Speed

F Zhao, L Reznikov, D Agrawal, AE Abbadi - arXiv preprint arXiv …, 2023 - arxiv.org
The Log Structured Merge Trees (LSM-tree) based key-value stores are widely used in
many storage systems to support a variety of operations such as updates, point reads, and …

A Detailed Analysis of the SpaceSaving Family of Algorithms with Bounded Deletions

F Zhao, D Agrawal, AE Abbadi, C Mathieu… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we present an advanced analysis of near optimal deterministic algorithms
using a small space budget to solve the frequency estimation, heavy hitters, frequent items …

Errata for" SpaceSaving±: An Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model"

F Zhao, D Agrawal, AE Abbadi, A Metwally… - Proceedings of the …, 2023 - dl.acm.org
This errata article points out an implicit assumption in the work of four of us published in
VLDB 2022. The SpaceSaving±algorithm in bounded deletion data stream presented in the …

Scalable Overspeed Item Detection in Streams

Y Wu, H Wu, C Jia, B Peng, Z Zhang… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
In data stream mining, monitoring high-speed users and segregating their excessive use,
known as “Overspeed items,” is crucial for preventing system overload and maintaining …

Sampling Space-Saving Set Sketches

HK Lee, C Masson - arXiv preprint arXiv:2402.08604, 2024 - arxiv.org
Large, distributed data streams are now ubiquitous. High-accuracy sketches with low
memory overhead have become the de facto method for analyzing this data. For instance, if …

Approaching 100% Confidence in Stream Summary through ReliableSketch

Y Wu, H Wu, X Liu, Y Zhao, T Yang, K Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
To approximate sums of values in key-value data streams, sketches are widely used in
databases and networking systems. They offer high-confidence approximations for any …