Lower bounds for pseudo-deterministic counting in a stream

V Braverman, R Krauthgamer, A Krishnan… - arXiv preprint arXiv …, 2023 - arxiv.org
arXiv preprint arXiv:2303.16287, 2023arxiv.org
Many streaming algorithms provide only a high-probability relative approximation. These
two relaxations, of allowing approximation and randomization, seem necessary--for many
streaming problems, both relaxations must be employed simultaneously, to avoid an
exponentially larger (and often trivial) space complexity. A common drawback of these
randomized approximate algorithms is that independent executions on the same input have
different outputs, that depend on their random coins. Pseudo-deterministic algorithms …
Many streaming algorithms provide only a high-probability relative approximation. These two relaxations, of allowing approximation and randomization, seem necessary -- for many streaming problems, both relaxations must be employed simultaneously, to avoid an exponentially larger (and often trivial) space complexity. A common drawback of these randomized approximate algorithms is that independent executions on the same input have different outputs, that depend on their random coins. Pseudo-deterministic algorithms combat this issue, and for every input, they output with high probability the same ``canonical'' solution. We consider perhaps the most basic problem in data streams, of counting the number of items in a stream of length at most . Morris's counter [CACM, 1978] is a randomized approximation algorithm for this problem that uses bits of space, for every fixed approximation factor (greater than ). Goldwasser, Grossman, Mohanty and Woodruff [ITCS 2020] asked whether pseudo-deterministic approximation algorithms can match this space complexity. Our main result answers their question negatively, and shows that such algorithms must use bits of space. Our approach is based on a problem that we call Shift Finding, and may be of independent interest. In this problem, one has query access to a shifted version of a known string , which is guaranteed to start with zeros and end with ones, and the goal is to find the unknown shift using a small number of queries. We provide for this problem an algorithm that uses queries. It remains open whether queries suffice; if true, then our techniques immediately imply a nearly-tight space bound for pseudo-deterministic approximate counting.
arxiv.org
以上显示的是最相近的搜索结果。 查看全部搜索结果