A comprehensive survey on internet outages

G Aceto, A Botta, P Marchetta, V Persico… - Journal of Network and …, 2018 - Elsevier
Internet outages are inevitable, frequent, opaque, and expensive. To make things worse,
they are poorly understood, while a deep understanding of them is essential for …

Flow event telemetry on programmable data plane

Y Zhou, C Sun, HH Liu, R Miao, S Bai, B Li… - Proceedings of the …, 2020 - dl.acm.org
Network performance anomalies (NPAs), e.g., long-tailed latency, bandwidth decline, etc., are
increasingly crucial to cloud providers as applications are getting more sensitive to …

Automatic test packet generation

H Zeng, P Kazemian, G Varghese… - Proceedings of the 8th …, 2012 - dl.acm.org
Networks are getting larger and more complex; yet administrators rely on rudimentary tools
such as ping and traceroute to debug problems. We propose an automated and systematic …

NetBouncer: Active device and link failure localization in data center networks

C Tan, Z Jin, C Guo, T Zhang, H Wu, K Deng… - … USENIX Symposium on …, 2019 - usenix.org
The availability of data center services is jeopardized by various network incidents. One of
the biggest challenges for network incident handling is to accurately localize the failures …

California fault lines: understanding the causes and impact of network failures

D Turner, K Levchenko, AC Snoeren… - Proceedings of the ACM …, 2010 - dl.acm.org
Of the major factors affecting end-to-end service availability, network component failure is
perhaps the least well understood. How often do failures occur, how long do they last, what …

007: Democratically finding the cause of packet drops

B Arzani, S Ciraci, L Chamon, Y Zhu, HH Liu… - … USENIX Symposium on …, 2018 - usenix.org
Network failures continue to plague datacenter operators as their symptoms may not have
direct correlation with where or why they occur. We introduce 007, a lightweight, always-on …

SketchINT: Empowering INT with TowerSketch for per-flow per-switch measurement

K Yang, S Long, Q Shi, Y Li, Z Liu, Y Wu… - … on Parallel and …, 2023 - ieeexplore.ieee.org
Network measurement is indispensable to network operations. INT solutions that can
provide fine-grained per-switch per-packet information serve as promising solutions for per …

LightNestle: quick and accurate neural sequential tensor completion via meta learning

Y Li, W Liang, K Xie, D Zhang, S Xie… - IEEE INFOCOM 2023 …, 2023 - ieeexplore.ieee.org
Network operation and maintenance rely heavily on network traffic monitoring. Due to the
measurement overhead reduction, lack of measurement infrastructure, and unexpected …

On the origin of scanning: The impact of location on internet-wide scans

G Wan, L Izhikevich, D Adrian, K Yoshioka… - Proceedings of the …, 2020 - dl.acm.org
Fast IPv4 scanning has enabled researchers to answer a wealth of security and networking
questions. Yet, despite widespread use, there has been little validation of the methodology's …

Trinocular: Understanding internet reliability through adaptive probing

L Quan, J Heidemann, Y Pradkin - ACM SIGCOMM Computer …, 2013 - dl.acm.org
Natural and human factors cause Internet outages, from big events like Hurricane Sandy in
2012 and the Egyptian Internet shutdown in Jan. 2011 to small outages every day that go …