Testing configuration changes in context to prevent production failures

X Sun, R Cheng, J Chen, E Ang, O Legunsen… - … USENIX Symposium on …, 2020 - usenix.org
Large-scale cloud services deploy hundreds of configuration changes to production systems
daily. At such velocity, configuration changes have inevitably become prevalent causes of …

Configuration validation with large language models

X Lian, Y Chen, R Cheng, J Huang, P Thakkar… - arXiv preprint arXiv …, 2023 - arxiv.org
Misconfigurations are the major causes of software failures. Existing configuration validation
techniques rely on manually written rules or test cases, which are expensive to implement …

Acto: Automatic end-to-end testing for operation correctness of cloud system management

JT Gu, X Sun, W Zhang, Y Jiang, C Wang… - Proceedings of the 29th …, 2023 - dl.acm.org
Cloud systems are increasingly being managed by operation programs termed operators,
which automate tedious, human-based operations. Operators of modern management …

Static detection of silent misconfigurations with deep interaction analysis

J Zhang, R Piskac, E Zhai, T Xu - Proceedings of the ACM on …, 2021 - dl.acm.org
The behavior of large systems is guided by their configurations: users set parameters in the
configuration file to dictate which corresponding part of the system code is executed …

An evolutionary study of configuration design and implementation in cloud systems

Y Zhang, H He, O Legunsen, S Li… - 2021 IEEE/ACM 43rd …, 2021 - ieeexplore.ieee.org
Many techniques were proposed for detecting software misconfigurations in cloud systems
and for diagnosing unintended behavior caused by such misconfigurations. Detection and …

Test-case prioritization for configuration testing

R Cheng, L Zhang, D Marinov, T Xu - Proceedings of the 30th ACM …, 2021 - dl.acm.org
Configuration changes are among the dominant causes of failures of large-scale software
system deployment. Given the velocity of configuration changes, typically at the scale of …

What constitutes the deployment and runtime configuration system? An empirical study on openstack projects

N Bessghaier, M Sayagh, A Ouni… - ACM Transactions on …, 2023 - dl.acm.org
Modern software systems are designed to be deployed in different configured environments
(eg, permissions, virtual resources, network connections) and adapted at runtime to different …

Fail through the cracks: Cross-system interaction failures in modern cloud systems

L Tang, C Bhandari, Y Zhang, A Karanika, S Ji… - Proceedings of the …, 2023 - dl.acm.org
Modern cloud systems are orchestrations of independent and interacting (sub-) systems,
each specializing in important services (eg, data processing, storage, resource …

Repeated builds during code review: An empirical study of the OpenStack community

R Maipradit, D Wang, P Thongtanunam… - 2023 38th IEEE/ACM …, 2023 - ieeexplore.ieee.org
Code review is a popular practice where developers critique each others' changes. Since
automated builds can identify low-level issues (eg, syntactic errors, regression bugs), it is not …

ECFuzz: Effective Configuration Fuzzing for Large-Scale Systems

J Li, S Li, K Li, F Luo, H Yu, S Li, X Li - Proceedings of the 46th IEEE …, 2024 - dl.acm.org
A large-scale system contains a huge configuration space because of its large number of
configuration parameters. This leads to a combination explosion among configuration …