Root cause analysis of failures in microservices through causal discovery

A Ikram, S Chakraborty, S Mitra… - Advances in …, 2022 - proceedings.neurips.cc
Most cloud applications use a large number of smaller sub-components (called
microservices) that interact with each other in the form of a complex graph to provide the …

Microscope: Pinpoint performance issues with causal graphs in micro-service environments

JJ Lin, P Chen, Z Zheng - … , ICSOC 2018, Hangzhou, China, November 12 …, 2018 - Springer
Driven by the emerging business models (eg, digital sales) and IT technologies (eg, DevOps
and Cloud computing), the architecture of software is shifting from monolithic to microservice …

Self-adaptive root cause diagnosis for large-scale microservice architecture

M Ma, W Lin, D Pan, P Wang - IEEE Transactions on Services …, 2020 - ieeexplore.ieee.org
The emergence of microservice architecture in Cloud systems poses a new challenges for
the reliability operation and maintenance. Due to numerous services and diverse types of …

Performance diagnosis in cloud microservices using deep learning

L Wu, J Bogatinovski, S Nedelkoski, J Tordsson… - … Conference on Service …, 2020 - Springer
Microservice architectures are increasingly adopted to design large-scale applications.
However, the highly distributed nature and complex dependencies of microservices …

Ms-rank: Multi-metric and self-adaptive root cause diagnosis for microservice applications

M Ma, W Lin, D Pan, P Wang - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
This paper presents a self-adaptive root cause diagnosis framework, named MS-Rank, to
analyze multiple metrics collected from micro-service architecture. MS-Rank decomposes …

Causeinfer: Automatic and distributed performance diagnosis with hierarchical causality graph in large distributed systems

P Chen, Y Qi, P Zheng, D Hou - IEEE INFOCOM 2014-IEEE …, 2014 - ieeexplore.ieee.org
Modern applications especially cloud-based or cloud-centric applications always have many
components running in the large distributed environment with complex interactions. They …

CauseInfer: Automated End-to-End Performance Diagnosis with Hierarchical Causality Graph in Cloud Environment

P Chen, Y Qi, D Hou - IEEE transactions on services computing, 2016 - ieeexplore.ieee.org
Modern computing systems especially cloud-based and cloud-centric systems always
consist of a mass of components running in large distributed environments with complicated …

[HTML][HTML] CausalRCA: causal inference based precise fine-grained root cause localization for microservice applications

R Xin, P Chen, Z Zhao - Journal of Systems and Software, 2023 - Elsevier
Effectively localizing root causes of performance anomalies is crucial to enabling the rapid
recovery and loss mitigation of microservice applications in the cloud. Depending on the …

Causal inference-based root cause analysis for online service systems with intervention recognition

M Li, Z Li, K Yin, X Nie, W Zhang, K Sui… - Proceedings of the 28th …, 2022 - dl.acm.org
Fault diagnosis is critical in many domains, as faults may lead to safety threats or economic
losses. In the field of online service systems, operators rely on enormous monitoring data to …

Microdiag: Fine-grained performance diagnosis for microservice systems

L Wu, J Tordsson, J Bogatinovski… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Microservice architecture has emerged as a popular pattern for developing large-scale
applications for its benefits of flexibility, scalability, and agility. However, the large number of …