作者
Mingjie Li, Zeyan Li, Kanglin Yin, Xiaohui Nie, Wenchi Zhang, Kaixin Sui, Dan Pei
发表日期
2022/8/14
图书
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
页码范围
3230-3240
简介
Fault diagnosis is critical in many domains, as faults may lead to safety threats or economic losses. In the field of online service systems, operators rely on enormous monitoring data to detect and mitigate failures. Quickly recognizing a small set of root cause indicators for the underlying fault can save much time for failure mitigation. In this paper, we formulate the root cause analysis problem as a new causal inference task namedintervention recognition. We proposed a novel unsupervised causal inference-based method namedCausal Inference-based Root Cause Analysis (CIRCA). The core idea is a sufficient condition for a monitoring variable to be a root cause indicator,i.e., the change of probability distribution conditioned on the parents in the Causal Bayesian Network (CBN). Towards the application in online service systems, CIRCA constructs a graph among monitoring metrics based on the knowledge of …
引用总数
学术搜索中的文章
M Li, Z Li, K Yin, X Nie, W Zhang, K Sui, D Pei - Proceedings of the 28th ACM SIGKDD Conference on …, 2022