Illuminating the Gray Zone: Non-intrusive Gray Failure Localization in Server Operating Systems

S Zhang, Y Zhao, X Xiong, Y Sun, X Nie… - … Proceedings of the …, 2024 - dl.acm.org
Timely localization of the root causes of gray failure is essential for maintaining the stability
of the server OS. The previous intrusive gray failure localization methods usually require …

A new model based on belief rule base and membership function (BRB-MF) for health state prediction in sensor

X Yin, G Shi, S Peng, B Zhang… - Advances in Mechanical …, 2022 - journals.sagepub.com
Health state prediction is an effective way to improve the reliability for sensors. In the
process of sensor degradation, it is difficult to obtain more effective monitoring data. And in …

[PDF][PDF] Root cause analysis for large-scale cloud-native applications

B Zurkowski - 2022 - doktoraty.iet.agh.edu.pl
This chapter introduces the research problem considered in this dissertation. Followed by
presenting the problem motivation, the dissertation thesis is formulated. Then, the research …

Anomaly Diagnosis Method and Condition Assessment of Power Metering Device Based on SSD Algorithm

C Jiang, J Wang, Y Wang, W Zhao - Scalable Computing: Practice and …, 2023 - scpe.org
The advancement of anomaly diagnosis methods plays a crucial role in classifying and
analyzing data, particularly in distinguishing between normal and abnormal patterns. This …

Advancing Root Cause Analysis in Cloud-native System with Knowledge Graph Path Embedding Translation

P Li, Q Du, S Zhao, P Fang - 2024 27th International …, 2024 - ieeexplore.ieee.org
Cloud computing technologies, including cloud-native and containerization, have gained
prominence in recent years, attributed to their exceptional scalability, enhanced resource …

Gwad: Greedy workflow graph anomaly detection framework for system traces

W Setiawan, Y Thounaojam… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
System traces are a collection of time-stamped messages recorded by the operating system
while the system is running. Analysis of these traces is crucial for tasks such as system fault …

MicroCBR: Case-Based Reasoning on Spatio-temporal Fault Knowledge Graph for Microservices Troubleshooting

F Liu, Y Wang, Z Li, R Ren, H Guan, X Yu… - … Conference on Case …, 2022 - Springer
With the growing market of cloud-native applications, microservices architectures are widely
used for rapid and automated deployments, scaling, and management. However, behind the …

AmazeMap: A Microservices Fault Localization Method Based on Multi-Level Impact Graph

李亚晓, 李青山, 王璐, 姜宇轩 - Journal of Software, 2024 - jos.org.cn
微服务软件系统由于其具有大量复杂的服务依赖关系和组件化模块, 一个服务发生故障往往造成
与之相关的一个或多个服务发生故障, 导致故障定位的难度不断提高. 因此 …

[图书][B] Self-Supervised Distributed Machine Learning for Robust Containerized Systems

Y Lin - 2023 - search.proquest.com
Containers are widely embraced in production computing environments due to their
efficiency and minimal isolation overhead. However, these applications are vulnerable to …

Automating inventory composition management for bulk purchasing cloud brokerage strategy

C Boonprasop - 2024 - research-repository.st-andrews.ac …
Cloud providers offer end-users various pricing schemes to allow them to tailor VMs to their
needs, eg, a pay-as-you-go billing scheme, called on-demand, and a discounted contract …