[PDF][PDF] Mining invariants from console logs for system problem detection

JG Lou, Q Fu, S Yang, Y Xu, J Li - 2010 USENIX Annual Technical …, 2010 - usenix.org
Detecting execution anomalies is very important to the maintenance and monitoring of large-
scale distributed systems. People often use console logs that are produced by distributed …

Performance issue monitoring, identification and diagnosis of SaaS software: a survey

R Wang, X Tian, S Ying - Frontiers of Computer Science, 2025 - Springer
Abstract SaaS (Software-as-a-Service) is a service model provided by cloud computing. It
has a high requirement for QoS (Quality of Software) due to its method of providing software …

FD4C: Automatic fault diagnosis framework for Web applications in cloud computing

T Wang, W Zhang, C Ye, J Wei… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
The large-scale dynamic cloud computing environment has raised great challenges for fault
diagnosis in Web applications: First, fluctuating workloads cause traditional application …

Fault detection for cloud computing systems with correlation analysis

T Wang, W Zhang, J Wei… - 2015 IFIP/IEEE …, 2015 - ieeexplore.ieee.org
The large-scale dynamic cloud computing environment has raised great challenges for fault
diagnosis in Web applications. First, fluctuating workloads cause traditional application …

Workload-aware anomaly detection for web applications

T Wang, J Wei, W Zhang, H Zhong, T Huang - Journal of Systems and …, 2014 - Elsevier
The failure of Web applications often affects a large population of customers, and leads to
severe economic loss. Anomaly detection is essential for improving the reliability of Web …

[PDF][PDF] Survey on complex event processing and predictive analytics

LJ Fülöp, G Tóth, R Rácz, J Pánczél, T Gergely… - Proceedings of the Fifth …, 2010 - inf.szte.hu
Observing failures and other–desired or undesired–behavior patterns in large scale
software systems of specific domains (telecommunication systems, information systems …

Automated modeling and tracking of transaction flow dynamics for fault detection in complex systems

G Jiang, H Chen, C Ungureanu, K Yoshihira - US Patent 7,590,513, 2009 - Google Patents
(57) ABSTRACT A method and system that automatically derives models between
monitored quantities under non-faulty conditions so that subsequent faults can be detected …

Exploiting local and global invariants for the management of large scale information systems

H Chen, H Cheng, G Jiang… - 2008 Eighth IEEE …, 2008 - ieeexplore.ieee.org
This paper presents a data oriented approach to modeling the complex computing systems,
in which an ensemble of correlation models are discovered to represent the system status. If …

Failure detection in large-scale internet services by principal subspace mapping

H Chen, G Jiang, K Yoshihira - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
Fast and accurate failure detection is becoming essential in managing large scale Internet
services. This paper proposes a novel detection approach based on the subspace mapping …

Workload-aware online anomaly detection in enterprise applications with local outlier factor

T Wang, W Zhang, J Wei… - 2012 IEEE 36th Annual …, 2012 - ieeexplore.ieee.org
Detecting anomalies are essential for improving the reliability of enterprise applications.
Current approaches set thresholds for metrics or model correlations between metrics, and …