Artificial Intelligence for IT operations (AIOps) aims to combine the power of AI with the big data generated by IT Operations processes, particularly in cloud infrastructures, to provide …
J Zhu, S He, J Liu, P He, Q Xie… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Logs are imperative in the development and maintenance process of many software systems. They record detailed runtime information that allows developers and support …
Q Lin, H Zhang, JG Lou, Y Zhang, X Chen - Proceedings of the 38th …, 2016 - dl.acm.org
Logs play an important role in the maintenance of large-scale online service systems. When an online service fails, engineers need to examine recorded logs to gain insights into the …
Logs are often used for troubleshooting in large-scale software systems. For a cloud-based online system that provides 24/7 service, a huge number of logs could be generated every …
The management of cloud service incidents (unplanned interruptions or outages of a service/product) greatly affects customer satisfaction and business revenue. After years of …
Many software services today are hosted on cloud computing platforms, such as Amazon EC2, due to many benefits like reduced operational costs. However, node failures in these …
Logging has been a common practice for monitoring and diagnosing performance issues. However, logging comes at a cost, especially for large-scale online service systems. First …
AIOps (Artificial Intelligence for IT Operations) leverages machine learning models to help practitioners handle the massive data produced during the operations of large-scale …
Y Lyu, H Li, ZM Jiang, AE Hassan - ACM Transactions on Software …, 2024 - dl.acm.org
AIOps (Artificial Intelligence for IT Operations) solutions leverage the massive data produced during the operation of large-scale systems and machine learning models to assist software …