The missing piece: a distributed system-level diagnosis model for the implementation of unreliable failure detectors

EP Duarte Jr, LA Rodrigues, ET Camargo, RC Turchetti - Computing, 2023 - Springer
… of failure detectors require the periodic transmissionfailure detection service based on the
distributed diagnosis. The user is a distributed application that accesses the failure detection

A probability-based fault tolerance strategy for service-based systems

F Wang, T Hong, D Wang… - 2021 3rd International …, 2021 - ieeexplore.ieee.org
… the evaluation and identification of critical system components in … The failure to consider
services' quality dynamics will lead to … probability distribution model into a Gaussian distribution. …

Towards failure correlation for improved cloud application service resilience

DR Mathews, M Verma, P Aggarwal… - Proceedings of the 14th …, 2021 - dl.acm.org
… The packets transmitted on the failed link will be lost until the … include strategies for fault
diagnosis and fault recovery. Fault … and failure prediction for a cloud application service based

Service-oriented reliability modeling and autonomous optimization of reliability for public cloud computing systems

S Meng, L Luo, X Qiu, Y Dai - IEEE Transactions on Reliability, 2022 - ieeexplore.ieee.org
failures through anomaly detection of the running state of … link failure and task data
transmission time in a cloud service … environment to construct a service-based reliability model. …

Analysis of a fault-tolerant framework for reliability prediction of service-oriented architecture systems

MC Chiang, CY Huang, CY Wu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
system is an elastic structure that utilizes services discovery … It enables distributed computing
and service integration to … whether the service failure will be transmitted to the next service. …

A highly reliable metadata service for large-scale distributed file systems

J Zhou, Y Chen, W Wang, S He… - … and Distributed Systems, 2019 - ieeexplore.ieee.org
… metadata transmission. To keep metadata consistent, several … active server has detected
failures, it stops providing service and no … of multiple metadata service based on HDFS, we first …

Engineering antifragile self-adaptive systems in service-based architecture

M Giovagnola - 2023 - politesi.polimi.it
… can cause infectious outbreaks, harmful to the vulnerable system, … that does not just stand
up to failure but grows through failuresystem requirements and, if discrepancies are detected, …

A survey of graph-based deep learning for anomaly detection in distributed systems

AD Pazho, GA Noghre, AA Purkayastha… - … on Knowledge and …, 2023 - ieeexplore.ieee.org
… For privacy purposes, a supervised amount of information is transmitted from each location
… Such systems are prone to failures, require longer access times, and are vulnerable to …

A survey on observability of distributed edge & container-based microservices

M Usman, S Ferlin, A Brunstrom, J Taheri - IEEE Access, 2022 - ieeexplore.ieee.org
… , resulting in hard to detect and troubleshoot outages on … are looking for a specific collection
of failures. However, in a … of each cloud to boost network transmission and data processing at …

A multi-agent approach to monitor and manage container-based distributed systems

V Pfeifer, WF Passini, WF Dorante… - IEEE Latin America …, 2021 - ieeexplore.ieee.org
… sistemas SBS (do inglês, ServiceBased Systems) em um contexto … Detection of transmissible
service failure in distributed service-based systems,” Journal of Parallel and Distributed