Proactive error prediction to improve storage system reliability

F Mahdisoltani, I Stefanovici, B Schroeder - 2017 USENIX Annual …, 2017 - usenix.org
This paper proposes the use of machine learning techniques to make storage systems more
reliable in the face of sector errors. Sector errors are partial drive failures, where individual …

[PDF][PDF] Improving storage system reliability with proactive error prediction

F Mahdisoltani, I Stefanovici, B Schroeder - Proceedings of the 2017 …, 2017 - just.edu.jo
This paper proposes the use of machine learning techniques to make storage systems more
reliable in the face of sector errors. Sector errors are partial drive failures, where individual …

Understanding latent sector errors and how to protect against them

B Schroeder, S Damouras, P Gill - ACM Transactions on storage (TOS), 2010 - dl.acm.org
Latent sector errors (LSEs) refer to the situation where particular sectors on a drive become
inaccessible. LSEs are a critical factor in data reliability, since a single LSE can lead to data …

Enhancing data availability in disk drives through background activities

N Mi, A Riska, E Smirni, E Riedel - 2008 IEEE International …, 2008 - ieeexplore.ieee.org
Latent sector errors in disk drives affect only a few data sectors. They occur silently and are
detected only when the affected area is accessed again. If a latent error is detected while the …

Failure prediction models for proactive fault tolerance within storage systems

B Eckart, X Chen, X He, SL Scott - 2008 IEEE International …, 2008 - ieeexplore.ieee.org
The increasingly large demand for data storage has spurred on the development of systems
that rely on the aggregate performance of multiple hard drives. In many of these …

RAIDShield: characterizing, monitoring, and proactively protecting against disk failures

A Ma, R Traylor, F Douglis, M Chamness, G Lu… - ACM Transactions on …, 2015 - dl.acm.org
Modern storage systems orchestrate a group of disks to achieve their performance and
reliability goals. Even though such systems are designed to withstand the failure of …

[PDF][PDF] A Clean-Slate Look at Disk Scrubbing.

A Oprea, A Juels - FAST, 2010 - usenix.org
A number of techniques have been proposed to reduce the risk of data loss in hard-drives,
from redundant disks (eg, RAID systems) to error coding within individual drives. Disk …

Minority disk failure prediction based on transfer learning in large data centers of heterogeneous disk systems

J Zhang, K Zhou, P Huang, X He, M Xie… - … on Parallel and …, 2020 - ieeexplore.ieee.org
The storage system in large scale data centers is typically built upon thousands or even
millions of disks, where disk failures constantly happen. A disk failure could lead to serious …

Hard drive failure prediction using classification and regression trees

J Li, X Ji, Y Jia, B Zhu, G Wang, Z Li… - 2014 44th annual ieee …, 2014 - ieeexplore.ieee.org
Some statistical and machine learning methods have been proposed to build hard drive
prediction models based on the SMART attributes, and have achieved good prediction …

Dependability analysis of data storage systems in presence of soft errors

M Kishani, M Tahoori, H Asadi - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In recent years, high availability and reliability of data storage systems (DSS) have been
significantly threatened by soft errors occurring in storage controllers. Due to their specific …