Understanding latent sector errors and how to protect against them

B Schroeder, S Damouras, P Gill - ACM Transactions on storage (TOS), 2010 - dl.acm.org
Latent sector errors (LSEs) refer to the situation where particular sectors on a drive become
inaccessible. LSEs are a critical factor in data reliability, since a single LSE can lead to data …

[PDF][PDF] A Clean-Slate Look at Disk Scrubbing.

A Oprea, A Juels - FAST, 2010 - usenix.org
A number of techniques have been proposed to reduce the risk of data loss in hard-drives,
from redundant disks (eg, RAID systems) to error coding within individual drives. Disk …

An analysis of latent sector errors in disk drives

LN Bairavasundaram, GR Goodson… - Proceedings of the …, 2007 - dl.acm.org
The reliability measures in today's disk drive-based storage systems focus predominantly on
protecting against complete disk failures. Previous disk reliability studies have analyzed …

Undetected disk errors in RAID arrays

JL Hafner, V Deenadhayalan… - IBM Journal of …, 2008 - ieeexplore.ieee.org
Though remarkably reliable, disk drives do fail occasionally. Most failures can be detected
immediately; moreover, such failures can be modeled and addressed using technologies …

Proactive error prediction to improve storage system reliability

F Mahdisoltani, I Stefanovici, B Schroeder - 2017 USENIX Annual …, 2017 - usenix.org
This paper proposes the use of machine learning techniques to make storage systems more
reliable in the face of sector errors. Sector errors are partial drive failures, where individual …

Disk scrubbing versus intradisk redundancy for RAID storage systems

I Iliadis, R Haas, XY Hu, E Eleftheriou - ACM transactions on storage …, 2011 - dl.acm.org
Two schemes proposed to cope with unrecoverable or latent media errors and enhance the
reliability of RAID systems are examined. The first scheme is the established, widely used …

RAIDShield: characterizing, monitoring, and proactively protecting against disk failures

A Ma, R Traylor, F Douglis, M Chamness, G Lu… - ACM Transactions on …, 2015 - dl.acm.org
Modern storage systems orchestrate a group of disks to achieve their performance and
reliability goals. Even though such systems are designed to withstand the failure of …

Disk scrubbing versus intra-disk redundancy for high-reliability raid storage systems

I Iliadis, R Haas, XY Hu, E Eleftheriou - ACM SIGMETRICS Performance …, 2008 - dl.acm.org
Two schemes proposed to cope with unrecoverable or latent media errors and enhance the
reliability of RAID systems are examined. The first scheme is the established, widely used …

[PDF][PDF] SFS: random write considered harmful in solid state drives.

C Min, K Kim, H Cho, SW Lee, YI Eom - FAST, 2012 - usenix.org
Over the last decade we have witnessed the relentless technological improvement in flash-
based solidstate drives (SSDs) and they have many advantages over hard disk drives …

The tail at store: A revelation from millions of hours of disk and {SSD} deployments

M Hao, G Soundararajan… - … USENIX Conference on …, 2016 - usenix.org
We study storage performance in over 450,000 disks and 4,000 SSDs over 87 days for an
overall total of 857 million (disk) and 7 million (SSD) drive hours. We find that storage …