Bug 169329

Summary: infinite retry with RAID1 with a single disk that has bad blocks
Product: [Retired] Fedora Legacy Reporter: David Ellsworth <ellswort>
Component: kernelAssignee: Fedora Legacy Bugs <bugs>
Status: CLOSED WONTFIX QA Contact: Brian Brock <bbrock>
Severity: medium Docs Contact:
Priority: medium    
Version: fc2CC: wtogami
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard: DEFER
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2006-08-13 14:41:44 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
/var/log/messages output showing problem none

Description David Ellsworth 2005-09-27 03:56:38 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.6) Gecko/20050328 Fedora/1.7.6-1.2.5

Description of problem:
I had a RAID 1 mirror when one disk failed.  When backing up the data on the remaining disk, I found that it had bad blocks for a few files -- i.e. it had errors when trying to read the files.  The RAID1 kernel driver repeatedly tried to retry the failed read until the system was rebooted.  No I/O errors were returned.  Mounting the partition without using the RAID software just tried to read the blocks once and then returned an I/O error.

Version-Release number of selected component (if applicable):
kernel-smp-2.6.10-1.771_FC2

How reproducible:
Always

Steps to Reproduce:
1. Get a disk with bad blocks.
2. Set it up as a RAID 1 disk with only one disk (i.e. one failed disk, one running one)
3. Somehow allocate a file that uses those bad blocks.
4. Read the file (cat it).
  

Additional info:

See /var/log/messages in the attachment.  The lines with
    raid1: hdc4: rescheduling sector 36551664
and
    raid1: hdc4: redirecting sector 36551664 to another mirror
point to a problem in the kernel in drivers/md/raid1.c where it doesn't
check to see if there is only one disk in the set before retrying a
failed I/O operation.  

Also, from comparing raid1.c from 2.6.10-1.771_FC2smp and kernel.org's 
2.6.13.2, it appears that the bug is also in the mainline kernel.

Comment 1 David Ellsworth 2005-09-27 03:59:24 UTC
Created attachment 119287 [details]
/var/log/messages output showing problem

Comment 2 Pekka Savola 2005-11-16 13:30:33 UTC
This doesn't seem to be important enough to fix just on its own, so mark it DEFER.

Comment 3 Jesse Keating 2006-08-13 14:41:44 UTC
Not a security issue.