Bug 108613
Summary: | raid5 corruption whenever drive is lost | ||
---|---|---|---|
Product: | [Retired] Red Hat Linux | Reporter: | Hrunting Johnson <hrunting> |
Component: | kernel | Assignee: | Arjan van de Ven <arjanv> |
Status: | CLOSED DUPLICATE | QA Contact: | Brian Brock <bbrock> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 9 | CC: | riel |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i686 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2006-02-21 18:59:34 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Hrunting Johnson
2003-10-30 18:09:23 UTC
I now have systems, again with 3ware cards (7810s, JBOD mode, PATA cards), that produce this corruption, this time with Seagate drives. This is definitely a problem with the raid5 code. I have a system with 16 drives. I partition each drive into 4 partitions, then create 4 RAID5 arrays with 1 partition from each drive. I then put an array under intensive I/O (both read and write) and then do 'raidsetfaulyt <array> <partition>'. Immediately I see filesystem corruption on the filesystem on that array. Here are a few of the messages: Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1949435972, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1773623376, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1847329996, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=2080538872, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1963576516, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1813626640, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1945409144, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1781930152, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1795790536, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1872381408, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=2020518828, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1796667100, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=2006427588, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1949435972, limit=1751284800 Nov 4 08:17:29 r25 kernel: attempt to access beyond end of device Nov 4 08:17:29 r25 kernel: 09:04: rw=0, want=1949435972, limit=1751284800 Also, with the direct raidsetfaulty method, this is ALWAYS reproducible. The bug report in 109251 gives a much more detailed explanation of why I believe this happening. *** This bug has been marked as a duplicate of 109251 *** Changed to 'CLOSED' state since 'RESOLVED' has been deprecated. |