Bug 86227
Description
Göran Uddeborg
2003-03-17 17:48:36 UTC
Created attachment 90628 [details]
Configuration for kernel which fails to write.
Created attachment 90629 [details]
Configuration for kernel where I can't trigger this problem
Created attachment 90666 [details]
Contents of /proc/scsi/sym53c8xx/0 with version 1 driver (stable case)
Created attachment 90667 [details]
SCSI-related dmesg messages with version 1 driver (stable case)
Created attachment 90680 [details]
Contents of /proc/scsi/sym53c8xx/0 with version 2 driver (unstable case)
Created attachment 90681 [details]
SCSI-related dmesg messages with version 2 driver (unstable case)
Created attachment 90736 [details]
Messages from driver in log
After putting the logs on a different partition as Alan suggested, I've got a
crash now where a lot of info was written to the messages file.
The complete messages are in the attachment. They come in a number of phases,
briefly shown below. To me it seems like the driver is trying harder and
harder to reset things, and then gives up, consequently causing
problems for the file system using the disk.
But I don't know how to figure out why this happens only with the version 2 driver.
Phase 1 consists of some initial messages:
Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation started.
Mar 25 17:51:16 uebn kernel: sym0:0:control msgout: 80 20 63 d.
Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation complete.
Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation started.
Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation failed.
The last two are then repeated a lot of times. Next phase does this
once:
Mar 25 17:51:17 uebn kernel: sym0:0:0: DEVICE RESET operation started.
Mar 25 17:51:17 uebn kernel: sym0:0:0: DEVICE RESET operation failed.
Then a lot of times this:
Mar 25 17:51:17 uebn kernel: sym0:0:0: BUS RESET operation started.
Mar 25 17:51:17 uebn kernel: sym0:0:0: BUS RESET operation failed.
Then, again a lot of times:
Mar 25 17:52:36 uebn kernel: sym0:0:0: HOST RESET operation started.
Mar 25 17:52:36 uebn kernel: sym0:0:0: HOST RESET operation failed.
Then this once:
Mar 25 17:55:16 uebn kernel: scsi: device set offline - command error
recover failed: host 0 channel 0 id 0 lun 0
Mar 25 17:55:16 uebn kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
return code = 6000028
This a lot of times, for different sectors:
Mar 25 17:55:16 uebn kernel: I/O error: dev 08:02, sector 1458226
Then there is this a couple of times. The return code varies between
these two, the sector varies:
Mar 25 17:55:17 uebn kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
return code = 6000028
Mar 25 17:55:17 uebn kernel: I/O error: dev 08:02, sector 2
Mar 25 17:55:17 uebn kernel: SCSI disk error : host 0 channel 0 id 0 lun 0
return code = 6050000
Mar 25 17:55:17 uebn kernel: I/O error: dev 08:02, sector 4853152
Mar 25 17:55:17 uebn kernel: I/O error: dev 08:02, sector 4853154
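The two return codes above can be split into their component bytes to see which layer reported what. The following is only a sketch: it assumes the value is printed in hexadecimal and that the 32-bit result word is laid out as driver byte, host byte, message byte and status byte from high to low, which is how I understand the kernel's SCSI result macros. The function name split_scsi_result is made up for illustration.

```python
def split_scsi_result(code_hex):
    """Split a SCSI result word into its four one-byte fields.

    Assumption: the logged value is hexadecimal and the layout is
    driver (bits 24-31), host (16-23), msg (8-15), status (0-7).
    """
    value = int(code_hex, 16)
    return {
        "driver": (value >> 24) & 0xFF,
        "host": (value >> 16) & 0xFF,
        "msg": (value >> 8) & 0xFF,
        "status": value & 0xFF,
    }


# The two return codes seen in the log excerpt.
for code in ("6000028", "6050000"):
    fields = split_scsi_result(code)
    print(code, {name: hex(v) for name, v in fields.items()})
```

Under those assumptions both codes carry the same driver byte (0x06). The first has a nonzero status byte (0x28, which in the SCSI standard is the TASK SET FULL / queue full status), and the second a nonzero host byte (0x05). The symbolic names of the driver and host values would have to be checked against the kernel's scsi.h for the kernel in question.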
Final phase also gives file system error messages. Repeats for
various sectors until I reboot:
Mar 25 17:55:18 uebn kernel: I/O error: dev 08:02, sector 1458232
Mar 25 17:55:18 uebn kernel: I/O error: dev 08:02, sector 2
Mar 25 17:55:18 uebn kernel: EXT2-fs error (device sd(8,2)):
ext2_write_inode: unable to read inode block - inode=182342, block=729116
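To put numbers on "a lot of times" for each phase, the recovery messages can be tallied per operation and outcome. This is a rough sketch, not something I have run against the full attachment: the regular expression is inferred only from the message format in the excerpts above, and the name tally_recovery is hypothetical.

```python
import re
from collections import Counter

# Matches lines such as "... kernel: sym0:0:0: BUS RESET operation failed."
# The pattern is an assumption based on the excerpts in this report.
OP_RE = re.compile(
    r"sym\d+:\d+:\d+: ([A-Z ]+) operation (started|complete|failed)\."
)


def tally_recovery(lines):
    """Count (operation, outcome) pairs in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = OP_RE.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts


# A few lines copied from the excerpts above.
sample = [
    "Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation started.",
    "Mar 25 17:51:16 uebn kernel: sym0:0:control msgout: 80 20 63 d.",
    "Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation complete.",
    "Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation started.",
    "Mar 25 17:51:16 uebn kernel: sym0:0:0: ABORT operation failed.",
    "Mar 25 17:51:17 uebn kernel: sym0:0:0: DEVICE RESET operation started.",
    "Mar 25 17:51:17 uebn kernel: sym0:0:0: DEVICE RESET operation failed.",
]

for (operation, outcome), count in sorted(tally_recovery(sample).items()):
    print(f"{operation:13} {outcome:9} {count}")
```

Passing an open file object for the messages file instead of the sample list should give the per-phase totals, since the function only iterates over lines.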
Thanks for the bug report. However, Red Hat no longer maintains this version of the product. Please upgrade to the latest version and open a new bug if the problem persists. The Fedora Legacy project (http://fedoralegacy.org/) maintains some older releases, and if you believe this bug is interesting to them, please report the problem in the bug tracker at: http://bugzilla.fedora.us/