Red Hat Bugzilla – Bug 162212
st causes system hang and kernel panic when writing to tape on x86_64
Last modified: 2008-01-10 13:44:14 EST
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.8) Gecko/20050511 Firefox/1.0.4
Description of problem:
While testing Veritas NetBackup 5.1 MP3 on RedHat 3 x86_64 we ran across this issue. The system locks up when beginning to write to a second tape path. This forces a reboot and the system throws a kernel panic:
0f 0b b6 7c 2d 80 ff ff ff ff 2b 00 eb 5d 48 8b 4b 08 48 85
Kernel panic: Fatal exception
The system shows this panic everytime it is booted until the device is unattached from the host.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Install RedHat 3 x86 64_bit
2. Install NetBackup 5.1 MP3
3. Create a policy that will write to two devices at the same time
4. Fire off the policy and the system locks
5. Reboot to see kernel panic
6. Unplug the device or else the system will panic everytime it boots
Actual Results: System locked.
Expected Results: Should keep writing to tapes.
This issue was seen on two different machine types including a Sun Fire V20z (dual Opteron's) and Supermicro (dual EM64T's). The exact same machines were loaded with the 32-bit build of RH3 and worked without a problem.
Tried recreating this using tar and mt but the system does not lock. The 32-bit Linux build of NetBackup was used on both RH3 32-bit, which didn't show this issue, and RH3 x86_64 which did. This, and the system trace make us believe that it is not a NetBackup issue. Qlogic and Emulex have investigated the problem since it was their driver that originally called st but they have not found anything in their debug logs that indicate an issue with their driver.
Found a similar bug (but different) here:
Sebastien BLAISOT (email@example.com)
Comment #14 has the same panic trace that I had.
Engineering is waiting on the output from the latest IT post requesting the
The kernel rpms can be downloaded from
A new set of test kernel RPMs have been posted to the same place as before.
These include the fix for the sg+st write bug and one other tweak that might
help with this problem. Please test these out and let me know the results.
NEEDINFO_REPORTER does not seem to be the correct state for this, moving back to
A fix for this problem has just been committed to the RHEL3 U7
patch pool this evening (in kernel version 2.4.21-37.6.EL).
*** Bug 156396 has been marked as a duplicate of this bug. ***
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.
I tested RC1 of Update 7 and it is causing the same panic. The hotfix provided
a few months back did resolve the issue and several customers are using it
Can you tell me the kernel version on the U7 RC1 system you're using? I want to
make sure it should have had the fix incorporated into it.
I obtained the Update from the FTP location here:
(In reply to comment #71)
josef, let's move this thread to bug 182996 because this current bug was for
RHEL 3 U7 and bug 182996 is for RHEL 3 U8. Let's consider this CLOSED and bug
182996 as a regression to the "fix".
Fixing bug's disposition (reverting to ERRATA).