Red Hat Linux Beta - beta3, Feb 7, 2001
Passing HW: ML370 (no ROC)
Failing HW: DL360 (no ROC), ML570 (no ROC)

SCSI Timeout Errors on NCR53c1510D / sym53c1510D

After a successful installation and reboot, the system displays the following repeating error:

SCSI host 0 abort PID90) times out-resetting
SCSI bus is being reset for host 0 channel 0
sym53c8xx_reset PID=0, reset flags=2

Tried 3 separate ML570s and one DL360. The ML370 has the same chipset and installs OK.

Duplicatable: Yes.
Compaq Bug #: 194115
Compaq considers this defect a MUST-FIX for Florence.
This defect is considered MUST-FIX for Florence Gold release
Does this hardware work with the 2.2 kernel? Does the ncr53c8xx driver work with this hardware?
Does the boot argument "noapic" affect this?
Also, have you tried changing the OS type in the BIOS? Answers to these questions would make it a lot easier for us to track these down...
If we do not get feedback on this report, we won't have much else to do with it other than close it...
The following SCSI issues should prove to be dependent upon the APIC fix found in Linux Kernel 2.4.2-patch5. These SCSI issues are being tested at this time, now that the latest beta seems to have the needed patch.

30170 - Full-Table-Mapped APIC mode causes a problem with installing Kernel Linux version 2.4.1-0.1.9smp or later.
26632 - Cannot install to a Compaq array controller - hang
26633 - SCSI Timeout Errors on NCR53c1510D / sym53c1510D
26634 - [aic7xxx] Kernel Bug, 64bit slot, adaptec ctrlr, DL360
qa0309 does not have the fix needed. Drivers still error out (timeouts, hangs, ...)
This information was already forwarded to Red Hat. Just wanted to update Bugzilla.

I think we've found the problem that's breaking SMP/APIC support on ServerWorks servers. It appears that when the 2.4.0 kernel was released (on Jan. 4), an old version of drivers/ide/osb4.c crept back in. Whenever we build a kernel with this suspect version enabled, it breaks SMP APIC interrupts. If you do a diff of osb4.c from 2.4.0-prerelease and 2.4.0, you'll find a ton of changes. Our guess is that it's one of these changes (probably chipset initialization) that's breaking APIC support with SMP kernels.

Here is some more information from our developers. The below fix works! I see 2 basic ways for this bug to be fixed:

1) Make sure that the CONFIG_BLK_DEV_OSB4 line in the kernel .config file is NOT set, or
2) Replace the osb4.c file with the osb4.c file from the 2.4.0-prerelease source tree. (This is probably a better solution.)
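
For anyone who wants to verify option 1 against a given kernel .config, here is a minimal Python sketch. It assumes the usual .config convention, where an enabled option appears as `CONFIG_FOO=y` (or `=m`) and a disabled one as `# CONFIG_FOO is not set`. The function name `option_enabled` and the sample strings are made up for illustration and are not taken from any Red Hat or Compaq config file.

```python
import re

def option_enabled(config_text: str, option: str) -> bool:
    """Return True if `option` is set to y or m in kernel .config text.

    Hypothetical helper: a line such as '# CONFIG_FOO is not set'
    starts with '#', so it never matches and counts as disabled.
    """
    pattern = re.compile(r"^" + re.escape(option) + r"=(y|m)\s*$", re.MULTILINE)
    return pattern.search(config_text) is not None

# Illustrative .config fragments (not real config files).
sample_off = "CONFIG_BLK_DEV_IDEPCI=y\n# CONFIG_BLK_DEV_OSB4 is not set\n"
sample_on = "CONFIG_BLK_DEV_IDEPCI=y\nCONFIG_BLK_DEV_OSB4=y\n"

if __name__ == "__main__":
    for name, text in (("sample_off", sample_off), ("sample_on", sample_on)):
        state = "ENABLED" if option_enabled(text, "CONFIG_BLK_DEV_OSB4") else "not set"
        print(name, "CONFIG_BLK_DEV_OSB4:", state)
```

To check a real tree, read the .config file's contents into a string and pass it as `config_text`.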
CONFIG_BLK_DEV_OSB4 is off in all our config files
Waiting for Compaq to officially confirm fix.
Compaq confirms that the 2.4.2-0.1.35smp kernel (with CONFIG_BLK_DEV_OSB4 turned OFF) resolves the SCSI Timeout and CPQARRAY lockups during post-installation boot. HOWEVER, Compaq would also like to see the CONFIG_BLK_DEV_OSB4 option marked as EXPERIMENTAL in drivers/ide/Config.in in all Florence kernel packages, as well as in the "ac" series kernels. This will keep customers from easily stumbling across this problem.
This driver is now marked as "Dangerous" in our kernelsource. Alan has agreed to this concept and the patch will be mailed to him in a few minutes.