Bug 118326
Summary: | kernel hangs under nfs/apache access | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 3 | Reporter: | John Sopko <sopko> |
Component: | kernel | Assignee: | Ernie Petrides <petrides> |
Status: | CLOSED WORKSFORME | QA Contact: | Brian Brock <bbrock> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 3.0 | CC: | petrides, riel, sopko |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i686 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2004-03-19 16:24:46 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
John Sopko
2004-03-15 16:38:55 UTC
Can this crash be reproduced in a kernel that is not tainted? The oops was caused inside adestroy_inode(), which isn't even part of the Red Hat Enterprise Linux kernel as released. As I mentioned we run Open AFS, this machine has the Open AFS client installed. Our web server root dir is in AFS. It would be difficult to simulate this without AFS. This is a production server. I will continue to run the single processor kernel since we can not run an un-tainted kernel. (Wish openafs came with the RedHat release). The kernel panic that occured on the Dell 6250 is a bit different then the way the IBM system was locking up. I just got my IBM system to lock up again, (no panic messages), you can still ping the system and I could connect to the ssh and httpd ports but they do not respond so the network interface is partially working. I got it to hang by accessing another NFS server from the IBM, I also have gotten it to hang by accessing the IBM as a NFS server using another system as a client. I run this to get it to hang From the IBM: cd /net/sunhost/large_data_dir find . -type f -exec cat {} \; > /dev/null the cpu load with uptime stays around 1 and the system is responsive, eventually it just stops responding. I know this will be difficult if not impossible to fix without more data. This system is also running a openafs/tainted kernel. Give me some time and I will test on an untainted kernel. Thanks for your help Thanks for the update, John. We will assume that the oops and the lock-ups that you have encountered are due to an AFS bug or an incompatibility between the AFS code and the RHEL 3 kernel. If you can reproduce a problem on an untainted RHEL 3 kernel, please update this Bugzilla entry. Thanks. -ernie Since this is a tainted kernel I am going to close this bug. FYI, I did find one issue that may help and our server has not crashed since. The openafs /etc/init.d/afs startup script was loading the wrong openafs module for the lates kernel: lsmod|grep afs libafs-2.4.21-9.EL-i686.mp 566192 2 and the system is running: uname -r 2.4.21-9.0.1.ELsmp I fixed this and am running the smp kernel and the system has been stable so far. |