Bug 234600
Summary: | [Emulex 5.2 bug] Soft lockup detected when FC storage array is disconnected while IO running | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Bino J Sebastian <bino.sebastian> |
Component: | kernel | Assignee: | Mike Christie <mchristi> |
Status: | CLOSED NOTABUG | QA Contact: | Martin Jenner <mjenner> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 5.0 | CC: | abdel.sadek, andriusb, coughlan, james.smart, jamie.wellnitz, jturner, khtan, laurie.barry, prarit, yanling.qi |
Target Milestone: | --- | Keywords: | OtherQA |
Target Release: | --- | ||
Hardware: | i386 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2008-03-15 03:53:51 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 217217, 425461 |
Description
Bino J Sebastian
2007-03-30 14:31:20 UTC
Added to the RHEL 5.1 prioritization list. This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. From Bino via email: Andrius, The stack trace of bug 234600 is also present in the log file of the bug 240473. But there are other stacks also present in bug 240473 log file. This could be due to the multipathing driver running in bug 240473 test environment. There is a good probability that the root cause of these two issues are same. -bino Out of runway for 5.1 - deferring to 5.2. Bino - are you still experiencing this with 5.1 Beta? We've heard from other partners and customers that this may have gone away between 5.0 and 5.1. We are seeing a similar issueon PPC system with RHEL5.1 following is the stack trace from Power PC system. See also Bugzilla 234600. This bugzilla might be related to this issue. smp_call_function on cpu 1: other cpus not responding (2) 1:mon> t [c0000000c8ad3a10] c000000000070900 .on_each_cpu+0x24/0x88 [c0000000c8ad3ab0] c0000000000ee128 .invalidate_bh_lrus+0x28/0x40 [c0000000c8ad3b30] c0000000000f64b4 .kill_bdev+0x34/0x60 [c0000000c8ad3bb0] c0000000000f6e8c .__blkdev_put+0x88/0x220 [c0000000c8ad3c50] c0000000000eca1c .__fput+0x108/0x25c [c0000000c8ad3d00] c0000000000e8fa4 .filp_close+0xac/0xd4 [c0000000c8ad3d90] c0000000000eacf4 .sys_close+0xc4/0x110 [c0000000c8ad3e30] c0000000000086a4 syscall_exit+0x0/0x40 Bugzilla number in the above comment is wrong. Please also see Bugzilla 408541. There are enough differences so that these may not be related, but note that we're seeing a soft-lockup in 5.1 when trying to unmount a volume that's gone readonly after an error. I reported it in bug #429054 , see that for more info. In our situation, the device (/dev/md7) is a RAID1 made up of two volumes, each coming off a different SAN array. We're using QLogic cards, though, not Emulex. Emulex - have you tested this on the latest 5.2 Beta RC bits to see if this still persists? Apparently there has been a workaround discovered by Emulex by doing the following: "The default syslog.conf file for RHEL makes all the printk statements sync with /var/log/messages file. After I changed the syslog.conf file to not to sync the printk with /var/log/messages file, the issue was not re-produceable." If this continues to be a problem, please re-open this bugzilla. |