Bug 172435
Summary: | multipath fails in case of service processor failure on emx box | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Thomas Krieger <thomas.krieger> |
Component: | device-mapper-multipath | Assignee: | Ben Marzinski <bmarzins> |
Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 4.0 | CC: | agk, bmarzins, christophe.varoqui, dmo, dwysocha, egoggin, lmb, mbroz, thomas.krieger, tranlan |
Target Milestone: | --- | Keywords: | Reopened |
Target Release: | --- | ||
Hardware: | i386 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2007-07-24 10:39:44 UTC | Type: | --- |
Description
Thomas Krieger
2005-11-04 13:04:23 UTC
Ed, any thoughts?

This works fine for me when copying an entire 5 GB block device with dd(1) using upstream code (2.6.14-rc4 and multipath-tools at git head) while reassigning the block device's logical unit via my own utility. I've been testing this use case in order to test a fix to multipathd(8) that reduces the number of events that cause it to fail back to the highest-priority path group. This is needed to keep multipathd from failing back to the default group when a block device is reassigned to a different path group (e.g., CLARiiON trespass) by software external to the current multipathing software (SAN management software, another cluster node, or storage services software on the CLARiiON itself).

Possibly they are not running with the queue_if_no_path attribute, and the combination of a trespass followed by a multipathd-induced failback is causing a short period during which all paths are down. I'll think about it some more.

Is this still an issue?

Hi, sorry for the delay in answering, but I was on holiday for a few days. I think that's no longer an issue. The mentioned Oracle RAC clusters are in production and we have not seen this behavior again. In the meantime we had FLARE code updates on the EMC boxes, which result in a trespass of the LUNs to the other service processor during the update. There were no strange results.

Kind regards,
Thomas