Bug 257241
| Summary: | failing sync'ed primary leg along with node doing I/O can result in corrupt mirror | ||
|---|---|---|---|
| Product: | [Retired] Red Hat Cluster Suite | Reporter: | Corey Marthaler <cmarthal> |
| Component: | cmirror-kernel | Assignee: | Jonathan Earl Brassow <jbrassow> |
| Status: | CLOSED DUPLICATE | QA Contact: | Cluster QE <mspqa-list> |
| Severity: | low | Docs Contact: | |
| Priority: | medium | ||
| Version: | 4 | ||
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | All | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2008-03-26 17:51:18 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Corey Marthaler
2007-08-27 18:10:42 UTC
I think the problem here is that when you kill a node, there _are_ regions that are out-of-sync. Thus, if you kill the primary leg, mirror will complain. I'm not sure what to do about this yet. The same problem should exist on single machine mirrors if you do the following: 0) do I/O to a mirror 1) kill machine 2) kill primary leg of mirror before mirror has a chance to resync after reboot If there is no clean solution, our only option may be to shutdown the mirror and forcibly convert it to linear. Jon, Isn't this possibly a similar case to BA's setup? Where one leg of a mirror along with one of the nodes in a cluster is attached to one power supply and another leg and node could be attached to another. Then you could potentially lose a primary leg and node at the same time due to one switch failing. Or does this bug require a pause between killing the leg and killing a machine? It's a little different in that case because of the way failover happens in HA LVM. In the case of HA LVM, if the machine dies and everything fails over, then during the window of time where the mirror is resyncing, the disk fails... then you might be able to hit something like this. IOW, the primary device needs to die when the mirror is active, but not in-sync. This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. |