Bug 430127
| Summary: | clustat has inconsistent view of cluster membership | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Nick Strugnell <nstrug> |
| Component: | cman | Assignee: | Lon Hohberger <lhh> |
| Status: | CLOSED NOTABUG | QA Contact: | |
| Severity: | low | Docs Contact: | |
| Priority: | low | ||
| Version: | 5.1 | CC: | cluster-maint |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2008-01-25 17:17:37 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Nick Strugnell
2008-01-24 17:43:18 UTC
Did you try restarting rgmanager ? Yes, tried restarting rgmanager, and even cman - same problem. Only cleared by restarting cman on all nodes. Ok - this is caused by an inconsistency in what CMAN sees. Nicholas reproduced this, and we were able to figure out that * clustat output was inconsistent - two nodes thought another was offline; this time it was node 1 that was offline * cman_tool nodes were inconsistent * logs from CLM (openais) were consistent - all 3 nodes were in the most recent new CLM configuration * clustat and cman_tool were consistent with each other on a given node (which is expected) This also caused rgmanager to try to migrate a VM to the same node it was running on, which can't actually happen (and doesn't work, obviously). Currently, the theory is that because 'uname -n' didn't match any bound IP on the system that CMAN was kind of just 'picking' interfaces and it just happened to work sometimes. We're trying using 'cman_tool join -n' by editing /etc/init.d/cman. Channel bonding is in use, FWIW. Assigning with cman_tool join -n gave consistent results. |