Bug 173633
Summary: cman/sm nodeid lookup fails
Product: [Retired] Red Hat Cluster Suite
Reporter: Scott Cannata <scott.cannata>
Component: cman
Assignee: David Teigland <teigland>
Status: CLOSED WORKSFORME
QA Contact: Cluster QE <mspqa-list>
Severity: medium
Priority: medium
Version: 4
CC: ccaulfie, cluster-maint
Hardware: x86_64
OS: Linux
Doc Type: Bug Fix
Last Closed: 2006-05-04 16:52:31 UTC

Description
Scott Cannata
2005-11-18 19:14:07 UTC
Created attachment 121241
ascii file output from kdb, see above description
Could you verify that cluster.conf was the same on all nodes? A copy of it would be helpful to see, along with the output of 'cman_tool nodes' from one of the other nodes if they are still running.

This assertion failure may indicate some sort of internal consistency problem within cman: the sm portion is looking up a nodeid that the cnxman portion doesn't know about, which shouldn't be possible. If the assignment of nodeids to nodes was changing while the cluster was running (possibly because of different versions of cluster.conf on the nodes), that might lead to this kind of error. If unusually large nodeids are being used, that may point toward the cnxman code that dynamically enlarges the standard node arrays.

This assertion failure was reported to me once before in an email (in May, by Dan Phung), and he had been updating cluster.conf on some nodes. I'm adding a printk to the code to provide a little more information in the assertion message if this happens again.

Waiting for someone to see this again and report with the additional info from the panic.
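
For anyone who hits this again, the cluster.conf comparison requested above can be sketched roughly as follows. This is only an illustration, not a tool shipped with the cluster suite: it assumes each node's cluster.conf has already been copied to one machine, and it assumes nodes are declared as `clusternode` elements carrying `name` and `nodeid` attributes. The helper names `nodeid_map` and `compare_confs`, the host labels, and the sample configuration are all hypothetical.

```python
import xml.etree.ElementTree as ET


def nodeid_map(conf_text):
    """Return {node name: nodeid string or None} for one cluster.conf text.

    Assumes <clusternode name="..." nodeid="..."> elements; a node without
    a nodeid attribute maps to None.
    """
    root = ET.fromstring(conf_text)
    return {n.get("name"): n.get("nodeid") for n in root.iter("clusternode")}


def compare_confs(confs):
    """Compare nodeid assignments across hosts.

    confs maps a host label to the text of that host's cluster.conf.
    The first host (in sorted order) is used as the reference; every
    difference from it is returned as a human-readable string.
    """
    hosts = sorted(confs)
    ref_host = hosts[0]
    ref = nodeid_map(confs[ref_host])
    problems = []
    for host in hosts[1:]:
        other = nodeid_map(confs[host])
        for name in sorted(set(ref) | set(other)):
            if ref.get(name) != other.get(name):
                problems.append(
                    f"{name}: {ref_host} has nodeid {ref.get(name)}, "
                    f"{host} has nodeid {other.get(name)}"
                )
    return problems


# Hypothetical sample data: hostB's copy assigns a different nodeid to node2.
CONF_A = """<cluster name="demo" config_version="3">
  <clusternodes>
    <clusternode name="node1" nodeid="1"/>
    <clusternode name="node2" nodeid="2"/>
  </clusternodes>
</cluster>"""
CONF_B = CONF_A.replace('nodeid="2"', 'nodeid="7"')

for line in compare_confs({"hostA": CONF_A, "hostB": CONF_B}):
    print(line)
```

An empty result only says the nodeid assignments in the compared files agree; it does not rule out the configuration having changed while the cluster was running, so the 'cman_tool nodes' output is still worth attaching.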