Bug 889098
| Summary: | rgmanager did not recognized config updates from ccs | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Josef Zimek <pzimek> |
| Component: | rgmanager | Assignee: | Ryan McCabe <rmccabe> |
| Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list> |
| Severity: | urgent | Docs Contact: | |
| Priority: | urgent | ||
| Version: | 5.8 | CC: | ahoness, ccaulfie, chad, cluster-maint, djansa, edamato, fdinitto, jentrena, jruemker, klaus.steinberger, mjuricek, mkarg, rmccabe, syeghiay |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | rgmanager-2.0.52-44.el5 | Doc Type: | Bug Fix |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2013-09-30 22:37:16 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 928849 | ||
|
Description
Josef Zimek
2012-12-20 08:37:28 UTC
can you please check that the ondisk version of cluster.conf on all nodes is at 187 and please collect all of /var/log/messages from all the nodes. The ccsd update appears to have succeeded, but cman is not using the configuration. We will need the full logs to try to understand why. Also a copy of cluster.conf at 186 and 187 might be useful. Also it's worth checking that cman has seen the update, it might just be rgmanager that is using the older version. Comparing the output of "cman_tool status" with the rgmanager dump will clear this up. I don´t have access to the ticket, can you please give me the information I asked for in comment #1 and also for Chrissie in comment #2 ? This looks a lot like https://bugzilla.redhat.com/show_bug.cgi?id=822104 but fencing is not in progress. rgmanager is simply stuck at config version 184 vs on disk (and cman/ccsd) 187. Interesting enough, rgmanager daemon did not produce one single line of log (despite configured to log_level="7") since: Aug 23 22:26:45 sapproclt01 clurgmgrd: [18949]: <notice> Getting status Aug 19 03:56:14 sapproclt02 clurgmgrd: [20392]: <notice> Getting status Aug 23 22:21:57 sapproclt03 clurgmgrd: [18512]: <notice> Getting status Aug 23 23:01:19 sapproclt04 clurgmgrd[18930]: <info> Starting changed resources. [huge no log gap] Oct 22 22:00:16 sapproclt04 clurgmgrd[18930]: <notice> Reconfiguring ... Oct 22 22:31:21 sapproclt04 clurgmgrd: [18930]: <notice> Getting status [no more logs] It appears that log has stopped working at the same time of: sapproclt01 messages.9:Aug 23 23:01:03 sapproclt01 ccsd[15551]: Update of cluster.conf complete (version 184 -> 185). sapproclt02 ccsd has not logged 184 -> 185 update but based on rgmanager dump the config is live. sapproclt03 messages.9:Aug 23 23:01:03 sapproclt03 ccsd[15529]: Update of cluster.conf complete (version 184 -> 185). sapproclt04 messages.9:Aug 23 23:01:03 sapproclt04 ccsd[15664]: Update of cluster.conf complete (version 184 -> 185). Assuming that the cluster.conf stored in the sosreports are the same that have been pushed to production, the differences between 184 and 185 are only confined to few <fs/> services. diff -u sapproclt03-92388/etc/cluster/cluster.conf.230812 sapproclt04-380775/etc/cluster/cluster.conf.221012 (In reply to comment #4) > I don´t have access to the ticket, can you please give me the information I > asked for in comment #1 and also for Chrissie in comment #2 ? (In reply to comment #4) > I don´t have access to the ticket, can you please give me the information I > asked for in comment #1 and also for Chrissie in comment #2 ? On disk version of cluster.conf is updated in all nodes: $ cat sapproclt0*-*/etc/cluster/cluster.conf|grep config_version <cluster alias="cl_PepeJeans" config_version="187" name="cl_PepeJeans"> <cluster alias="cl_PepeJeans" config_version="187" name="cl_PepeJeans"> <cluster alias="cl_PepeJeans" config_version="187" name="cl_PepeJeans"> <cluster alias="cl_PepeJeans" config_version="187" name="cl_PepeJeans"> cman has latest version in all nodes: $ cat sapproclt0*-*/sos_commands/cluster/cman_tool_status | egrep "Node ID|Config Version" Config Version: 187 Node ID: 1 Config Version: 187 Node ID: 2 Config Version: 187 Node ID: 3 Config Version: 187 Node ID: 4 rgmanager doesn't: $ cat sapproclt0*_rgmanager-dump*|grep version Cluster configuration version 184 Cluster configuration version 184 Cluster configuration version 184 Cluster configuration version 187 *** Bug 822104 has been marked as a duplicate of this bug. *** Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHBA-2013-1316.html |