Bug 1087286
| Summary: | The cman service could not start after updating the RHEL version from 6.0 to RHEL6.3 | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | henry <xbai> |
| Component: | cluster | Assignee: | Christine Caulfield <ccaulfie> |
| Status: | CLOSED ERRATA | QA Contact: | cluster-qe <cluster-qe> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 6.3 | CC: | ccaulfie, cluster-maint, jkortus, jpayne, jpokorny, rpeterso, teigland |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: |
Previously, errors generated while updating the resource-agents scheme were sometimes not reported. As a consequence, if an error occurred when updating the resource-agents schema, the update failed silently and later attempts to start the cman service could fail as well. With this update, schema errors are reported, and remedial action can be taken at upgrade time in case of problems.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2015-07-22 07:04:28 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 1075802 | ||
|
Description
henry
2014-04-14 08:16:07 UTC
Is this reproducable at all or has it only happened to this one customer? (In reply to Christine Caulfield from comment #2) > Is this reproducable at all or has it only happened to this one customer? Hi Christine, Thanks for you reply. The issue occurs in several ha-cluster (4 or 6 two-nodes clusters) in this CU's productioin environment. Actually, I only got this specified customer did the cluster upgrade operation and hit the problem. Henry Bai GSS China Hi Engineering team, Is there any feedback about the issue ? We took a sbr-cluster meeting last week and talked about this issue, Ryan Mitchell in SEG team said the other customer also hit the same issue. So, I think it is not a special case. Would you pls help me double confirm if it is a bug ? Thanks, Henry Bai GSS China A little proposal to make such alleged situation better identifiable next time (there can be variants): https://www.redhat.com/archives/cluster-devel/2014-April/msg00089.html (there can be *better* variants) Patch landed upstream (STABLE32 branch). commit 8fd4192e154384a4e5a7f4b16dc5365118ac98d1
Author: Jan Pokorný <jpokorny>
Date: Tue Apr 29 23:24:30 2014 +0200
xml: ccs_update_schema: be verbose about extraction fail
Previously, the distillation of resource-agents' metadata could fail
from unexpected reasons without any evidence ever being made, unlike
in case of fence-agents. Also "no metadata" and "issue with their
extraction" will allegedly yield the same outcome, so it is reflected
in the comments being emitted to the schema for both sorts of agents.
Signed-off-by: Jan Pokorný <jpokorny>
Verified in cman-3.0.12.1-73.el6: [root@host-134 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 6.0 (Santiago) [root@host-134 ~]# rpm -q cman cman-3.0.12-23.el6_0.7.x86_64 [root@host-134 ~]# /etc/init.d/cman start Starting cluster: Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root@host-135 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 6.0 (Santiago) [root@host-135 ~]# rpm -q cman cman-3.0.12-23.el6_0.7.x86_64 [root@host-135 ~]# /etc/init.d/cman start Starting cluster: Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root@host-136 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 6.0 (Santiago) [root@host-136 ~]# rpm -q cman cman-3.0.12-23.el6_0.7.x86_64 [root@host-136 ~]# /etc/init.d/cman start Starting cluster: Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root@host-133 ~]# for i in `seq 4 6`; do qarsh root@host-13$i rpm -q cman; done cman-3.0.12.1-73.el6.x86_64 cman-3.0.12.1-73.el6.x86_64 cman-3.0.12.1-73.el6.x86_64 [root@host-134 ~]# /etc/init.d/cman start Starting cluster: Checking if cluster has been disabled at boot... [ OK ] Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Tuning DLM kernel config... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root@host-135 ~]# /etc/init.d/cman start Starting cluster: Checking if cluster has been disabled at boot... [ OK ] Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Tuning DLM kernel config... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] [root@host-136 ~]# /etc/init.d/cman start Starting cluster: Checking if cluster has been disabled at boot... [ OK ] Checking Network Manager... [ OK ] Global setup... [ OK ] Loading kernel modules... [ OK ] Mounting configfs... [ OK ] Starting cman... [ OK ] Waiting for quorum... [ OK ] Starting fenced... [ OK ] Starting dlm_controld... [ OK ] Tuning DLM kernel config... [ OK ] Starting gfs_controld... [ OK ] Unfencing self... [ OK ] Joining fence domain... [ OK ] Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2015-1363.html |