Bug 160646
Summary: | GFS cluster node does not shutdown: CMANsendmsg failed: -101 | ||
---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Axel Thimm <axel.thimm> |
Component: | cman | Assignee: | Jonathan Earl Brassow <jbrassow> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 4 | CC: | jbrassow |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | RHEL3 U5 | Doc Type: | Bug Fix |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2005-10-04 17:17:32 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Axel Thimm
2005-06-16 10:54:43 UTC
Are the init scripts active so the system shuts down in the right order? Shutting down the network before shutting cman would cause a problem like this. Yes, all GFS related init scripts have ben chkconfig-enabled. The shutdown is performed by normal init scripts in the script-given ordering. Perhaps cman shutdown fails for any reason and later on cman holds the final rebooting? Then there would be two bugs, one for not having cman properly shutdown (and I can imagine fencing to take part in this), and another one for the not-stopped cman not allowing a system to shutdown/reboot. I believe this was solved by alewis by altering the clvm init script. hrm...not sure - are there any initscript errors before this happens? I don't think there were any errors that were reported by the clvmd init script. Previously, it would shutdown volumes, but not kill the clvmd daemon. Since the daemon was still logged into cman, cman would refuse to shutdown and start spitting out errors like described above.... The way to get to the bottom of this hypothesis is to have the user attach their clvmd init script and check to make sure that it is killing off the daemon during all shutdown cases. The clvmd init script now kills off the daemon when shutting down The resolution of this bug is RHEL3 (aka RHCS 3), while the bug was opened against FC4 which is more like RHEL4 wrt to RHCS/RHGFS. Also there seems to still be some racing in RHEL4 in shutting down the cluster with for service in rgmanager gfs clvmd fenced cman ccsd; do service $service stop done Sometimes cman fails to stop, and service cman stop needs to be reissued. |