Bug 275751
Summary: | New service gets stuck in recovering state if start script exits with status 1 | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | [Retired] Red Hat Cluster Suite | Reporter: | Mark Huth <mhuth> | ||||||||||
Component: | rgmanager | Assignee: | Lon Hohberger <lhh> | ||||||||||
Status: | CLOSED WONTFIX | QA Contact: | Cluster QE <mspqa-list> | ||||||||||
Severity: | low | Docs Contact: | |||||||||||
Priority: | medium | ||||||||||||
Version: | 4 | CC: | cfeist, cluster-maint, tao | ||||||||||
Target Milestone: | --- | ||||||||||||
Target Release: | --- | ||||||||||||
Hardware: | All | ||||||||||||
OS: | Linux | ||||||||||||
Whiteboard: | |||||||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||||||
Doc Text: | Story Points: | --- | |||||||||||
Clone Of: | Environment: | ||||||||||||
Last Closed: | 2008-12-03 17:06:39 UTC | Type: | --- | ||||||||||
Regression: | --- | Mount Type: | --- | ||||||||||
Documentation: | --- | CRM: | |||||||||||
Verified Versions: | Category: | --- | |||||||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||||||
Embargoed: | |||||||||||||
Attachments: |
|
Description
Mark Huth
2007-09-04 05:46:33 UTC
Created attachment 185661 [details]
gdb backtrace on one node
Created attachment 185671 [details]
gdb backtrace on second node
Created attachment 185681 [details]
/proc/cluster/dlm_debug on one node
Created attachment 185691 [details]
/proc/cluster/dlm_debug on second node
This looks like a race between node 1 reconfiguring and node 2 reconfiguring. The easiest thing to do here is synchronize reconfiguration. Ex: Node 1 reconfigures, gets new resource(s) Node 1 decides to start resources, but fails - Node 1 stops resources Node 1 tells node 2 to start resources Node 2 says "Ehhhhh?" Node 2 reconfigures, gets new resources. Service gets stuck in recovering state. This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. Fixing this would be too invasive and may require adding config versions to rgmanager messages (e.g. "try again if your config version is newer than mine"). You can disable and enable the service again after the transition is complete. |