Bug 1611631
| Summary: | 'cibadmin --upgrade' times out on non-DC nodes if schema is already the latest available | ||
|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Tomas Jelinek <tojeline> |
| Component: | pacemaker | Assignee: | Jan Pokorný [poki] <jpokorny> |
| Status: | CLOSED RAWHIDE | QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
| Severity: | medium | Docs Contact: | |
| Priority: | low | ||
| Version: | rawhide | CC: | andrew, anprice, jpokorny, lhh, tojeline |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | pacemaker-2.0.0-2.fc29 | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2018-08-09 14:54:03 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
Description
Tomas Jelinek
2018-08-02 13:42:43 UTC
The exit status of 1 on the DC is also a bug. It should be 0, and the message should be "Upgrade unnecessary: Schema is already the latest available". We can include that issue in this bz, too.

Upon investigation, I found the timeout issue has existed since at least upstream 1.1.11 (I reproduced as far back as 1.1.16) and possibly always.

When a non-DC node gets an upgrade request, it forwards it to the DC. When the DC gets it, if an upgrade is required, it resends the request to all nodes asking for an upgrade to a specific version, and all the nodes perform that upgrade locally, notifying any clients (such as cibadmin) of the result. The problem is that if an upgrade is not required, the DC does not do anything further, so the non-DC nodes never do anything either, and the client never gets a notification.

We will need to change this so that the DC always sends a result to at least the requesting node, even if an upgrade is not required. This will only work once all nodes in a cluster are upgraded to a pacemaker version with the fix (that is, in a rolling upgrade, the fix will not take effect until all nodes are upgraded).

The timeout issue is fixed upstream by commit 1f05f5e2 and the exit status issue by commit f5e936fb.

Re-assigning to Jan Pokorný for release.
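
For illustration only, the request flow described above can be modelled with a small Python simulation. This is a rough sketch and not pacemaker code: the `Node` and `Cluster` classes, the `dc_always_replies` switch, the integer schema versions, and the result strings are all hypothetical names invented for the example. A return value of `None` stands in for the client waiting until it times out.

```python
from dataclasses import dataclass, field

LATEST_SCHEMA = 3  # hypothetical "latest available" schema version


@dataclass
class Node:
    name: str
    schema: int
    is_dc: bool = False
    # Results delivered to local clients (such as cibadmin) on this node.
    notifications: list = field(default_factory=list)


class Cluster:
    def __init__(self, nodes, dc_always_replies):
        self.nodes = {n.name: n for n in nodes}
        self.dc = next(n for n in nodes if n.is_dc)
        # False models the behaviour described in the report,
        # True models the proposed fix.
        self.dc_always_replies = dc_always_replies

    def request_upgrade(self, origin_name):
        """Return the result the client on origin_name sees, or None on timeout."""
        origin = self.nodes[origin_name]
        # A non-DC node forwards the request to the DC; modelled as a direct call.
        self._dc_handle(origin)
        return origin.notifications[-1] if origin.notifications else None

    def _dc_handle(self, origin):
        if self.dc.schema < LATEST_SCHEMA:
            # Upgrade required: the DC asks every node to upgrade to a
            # specific version, and each node notifies its own clients.
            for node in self.nodes.values():
                node.schema = LATEST_SCHEMA
                node.notifications.append("upgraded")
        elif self.dc_always_replies or origin is self.dc:
            # No upgrade required: only the fixed DC (or a request made on
            # the DC itself) produces a result for the requester.
            origin.notifications.append("unnecessary")
        # Otherwise nothing is sent, so the requester's client never hears back.
```

With two nodes already at the latest schema, `Cluster(nodes, dc_always_replies=False).request_upgrade("node2")` returns `None` in this model, which corresponds to the timeout seen on non-DC nodes, while the same call with `dc_always_replies=True` returns a result. The model does not cover the rolling-upgrade caveat, where the fix only takes effect once every node runs a fixed version.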