Cause: Corosync sometimes fails to start correctly in IPv6 environments
Consequence: Corosync can fail to start on reboot of controller nodes
Workaround (if any): On reboot of the controller nodes, when the host comes back up, check for corosync status. If it failed, start the following services manually and in the following order: corosync, pacemaker, pcsd
Result: The corosync service and related cluster services should come up correctly when restarted.
Created attachment 1144242[details]
corosync fail
Description of problem:
After reboot corosync fails to start on one of the controllers in an upgraded 7.3->8 IPv6 environment.
Steps to Reproduce:
1. Upgrade overcloud from 7.3->8
2. Once the upgrade is complete reboot the controllers serially
Actual results:
Corosync failed to start when rebooting the last controller.
Expected results:
All the controllers get back online after reboot.
Additional info:
The issue I am seeing looks pretty similar to the one describe by BZ#1245951
I am attaching the corosync log on the failed controller.
Moving back to 8. This isn't targeted for a specific milestone because we're dependent on a RHEL bug fix, but once it's fixed, this will become testonly.
Comment 10Fabio Massimo Di Nitto
2017-02-22 12:08:33 UTC
*** This bug has been marked as a duplicate of bug 1245951 ***