Bug 1313529

Summary: rhel-osp-director: [Doc] Replacing Controller Nodes for 8.0 is missing.
Product: Red Hat OpenStack Reporter: Alexander Chuzhoy <sasha>
Component: documentationAssignee: Dan Macpherson <dmacpher>
Status: CLOSED CURRENTRELEASE QA Contact: Alexander Chuzhoy <sasha>
Severity: unspecified Docs Contact:
Priority: high    
Version: 8.0 (Liberty)CC: dmacpher, michele, sasha, srevivo
Target Milestone: ---Keywords: Documentation
Target Release: 8.0 (Liberty)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-06-16 04:41:11 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1286302    

Description Alexander Chuzhoy 2016-03-01 19:37:25 UTC
rhel-osp-director: [Doc] Replacing Controller Nodes for 8.0 is missing.


Tried to follow the steps for 7:
https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux_OpenStack_Platform/7/html/Director_Installation_and_Usage/Replacing_Controller_Nodes.html


The pcs cluster was in maintenance mode, so I couldn't even start the keystone.

After manually disabling the maintenance mode on  pcs cluster, still wasn't able to successfully complete re-deployment of the overcloud as mentioned in step 7 in the guide above.

Comment 2 Mike Burns 2016-03-01 19:43:38 UTC
*** Bug 1313528 has been marked as a duplicate of this bug. ***

Comment 4 Dan Macpherson 2016-04-04 17:23:53 UTC
Sorry this took so long. I've only just been able to successfully deploy a HA Overcloud on OSP 8.

Sasha, I ran into the same error you did but I think I managed to figure it out. Essentially, there's still some rouge entries for overcloud-controller-1 still in Pacemaker/Corosync, which is causing pcs to auth in a loop. However, once I cleared them away, the deployment seems to be progressing fine. I'll update the procedure tomorrow and that way you should be to test it out.

Comment 5 Dan Macpherson 2016-04-05 19:23:24 UTC
Okay, tested it out. It seems to be the failed node entry in corosync.conf that is stopping the deployment from continuing. I've got that step closer to the end, but I'll bump it up to before step 6 (restarting pacemaker and corosync). Based on my testing, this should work.

Comment 7 Dan Macpherson 2016-05-12 05:51:30 UTC
I think this might be a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1327701

In any case, I'll switch this over the ON_QA too. As mentioned int he other BZ, here's the latest draft:

https://access.stage.redhat.com/documentation/en/red-hat-openstack-platform/8/director-installation-and-usage/94-replacing-controller-nodes

Sasha, is there anything else needed for this procedure?

Comment 8 Alexander Chuzhoy 2016-05-12 13:09:47 UTC
Verified:

This section of the doc looks good.

Comment 9 Dan Macpherson 2016-06-16 04:41:11 UTC
Changes now live on the customer portal.