Bug 1569621 - openshift-ansible aws provisioning masters need to be detached from scale group
Status: CLOSED ERRATA
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 3.9.0
Hardware: Unspecified OS: Unspecified
Priority: unspecified Severity: high
Target Milestone: ---
Target Release: 3.11.0
Assigned To: Chris Callegari
QA Contact: sheng.lao
Keywords: OpsBlocker
Reported: 2018-04-19 11:24 EDT by Matt Woodson
Modified: 2018-10-11 03:19 EDT (History)

Doc Type: Enhancement
Doc Text:
Feature: Master nodes are now deployed as standalone EC2 instances.
Reason: On AWS, OCP master instances cannot run under an Auto Scaling group, because unexpected redeployments damage the static etcd membership.
Result: OCP on AWS has stable control plane instances.
Last Closed: 2018-10-11 03:19:10 EDT
Type: Bug




External Trackers
Tracker                 ID              Priority  Status  Summary  Last Updated
Red Hat Product Errata  RHBA-2018:2652  None      None    None     2018-10-11 03:19 EDT

Description Matt Woodson 2018-04-19 11:24:05 EDT
Description of problem:

Currently the openshift-ansible cluster provisioning code provisions the masters inside a scale group. Masters must not be in an AWS scale group while running.

Masters are currently too much of a pet to be added to a scale group.

Running masters in a scale group has too many potential issues:

- A master can never be shut down or stopped. If it is, the scale group terminates it. Even with termination protection enabled, the AWS instance is terminated.

- A master node can never be resized. Resizing a node requires shutting it down, and when that happens the scale group deletes it.

- If a scale group spans multiple AZs and an AZ goes down, the instance is terminated and recreated in a new AZ.


Version-Release number of selected component (if applicable):

openshift-ansible 3.9, 3.10


Additional info:

One solution would be to provision the nodes with a scalegroup, but once done, could detach the instances from
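
The detach step suggested above could look roughly like the following sketch. It assumes the instances are being detached from their scale group, and it assumes a client object that behaves like boto3's Auto Scaling client (`boto3.client("autoscaling")`) with its `detach_instances` call. The function name, group name, and instance IDs are hypothetical, and this is not necessarily how openshift-ansible handles it. Instances are sent in batches of 20, which is the documented per-call limit for `DetachInstances`.

```python
def detach_from_scaling_group(autoscaling, group_name, instance_ids, batch_size=20):
    """Detach instances from an Auto Scaling group without replacing them.

    `autoscaling` is assumed to behave like boto3.client("autoscaling").
    Returns the list of batches that were submitted.
    """
    batches = []
    for start in range(0, len(instance_ids), batch_size):
        batch = list(instance_ids[start:start + batch_size])
        autoscaling.detach_instances(
            AutoScalingGroupName=group_name,
            InstanceIds=batch,
            # True shrinks the desired capacity, so the group does not
            # launch replacement instances for the detached masters.
            ShouldDecrementDesiredCapacity=True,
        )
        batches.append(batch)
    return batches


# Hypothetical usage (requires boto3 and AWS credentials):
#   import boto3
#   detach_from_scaling_group(
#       boto3.client("autoscaling"), "mycluster-master", ["i-0123456789abcdef0"]
#   )
```

Passing the client in as an argument keeps the sketch easy to exercise with a stub. In a real account, the group's minimum size may have to be lowered first, since decrementing the desired capacity below the minimum is rejected.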
Comment 1 Steve Speicher 2018-05-17 16:13:56 EDT
Moving to high, since this is an ops blocker.
Comment 5 Chris Callegari 2018-08-27 09:42:52 EDT
pull/9736 has been accepted and is ready for QE.
Comment 6 Matt Woodson 2018-08-27 10:36:35 EDT
Is this change multi-az aware?
Comment 7 Chris Callegari 2018-08-27 11:16:08 EDT
Matt, yes! Masters can also be resized without redeployment.
Comment 8 sheng.lao 2018-08-31 02:07:05 EDT
Fixed at: openshift-ansible-3.11.0-0.25.0-34-g04f8519

1. Check that the masters are standalone instances:
# aws autoscaling describe-auto-scaling-groups | grep "master group name"
#
Empty output means the masters are standalone (no master scale group exists).
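
The same check can be scripted against the API response instead of grepping CLI output. The sketch below is only an illustration: it assumes a response dictionary shaped like the one returned by boto3's `describe_auto_scaling_groups`, and the helper name and the group names in the usage comment are hypothetical.

```python
def groups_matching(response, name_fragment):
    """Return the names of Auto Scaling groups whose name contains name_fragment.

    `response` is assumed to have the shape of a
    describe_auto_scaling_groups result: {"AutoScalingGroups": [...]}.
    An empty result means no matching scale group exists.
    """
    groups = response.get("AutoScalingGroups", [])
    return [
        group["AutoScalingGroupName"]
        for group in groups
        if name_fragment in group["AutoScalingGroupName"]
    ]


# Hypothetical usage (requires boto3 and AWS credentials):
#   import boto3
#   response = boto3.client("autoscaling").describe_auto_scaling_groups()
#   if not groups_matching(response, "master"):
#       print("masters are standalone")
```

A single call returns only one page of results, so an account with many scale groups would need to page through all of them before trusting an empty result.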
Comment 11 errata-xmlrpc 2018-10-11 03:19:10 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2652
