Description of problem: On native ha installs atomic-openshift-master.service needs to be removed. $ rm /usr/lib/systemd/system/atomic-openshift-master.service To many users are starting this service expecting it to be running. If this service is running then the api and controllers will not start. Version-Release number of selected component (if applicable): 3.1.x How reproducible: 100% Steps to Reproduce: 1. run the following on all masters $ systemctl stop atomic-openshift-master-controllers 2. run the following on all masters $ systemctl stop atomic-openshift-master-api 3. then start atomic-openshift-master and then api and controllers Actual results: Failures on the api and controllers. Expected results: For atomic-openshift-master.service to kick off the api and controllers instead of a single master process. Or for this service to be removed. Additional info: Suggested change: Change the service to start both api and controllers: $cat /usr/lib/systemd/system/atomic-openshift-master.service [Unit] Description=Atomic OpenShift Master Documentation=https://github.com/openshift/origin Requires=atomic-openshift-master-api.service Requires=atomic-openshift-master-controllers.service [Service] Type=simple ExecStart=/usr/bin/echo "Starting atomic-openshift-master-api.service and atomic-openshift-master-controllers.service" [Install] WantedBy=multi-user.target
We can't really remove it since it is tracked by the RPM package and any update would replace it. What we could do though is to mask the service to make it difficult for them to start it up unintentionally. That does make me wonder whether we could use an override unit to make it basically do nothing but kick off both the api and controllers services instead though... We could basically drop a replacement unit into /etc/systemd/system and it could override the behavior of the default shipped unit.
I've created the following PR to mask the service: https://github.com/openshift/openshift-ansible/pull/1342 While we could try to create a unit to control both, I suspect we'll hit edge cases where it doesn't behave properly. For example, the suggested changes you've proposed would work for start, but would not work for stop/restart.
Check on atomic-openshift-utils-3.0.40-1.git.22.da21865.el7aos.noarch Install native-ha env. Check the atomic-openshift-master service # service atomic-openshift-master restart Redirecting to /bin/systemctl restart atomic-openshift-master.service Failed to restart atomic-openshift-master.service: Unit atomic-openshift-master.service is masked. Move this issue to VERIFIED.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2016:0311