1306011 – Deployer pods incorrectly using the host entry from openshiftLoopbackKubeconfig

Bug 1306011 - Deployer pods incorrectly using the host entry from openshiftLoopbackKubeconfig

Summary: Deployer pods incorrectly using the host entry from openshiftLoopbackKubeconfig

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	openshift-controller-manager
Sub Component:
Version:	3.1.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Dan Mace
QA Contact:	Johnny Liu
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2016-02-09 20:16 UTC by Jason DeTiberus
Modified:	2019-11-14 07:25 UTC (History)
CC List:	6 users (show)
Fixed In Version:	v3.1.1.902
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2016-05-12 16:28:35 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Red Hat Knowledge Base (Solution)	2264301	None	None	None	2017-01-18 06:19:34 UTC
Red Hat Product Errata	RHSA-2016:1064	normal	SHIPPED_LIVE	Important: Red Hat OpenShift Enterprise 3.2 security, bug fix, and enhancement update	2016-05-12 20:19:17 UTC
Red Hat Product Errata	RHSA-2016:1094	normal	SHIPPED_LIVE	Important: Red Hat OpenShift Enterprise 3.2 security update	2016-05-20 00:12:27 UTC

Description Jason DeTiberus 2016-02-09 20:16:15 UTC

Description of problem:
When a multi-master deployment has updated the context of the openshiftLoopbackKubeconfig to avoid traversing the load-balancer deployer pods attempt to contact the master using the updated hostname.

Version-Release number of selected component (if applicable):
3.1.1.6

How reproducible:
Everytime

Steps to Reproduce:
1. Block node -> master traffic on port 8443
2. Install a native HA install using openshift-ansible
3. Update /etc/origin/master/openshift-master.kubeconfig with a new cluster, context and updating the current-context for directly connecting to itself rather than the clustered hostname.
4. Initiate a deployment

Actual results:
deployer pod fails attempting to contact the master directly rather than through the load balancer, causing the request to fail/timeout.

Expected results:
The deployer pod should contact the master over the internal clustered hostname and the deployment should succeed.

Additional info:

Comment 1 Jason DeTiberus 2016-02-09 20:19:01 UTC

I discussed this with deads2k and liggitt on #openshift-dev about this. The deployer pod should be using a serviceaccount and dns for config rather than the ENV variables injected from the openshiftLoopbackKubeconfig like it currently is.The deployment should be successful

Comment 2 Dan Mace 2016-02-10 14:21:04 UTC

I agree with David and Jordan's assessment. The cited deployer code predates the user of service accounts and was never updated when SAs were introduced.

Comment 3 Dan Mace 2016-02-10 18:47:52 UTC

https://github.com/openshift/origin/pull/7197

Comment 4 zhou ying 2016-02-15 06:08:47 UTC

Since this bug is OSE's bug, could you please provide the puddle or package version. thanks.

Comment 5 Dan Mace 2016-02-15 19:54:50 UTC

Still waiting for https://github.com/openshift/origin/pull/7197 to be incorporated into OSE. Once I have a tag containing the fix, I'll update the bz and clear the needinfo. Thanks!

Comment 7 Jason DeTiberus 2016-02-19 22:01:36 UTC

Steps for updating the context:

> oc config view

Take note of the name value of the user listed under users, it will start with 'system:admin/'

> oc config set-cluster --certificate-authority=/etc/origin/master/ca.crt \
  --embed-certs=true --server=https://<openshift_hostname>:<api port> \
  my_loopback_cluster --config=/etc/origin/master/openshift-master.kubeconfig

> oc config set-context --cluster=my_loopback_cluster --namespace=default \
  --user=<system admin name> my_loopback_context \
  --config=/etc/origin/master/openshift-master.kubeconfig

> oc config use-context my_loopback_context \
  --config=/etc/origin/master/openshift-master.kubeconfig

Comment 16 errata-xmlrpc 2016-05-12 16:28:35 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2016:1064

Note You need to log in before you can comment on or make changes to this bug.