Bug 1676399
Summary: | upgrade failed due to can not pull image from an wrong registry address | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | liujia <jiajliu> |
Component: | Cluster Version Operator | Assignee: | Joseph Callen <jcallen> |
Status: | CLOSED ERRATA | QA Contact: | liujia <jiajliu> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 3.11.0 | CC: | aos-bugs, jokerman, kalexand, mmccomas |
Target Milestone: | --- | ||
Target Release: | 3.11.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-06-26 09:07:54 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
liujia
2019-02-12 08:02:00 UTC
This would be limited in scope to those customers who are changing oreg_url during an upgrade which I suspect is rare but we probably did handle that pattern in 3.9 and earlier and we may want to attempt to patch all configmaps with the new oreg_url. Just finished testing and confirm that the registry is not being updated. imageConfig: format: registry.redhat.io/openshift3/ose-${component}:${version} latest: false Hi liujia, There is a playbook to change the imageConfig.format after upgrade. See this PR for additional context: https://github.com/openshift/openshift-ansible/pull/9784 Here is the path to the playbook "playbooks/openshift-node/imageconfig.yml" I think the PR looks good. @Jia Liu, will you please confirm? Still hit the same issue on openshift-ansible-3.11.106-1.git.0.2d027da.el7.noarch. Steps: 1. Install ocp v3.11.88 with default registry. # docker images|grep ose-pod registry.redhat.io/openshift3/ose-pod v3.11 d5f897cfbb0d 13 days ago 238 MB registry.redhat.io/openshift3/ose-pod v3.11.88 ff8efa1e789c 6 weeks ago 238 MB 2. Edit inventory file to specify a private registry oreg_url=registry.reg-aws.openshift.com:443/openshift3/ose-${component}:${version} oreg_auth_user={{ lookup('env','REG_AUTH_USER') }} oreg_auth_password={{ lookup('env','REG_AUTH_PASSWORD') }} 3. Upgrade above ocp to latest v3.11(v3.11.106-) Upgrade failed at the same task. TASK [openshift_node : Wait for master API to come back online] **************** task path: /usr/share/ansible/openshift-ansible/roles/openshift_node/tasks/upgrade/restart.yml:66 Tuesday 16 April 2019 07:12:09 +0000 (0:00:00.881) 0:12:57.237 ********* fatal: [x]: FAILED! => {"changed": false, "elapsed": 600, "msg": "Timeout when waiting for ip-172-18-13-102.ec2.internal:8443"} some node logs: Apr 16 03:51:03 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: logging error output: "Unauthorized" Apr 16 03:51:04 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: E0416 03:51:04.238172 5251 remote_runtime.go:92] RunPodSandbox from runtime service failed: rpc error: code = Unknown desc = error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: "File not found.\"" Apr 16 03:51:04 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: E0416 03:51:04.238228 5251 kuberuntime_sandbox.go:56] CreatePodSandbox for pod "master-api-ip-172-18-13-102.ec2.internal_kube-system(9d066f84b20195c767ec4ed9d7ac3ba2)" failed: rpc error: code = Unknown desc = error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: "File not found.\"" Apr 16 03:51:04 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: E0416 03:51:04.238253 5251 kuberuntime_manager.go:646] createPodSandbox for pod "master-api-ip-172-18-13-102.ec2.internal_kube-system(9d066f84b20195c767ec4ed9d7ac3ba2)" failed: rpc error: code = Unknown desc = error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: "File not found.\"" Apr 16 03:51:04 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: E0416 03:51:04.238355 5251 pod_workers.go:186] Error syncing pod 9d066f84b20195c767ec4ed9d7ac3ba2 ("master-api-ip-172-18-13-102.ec2.internal_kube-system(9d066f84b20195c767ec4ed9d7ac3ba2)"), skipping: failed to "CreatePodSandbox" for "master-api-ip-172-18-13-102.ec2.internal_kube-system(9d066f84b20195c767ec4ed9d7ac3ba2)" with CreatePodSandboxError: "CreatePodSandbox for pod \"master-api-ip-172-18-13-102.ec2.internal_kube-system(9d066f84b20195c767ec4ed9d7ac3ba2)\" failed: rpc error: code = Unknown desc = error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: \"File not found.\\\"\"" Apr 16 03:51:04 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: I0416 03:51:04.238403 5251 server.go:470] Event(v1.ObjectReference{Kind:"Pod", Namespace:"kube-system", Name:"master-api-ip-172-18-13-102.ec2.internal", UID:"9d066f84b20195c767ec4ed9d7ac3ba2", APIVersion:"v1", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'FailedCreatePodSandBox' Failed create pod sandbox: rpc error: code = Unknown desc = error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: "File not found.\"" Apr 16 03:51:05 ip-172-18-13-102.ec2.internal atomic-openshift-node[5251]: E0416 03:51:05.779921 5251 server.go:226] Unable to authenticate the request due to an error: Post https://ip-172-18-13-102.ec2.internal:8443/apis/authentication.k8s.io/v1beta1/tokenreviews: dial tcp 172.18.13.102:8443: connect: connection refused some docker logs. Apr 16 03:45:16 ip-172-18-13-102.ec2.internal dockerd-current[4934]: time="2019-04-16T03:45:16.155768049-04:00" level=error msg="Handler for GET /v1.26/images/registry.redhat.io/openshift3/ose-pod:v3.11.106/json returned error: No such image: registry.redhat.io/openshift3/ose-pod:v3.11.106" Apr 16 03:45:16 ip-172-18-13-102.ec2.internal dockerd-current[4934]: time="2019-04-16T03:45:16.156197806-04:00" level=error msg="Handler for GET /v1.26/images/registry.redhat.io/openshift3/ose-pod:v3.11.106/json returned error: No such image: registry.redhat.io/openshift3/ose-pod:v3.11.106" Apr 16 03:45:17 ip-172-18-13-102.ec2.internal dockerd-current[4934]: time="2019-04-16T03:45:17.138926749-04:00" level=error msg="Error trying v2 registry: error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: \"File not found.\\\"\"" Apr 16 03:45:17 ip-172-18-13-102.ec2.internal dockerd-current[4934]: time="2019-04-16T03:45:17.138998697-04:00" level=error msg="Attempting next endpoint for pull after error: error parsing HTTP 404 response body: invalid character 'F' looking for beginning of value: \"File not found.\\\"\"" [root@ip-172-18-13-102 ~]# cat /etc/origin/master/master-config.yaml |grep -A 1 imageConfig imageConfig: format: registry.reg-aws.openshift.com:443/openshift3/ose-${component}:${version} [root@ip-172-18-13-102 ~]# cat /etc/origin/node/node-config.yaml |grep -A 1 imageConfig imageConfig: format: registry.redhat.io/openshift3/ose-${component}:${version} # docker images|grep ose-pod registry.reg-aws.openshift.com:443/openshift3/ose-pod v3.11 b676db41573e 30 hours ago 238 MB registry.redhat.io/openshift3/ose-pod v3.11 d5f897cfbb0d 13 days ago 238 MB registry.redhat.io/openshift3/ose-pod v3.11.88 ff8efa1e789c 6 weeks ago 238 MB Build: openshift-ansible-3.11.110-1 According to comment11, add one step before run upgrade playbook. Version: ansible-2.6.17-1.el7ae.noarch openshift-ansible-3.11.119-1.git.0.c9a8ebf.el7.noarch 1. Install ocp v3.11.104 with default registry. # docker images|grep ose-pod registry.redhat.io/openshift3/ose-pod v3.11 6759d8752074 3 weeks ago 1.03 GB registry.redhat.io/openshift3/ose-pod v3.11.88 ff8efa1e789c 3 months ago 238 MB 2. Edit inventory file to specify a private registry oreg_url=registry.reg-aws.openshift.com:443/openshift3/ose-${component}:${version} oreg_auth_user={{ lookup('env','REG_AUTH_USER') }} oreg_auth_password={{ lookup('env','REG_AUTH_PASSWORD') }} 3. Run playbooks/openshift-node/imageconfig.yml 4. Run upgrade to update above ocp to latest v3.11(v3.11.117) Upgrade succeed and images are pulled from specified registry. # docker images REPOSITORY TAG IMAGE ID CREATED SIZE registry.reg-aws.openshift.com:443/openshift3/ose-node v3.11 85e87675ef7b 5 days ago 1.2 GB registry.reg-aws.openshift.com:443/openshift3/ose-control-plane v3.11 bb262ffdc4ff 5 days ago 820 MB registry.reg-aws.openshift.com:443/openshift3/ose-deployer v3.11.117 146bca0da64b 5 days ago 373 MB registry.reg-aws.openshift.com:443/openshift3/ose-kube-rbac-proxy v3.11 b7bd1af18a65 5 days ago 276 MB registry.reg-aws.openshift.com:443/openshift3/ose-console v3.11 0bff93a1dcef 5 days ago 266 MB registry.reg-aws.openshift.com:443/openshift3/ose-web-console v3.11 cbddf00dc079 5 days ago 334 MB registry.reg-aws.openshift.com:443/openshift3/ose-pod v3.11 8d0bf3c3b7f3 5 days ago 250 MB registry.reg-aws.openshift.com:443/openshift3/ose-pod v3.11.117 8d0bf3c3b7f3 5 days ago 250 MB registry.reg-aws.openshift.com:443/openshift3/ose-service-catalog v3.11 0df332403c62 5 days ago 321 MB registry.reg-aws.openshift.com:443/openshift3/ose-template-service-broker v3.11 3b2d527c17b9 5 days ago 324 MB registry.reg-aws.openshift.com:443/openshift3/prometheus-node-exporter v3.11 4970129aac25 5 days ago 237 MB registry.reg-aws.openshift.com:443/openshift3/registry-console v3.11 33ebc54a7694 5 days ago 246 MB registry.redhat.io/openshift3/ose-node v3.11 be8a09b5514c 3 weeks ago 1.97 GB registry.redhat.io/openshift3/ose-control-plane v3.11 c33fa4c530a3 3 weeks ago 1.6 GB registry.redhat.io/openshift3/ose-deployer v3.11.104 1500740029de 3 weeks ago 1.16 GB registry.redhat.io/openshift3/ose-console v3.11 6e555a73ff6e 3 weeks ago 1.05 GB registry.redhat.io/openshift3/ose-pod v3.11 6759d8752074 3 weeks ago 1.03 GB registry.redhat.io/openshift3/ose-service-catalog v3.11 410f55e8c706 3 weeks ago 1.1 GB registry.redhat.io/openshift3/ose-web-console v3.11 4c147a14b66f 3 weeks ago 1.12 GB registry.redhat.io/openshift3/ose-kube-rbac-proxy v3.11 cdfa9d0da060 3 weeks ago 1.06 GB registry.redhat.io/openshift3/ose-template-service-broker v3.11 e0f28a2f2555 3 weeks ago 1.11 GB registry.redhat.io/openshift3/registry-console v3.11 38a5af0ed6c5 3 weeks ago 1.03 GB registry.redhat.io/openshift3/prometheus-node-exporter v3.11 0f508556d522 3 weeks ago 1.02 GB registry.redhat.io/rhel7/etcd 3.2.22 d636cc8689ea 2 months ago 259 MB registry.redhat.io/openshift3/ose-pod v3.11.88 ff8efa1e789c 3 months ago 238 MB registry.reg-aws.openshift.com:443/openshift3/ose-pod v3.11.88 ff8efa1e789c 3 months ago 238 MB registry.reg-aws.openshift.com:443/rhel7/etcd 3.2.22 bb2f1d4dd3a7 12 months ago 256 MB Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2019:1605 |