Bug 1577960 - [3.9] Fail to upgrade ocp without docker registry deployed
Summary: [3.9] Fail to upgrade ocp without docker registry deployed
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Cluster Version Operator
Version: 3.9.0
Hardware: Unspecified
OS: Unspecified
urgent
urgent
Target Milestone: ---
: 3.9.z
Assignee: Vadim Rutkovsky
QA Contact: liujia
URL:
Whiteboard:
: 1581243 (view as bug list)
Depends On: 1559248 1581243
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-05-14 14:01 UTC by Scott Dodson
Modified: 2022-03-13 14:59 UTC (History)
14 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 1559248
Environment:
Last Closed: 2018-06-15 13:54:40 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift openshift-ansible pull 8354 0 'None' 'closed' 'Don''t validate registry certificates when registry is disabled' 2019-11-27 14:19:59 UTC
Red Hat Knowledge Base (Solution) 3441211 0 None None None 2018-05-14 14:01:54 UTC

Comment 2 liujia 2018-05-18 07:07:30 UTC
Still failed at openshift-ansible-3.9.29-1.git.0.051bc5c.el7.noarch
Checked that the pr has not in the latest build.

Comment 4 Vadim Rutkovsky 2018-05-22 12:18:39 UTC
*** Bug 1581243 has been marked as a duplicate of this bug. ***

Comment 5 Vadim Rutkovsky 2018-05-22 13:10:53 UTC
Additional fix to resolve remaining issues: https://github.com/openshift/openshift-ansible/pull/8474

Comment 6 Vadim Rutkovsky 2018-05-28 08:17:02 UTC
Fix is available in openshift-ansible-3.9.30-1

Comment 7 liujia 2018-05-29 06:05:00 UTC
Version:
openshift-ansible-3.9.30-1.git.0.a91a657.el7.noarch


Upgrade still failed at a new task [openshift_web_console : Verify that the web console is running], original failure after this task couldn't be checked. 

TASK [openshift_web_console : Verify that the web console is running] **********
task path: /usr/share/ansible/openshift-ansible/roles/openshift_web_console/tasks/install.yml:137
FAILED - RETRYING: Verify that the web console is running (60 retries left).
...
FAILED - RETRYING: Verify that the web console is running (1 retries left).
fatal: [x]: FAILED! => {"attempts": 60, "changed": false, "content": "", "failed": true, "msg": "Status code was not [200]: Request failed: <urlopen error [Errno 111] Connection refused>", "redirected": false, "status": -1, "url": "https://webconsole.openshift-web-console.svc/healthz"}
<--snip-->
TASK [openshift_web_console : Report console errors] ***************************
task path: /usr/share/ansible/openshift-ansible/roles/openshift_web_console/tasks/install.yml:190
fatal: [x]: FAILED! => {"changed": false, "failed": true, "msg": "Console install failed."}

This was because web-console needed to be installed during upgrade from v3.7 to v3.9, and web-console can not be deployed successfully without docker-registry.  But our upgrade process would be abort if web-console was not running.

Comment 8 Vadim Rutkovsky 2018-05-29 15:22:16 UTC
(In reply to liujia from comment #7)
> This was because web-console needed to be installed during upgrade from v3.7
> to v3.9, and web-console can not be deployed successfully without
> docker-registry.  But our upgrade process would be abort if web-console was
> not running.

I can't reproduce this - both upgrades with 'openshift_web_console_install' set to 'true' and 'false' have succeeded here.

Please attach inventory, 3.7 install log and 3.9 upgrade log

Comment 11 liujia 2018-05-31 07:37:26 UTC
@ Vadim Rutkovsky

Thx for the remind, I just forgot the image setting issue for 3.7-3.9 upgrade. Sorry for the delay verify. Now original issue can be verified.

Verified on openshift-ansible-3.9.30-1.git.0.a91a657.el7.noarch

TASK [Redeploy docker registry] ************************************************
task path: /usr/share/ansible/openshift-ansible/playbooks/openshift-hosted/private/redeploy-registry-certificates.yml:90
skipping: [x] => {"changed": false, "skip_reason": "Conditional result was False", "skipped": true}

Comment 13 Scott Dodson 2018-06-15 13:54:40 UTC
Fixed in openshift-ansible-3.9.30-1 which has already shipped.


Note You need to log in before you can comment on or make changes to this bug.