Bug 2304073

Summary: remove client-op deployed subscription webhook before it is scaled down by odf-op
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Leela Venkaiah Gangavarapu <lgangava>
Component: odf-operatorAssignee: Nitin Goyal <nigoyal>
Status: CLOSED ERRATA QA Contact: Jilju Joy <jijoy>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 4.17CC: muagarwa, odf-bz-bot
Target Milestone: ---Keywords: AutomationTriaged
Target Release: ODF 4.17.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: 4.17.0-87 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of:
: 2304074 (view as bug list) Environment:
Last Closed: 2024-10-30 14:30:37 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2304074    

Description Leela Venkaiah Gangavarapu 2024-08-12 08:20:39 UTC
Description of problem (please be detailed as possible and provide log
snippests):
odf-op hits an error when upgrading from 4.16 to 4.17 as below

2024-08-06T12:21:40Z	ERROR	Reconciler error	{"controller": "storagesystem", "controllerGroup": "odf.openshift.io", "controllerKind": "StorageSystem", "StorageSystem": {"name":"ocs-storagecluster-storagesystem","namespace":"openshift-storage"}, "namespace": "openshift-storage", "name": "ocs-storagecluster-storagesystem", "reconcileID": "1cd915b8-a271-4308-afd3-49ea67ac8911", "error": "Internal error occurred: failed calling webhook \"subscription.ocs.openshift.io\": failed to call webhook: Post \"https://ocs-client-operator-webhook-server.openshift-storage.svc:443/validate-subscription?timeout=30s\": no endpoints available for service \"ocs-client-operator-webhook-server\""}

Version of all relevant components (if applicable):
4.17, 4.16

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?


Is there any workaround available to the best of your knowledge?


Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?


Can this issue reproducible?
Yes

Can this issue reproduce from the UI?


If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1. Install 4.16.z and upgrade to 4.17
2.
3.


Actual results:
odf-op should be able to upgrade all dependents

Expected results:
odf-op should upgrade successfully

Additional info:
With https://bugzilla.redhat.com/show_bug.cgi?id=2299443 odf-op scales down client-op if not running in provider mode however by that time a validating webhook w/ name `subscription.ocs.openshift.io` would be deployed and odf-op needs to delete that before scaling down client-op

Comment 11 errata-xmlrpc 2024-10-30 14:30:37 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.17.0 Security, Enhancement, & Bug Fix Update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:8676