Bug 2304073 - remove client-op deployed subscription webhook before it is scaled down by odf-op
Summary: remove client-op deployed subscription webhook before it is scaled down by od...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: odf-operator
Version: 4.17
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: ODF 4.17.0
Assignee: Nitin Goyal
QA Contact: Jilju Joy
URL:
Whiteboard:
Depends On:
Blocks: 2304074
TreeView+ depends on / blocked
 
Reported: 2024-08-12 08:20 UTC by Leela Venkaiah Gangavarapu
Modified: 2024-10-30 14:30 UTC (History)
2 users (show)

Fixed In Version: 4.17.0-87
Doc Type: No Doc Update
Doc Text:
Clone Of:
: 2304074 (view as bug list)
Environment:
Last Closed: 2024-10-30 14:30:37 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github red-hat-storage odf-operator pull 462 0 None open controllers: delete the webhook created by the ocs-client-operator 2024-08-12 12:41:04 UTC
Github red-hat-storage odf-operator pull 465 0 None open Bug 2304073:[release-4.17] controllers: delete the webhook created by the ocs-client-operator 2024-08-13 16:15:35 UTC
Red Hat Issue Tracker OCSBZM-8867 0 None None None 2024-08-27 12:25:01 UTC
Red Hat Product Errata RHSA-2024:8676 0 None None None 2024-10-30 14:30:45 UTC

Description Leela Venkaiah Gangavarapu 2024-08-12 08:20:39 UTC
Description of problem (please be detailed as possible and provide log
snippests):
odf-op hits an error when upgrading from 4.16 to 4.17 as below

2024-08-06T12:21:40Z	ERROR	Reconciler error	{"controller": "storagesystem", "controllerGroup": "odf.openshift.io", "controllerKind": "StorageSystem", "StorageSystem": {"name":"ocs-storagecluster-storagesystem","namespace":"openshift-storage"}, "namespace": "openshift-storage", "name": "ocs-storagecluster-storagesystem", "reconcileID": "1cd915b8-a271-4308-afd3-49ea67ac8911", "error": "Internal error occurred: failed calling webhook \"subscription.ocs.openshift.io\": failed to call webhook: Post \"https://ocs-client-operator-webhook-server.openshift-storage.svc:443/validate-subscription?timeout=30s\": no endpoints available for service \"ocs-client-operator-webhook-server\""}

Version of all relevant components (if applicable):
4.17, 4.16

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?


Is there any workaround available to the best of your knowledge?


Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?


Can this issue reproducible?
Yes

Can this issue reproduce from the UI?


If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1. Install 4.16.z and upgrade to 4.17
2.
3.


Actual results:
odf-op should be able to upgrade all dependents

Expected results:
odf-op should upgrade successfully

Additional info:
With https://bugzilla.redhat.com/show_bug.cgi?id=2299443 odf-op scales down client-op if not running in provider mode however by that time a validating webhook w/ name `subscription.ocs.openshift.io` would be deployed and odf-op needs to delete that before scaling down client-op

Comment 11 errata-xmlrpc 2024-10-30 14:30:37 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.17.0 Security, Enhancement, & Bug Fix Update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:8676


Note You need to log in before you can comment on or make changes to this bug.