Bug 1801365 - [RFE] (for IBM) expose the Rook ROOK_CSI_KUBELET_DIR_PATH variable in the OCS operator
Summary: [RFE] (for IBM) expose the Rook ROOK_CSI_KUBELET_DIR_PATH variable in the OCS...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Container Storage
Classification: Red Hat Storage
Component: ocs-operator
Version: 4.2
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: OCS 4.5.0
Assignee: umanga
QA Contact: Elad
URL:
Whiteboard:
: 1823417 1857114 (view as bug list)
Depends On:
Blocks: 1859307
TreeView+ depends on / blocked
 
Reported: 2020-02-10 17:56 UTC by svolkov
Modified: 2020-09-23 09:04 UTC (History)
16 users (show)

Fixed In Version:
Doc Type: Enhancement
Doc Text:
.Support for non default Kubelet directory for CSI plugin pods Administrators can now change the default Kubelet directory path in their environments using the `ROOK_CSI_KUBELET_DIR_PATH` environment variable.
Clone Of:
Environment:
Last Closed: 2020-09-15 10:16:04 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift ocs-operator issues 454 0 None closed [ROKS] csi-cephfsplugin and csi-rbdplugin needs to change kubelet path 2021-01-18 13:41:26 UTC
Github openshift ocs-operator pull 477 0 None closed creates default rook-ceph-operator-config ConfigMap 2021-01-18 13:41:27 UTC
Red Hat Product Errata RHBA-2020:3754 0 None None None 2020-09-15 10:16:45 UTC

Description svolkov 2020-02-10 17:56:36 UTC
Had some issues installing OCS4.2.1 on IBM Cloud.
One of the issues is that IBM Cloud set root-dir for kubelet at /var/data/kubelet.
the csi plugin pods don't really get this change and are trying to use the default /var/lib/kubelet.

The Rook operator have the option to set ROOK_CSI_KUBELET_DIR_PATH, so we just need to expose that in the OCS operator.

This of course might surface in other cloud providers so might as well do it.

Comment 2 Yaniv Kaul 2020-02-11 17:27:52 UTC
Is that with UPI or IPI?
Why is it set differently? How does OCP cope with that?

Comment 3 svolkov 2020-02-11 21:42:18 UTC
UPI of course. I don't think there's IPI for IBM Cloud.
it is set different because that's how it has been used in all IBM environments for a while (not only Cloud).
Not sure how they install (most likely Ansible) so they provide the path they want. I can ask how they install, however, this is something that might show up in other places as well (other clouds or on-prem bare-metal).

Comment 4 Yaniv Kaul 2020-02-14 00:06:05 UTC
Eric - this is what we've briefly discussed at LX. Should we 'auto-detect' this, or can they move to a standard location?

Comment 5 umanga 2020-03-05 10:55:54 UTC
If this is really needed,

1. OLM already supports overriding Env Vars in deployments.
2. Rook operator will be using Config Map to mix and match with Env Vars for any CSI configuration.

Do we still need to expose it in OCS or will the already available ways work?

Comment 6 Jose A. Rivera 2020-04-08 14:38:03 UTC
This will be available in Rook-Ceph 1.3 for OCS 4.5. We still need to discuss if we want to automate something via the ocs-operator, or if we want to leverage a functionality of the OLM to override environment variables as part of hte Subscription process. Either way, ACKing this for 4.5.

Comment 7 Jose A. Rivera 2020-04-15 14:44:19 UTC
*** Bug 1823417 has been marked as a duplicate of this bug. ***

Comment 8 Eran Tamir 2020-05-07 13:55:35 UTC
@umanga any decision here? Is this still on track?

Comment 9 umanga 2020-05-07 13:59:46 UTC
Yes, we have added default config to ocs-operator which admin can change.
Currently, we are blocked on upgrading Rook to v1.3. It should be done in time for 4.5.

Comment 12 Elad 2020-06-02 08:08:06 UTC
Thanks! Acking

Comment 14 svolkov 2020-06-03 14:22:45 UTC
if the fix works then there is no need to test in IBM Cloud, you just need to test it on a k8s cluster that have kubelet in a different than default path.
we cab also provide also some RC of 4.5 (when available) and they can test themselves - if needed.

Comment 16 Elad 2020-06-15 11:36:02 UTC
Removing needinfo as per comments #14 and #15

Comment 21 svolkov 2020-07-15 17:12:28 UTC
The idea for the variable is to help automation. This workaround is known (this is how they install it right now in IBM cloud), but they can't continue to spin clusters manually.

Comment 22 Yaniv Kaul 2020-07-16 13:35:16 UTC
*** Bug 1857114 has been marked as a duplicate of this bug. ***

Comment 23 Mudit Agarwal 2020-07-20 04:15:27 UTC
Removing needonfo which came as part of the duped BZ.

Comment 26 Petr Balogh 2020-07-29 13:45:26 UTC
IIUC on each of our deployment we have currently we have  root-dir for kubelet in /var/lib/kubelet but on IBM ROKS they have actually different one (/var/data/kubelet).

Haven't done this change of kublet dir path on some existing deployed cluster.
We can take a look how to change this, I thought that this is dependent on some specific package installed which can be different on ROKS system.
Do you have some documentation how to change this kublet root-dir on our regular cluster on AWS for example?

Thanks

Comment 32 Petr Balogh 2020-08-10 11:30:08 UTC
I have asked Akash on Friday to update this BZ that this feature is not needed anymore for ROKS cluster running on OCP 4.4 and higher.
Please Akash confirm it here and I think this feature can be removed IMO if what we discussed in mail thread an on slack is not needed anymore for you.

Thanks, Petr

Comment 33 akgunjal@in.ibm.com 2020-08-10 13:34:08 UTC
OCP 4.4 in IBM Cloud is now having kubelet path as "/var/lib/kubelet" as needed by OCS. So we do not need to validate this fix any more as indicated by Petr.

Comment 34 Elad 2020-08-10 16:03:45 UTC
Moving this BZ to VERIFIED based on latest 4.5 regression cycle results and the fact that the new variable is not needed to be changed with OCP 4.4 anymore.


=====================

ocs-operator.v4.5.0-518.ci 
OCP 4.5.0-0.nightly-2020-08-07-024812

Comment 36 errata-xmlrpc 2020-09-15 10:16:04 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenShift Container Storage 4.5.0 bug fix and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3754


Note You need to log in before you can comment on or make changes to this bug.