Bug 2315651 - [Provider Mode] Some pods under openshift-storage namespace does not contain 'node.ocs.openshift.io/storage' toleration
Summary: [Provider Mode] Some pods under openshift-storage namespace does not contain ...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: ceph-csi-operator
Version: 4.17
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: ODF 4.17.0
Assignee: Leela Venkaiah Gangavarapu
QA Contact: Jilju Joy
URL:
Whiteboard: isf-provider
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2024-09-30 10:40 UTC by Jilju Joy
Modified: 2024-10-30 14:36 UTC (History)
6 users (show)

Fixed In Version: 4.17.0-117
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2024-10-30 14:36:09 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github red-hat-storage ceph-csi-operator pull 38 0 None open Bug 2315651:[release-4.17] add ocs default toleration to csi-op podspec 2024-10-04 10:00:29 UTC
Github red-hat-storage ceph-csi-operator pull 40 0 None open Bug 2315651:[release-4.17] add missing tolerations field in nodeplugin podspec 2024-10-04 10:04:59 UTC
Github red-hat-storage kubernetes-csi-addons pull 212 0 None open Bug 2315651:[release-4.17] [DOWNSTREAM-ONLY] add ocs default toleration to controller podspec 2024-10-04 10:35:20 UTC
Github red-hat-storage ocs-client-operator pull 252 0 None open Bug 2315651:[release-4.17] controllers: add ocs default toleration to csi driver spec 2024-10-07 06:16:00 UTC
Red Hat Issue Tracker OCSBZM-9298 0 None None None 2024-10-03 15:04:06 UTC
Red Hat Product Errata RHSA-2024:8676 0 None None None 2024-10-30 14:36:12 UTC

Description Jilju Joy 2024-09-30 10:40:12 UTC
Description of problem:

The 'ctrlplugin' and 'nodeplugin' pods does not have the toleration "node.ocs.openshift.io/storage" in provider cluster. In the client cluster, in addition to "ctrlplugin" and "nodeplugin" pods, the ceph-csi-controller-manager and csi-addons-controller-manager pods does not have toleration "node.ocs.openshift.io/storage"

>>> check_toleration_on_pods()
The pod openshift-storage.cephfs.csi.ceph.com-ctrlplugin-76db9dcf5lcspz does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-ctrlplugin-76db9dcf5wdzxm does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-nodeplugin-h7qt4 does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-nodeplugin-ldp8t does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-nodeplugin-qk2nd does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-ctrlplugin-7f75d86c95-497df does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-ctrlplugin-7f75d86c95-5988p does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-nodeplugin-6662j does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-nodeplugin-jsvmd does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-nodeplugin-l7ztl does not have toleration node.ocs.openshift.io/storage


The outputs given above are from provider cluster 4.17.0-101.


Given below is from client cluster 4.17.0-103. In the client cluster, in addition to "ctrlplugin" and "nodeplugin" pods, the ceph-csi-controller-manager and csi-addons-controller-manager pods does not have toleration "node.ocs.openshift.io/storage". If the usage of the taint is not restricted in client cluster, the toleration will be required on these pods as well.

>>> check_toleration_on_pods()
The pod ceph-csi-controller-manager-74b54fc579-2qcs7 does not have toleration node.ocs.openshift.io/storage
The pod csi-addons-controller-manager-7d7b754d7-2jwcs does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-ctrlplugin-bff885c654hfsp does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-ctrlplugin-bff885c65df7qc does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-nodeplugin-n26j9 does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.cephfs.csi.ceph.com-nodeplugin-rqmkw does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-ctrlplugin-846986475f-kh9cf does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-ctrlplugin-846986475f-q64pd does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-nodeplugin-mwnvv does not have toleration node.ocs.openshift.io/storage
The pod openshift-storage.rbd.csi.ceph.com-nodeplugin-z52bp does not have toleration node.ocs.openshift.io/storage

Version-Release number of selected component (if applicable):
ODF 4.17.0-103 and 4.17.0-101
OCP 4.16 and 4.17

How reproducible:
Always

Steps to Reproduce:
1. In provider-cluster, verify whether the toleration "node.ocs.openshift.io/storage" is present on the pods.
Example command:
% oc get pod <pod name> -o=jsonpath='{.spec.tolerations}' | jq     

Actual results:
Toleration "node.ocs.openshift.io/storage" is not present on some pods.

Expected results:
Pods should have the toleration "node.ocs.openshift.io/storage"

Additional info:

Comment 9 Sunil Kumar Acharya 2024-10-08 13:17:11 UTC
Please update the RDT flag/text appropriately.

Comment 11 errata-xmlrpc 2024-10-30 14:36:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.17.0 Security, Enhancement, & Bug Fix Update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:8676


Note You need to log in before you can comment on or make changes to this bug.