Bug 2037080

Summary: openshift-cluster-csi-drivers pods crashing on PSI
Product: OpenShift Container Platform Reporter: OpenShift BugZilla Robot <openshift-bugzilla-robot>
Component: StorageAssignee: Emilien Macchi <emacchi>
Storage sub component: OpenStack CSI Drivers QA Contact: Itay Matza <imatza>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: high CC: aos-bugs, emacchi, jsafrane, m.andre, mbooth, ppitonak, pprinett, tsze
Version: 4.9Keywords: TestBlocker, Triaged
Target Milestone: ---   
Target Release: 4.9.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: The csi-driver pod's livenessProbe was too strict Consequence: the probe would fail on slower clouds causing the cluster to be degraded. Fix: Relax the livenessProbe to more realistic values to accommodate slower environments Result: the cluster is no longer degraded on clouds with slow cinder.
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-01-24 16:50:19 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2027685    
Bug Blocks:    

Comment 1 ShiftStack Bugwatcher 2022-01-05 07:04:09 UTC
Removing the Triaged keyword because:
* the QE automation assessment (flag qe_test_coverage) is missing

Comment 4 Itay Matza 2022-01-19 13:04:02 UTC
Verified with OCP 4.9.0-0.nightly-2022-01-11-155222 on top of OSP RHOS-16.1-RHEL-8-20211126.n.1:

- Confirmed that the values of the parameters were updated successfully.
- Confirmed on several different CI jobs - the current timeout values are working in the PSI cloud.

Comment 7 errata-xmlrpc 2022-01-24 16:50:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.9.17 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:0195