Bug 2126566

Summary: [GSS][rook] core: increase liveness probe timeout to 5s
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Randy Martinez <r.martinez>
Component: rookAssignee: Travis Nielsen <tnielsen>
Status: CLOSED CURRENTRELEASE QA Contact: Prasad Desala <tdesala>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 4.10CC: ebenahar, jcrumple, mmuench, msweiker, muagarwa, ocs-bugs, odf-bz-bot, tnielsen
Target Milestone: ---Keywords: TestCannotAutomate
Target Release: ODF 4.12.0   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: 4.12.0-80 Doc Type: Bug Fix
Doc Text:
Cause: Ceph daemon liveness probe may be too short Consequence: Stability issues have been observed with a liveness probe timeout of 1s. Fix: Increase the liveness probe timeout to 5s Result: More stable Ceph daemon pods.
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-02-08 14:06:28 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 7 Elad 2022-10-27 07:10:50 UTC
Hi Travis,

Anything special to test here for validating the fix?

Comment 8 Travis Nielsen 2022-10-27 15:19:36 UTC
(In reply to Elad from comment #7)
> Hi Travis,
> 
> Anything special to test here for validating the fix?

A simple analysis of the liveness probe to see its timeout is 5s on the pod is sufficient, thanks

Comment 11 Prasad Desala 2022-11-16 08:39:51 UTC
Verified this BZ on ODF version: 4.12.0-113

      livenessProbe:
        exec:
        ...
        failureThreshold: 3
        initialDelaySeconds: 10
        periodSeconds: 10
        successThreshold: 1
        timeoutSeconds: 5

Liveness probe timeout for ceph pods{osd,mgr,mon. mds} is now set to 5sec.