Bug 2158773

Summary: ClusterObjectStoreState reports critical alert
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Alexander Chuzhoy <sasha>
Component: ceph-monitoringAssignee: Divyansh Kamboj <dkamboj>
Status: CLOSED ERRATA QA Contact: Vishakha Kathole <vkathole>
Severity: unspecified Docs Contact:
Priority: high    
Version: 4.11CC: dkamboj, ebenahar, muagarwa, nthomas, odf-bz-bot, uchapaga, vkathole
Target Milestone: ---   
Target Release: ODF 4.14.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-11-08 18:49:53 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Alexander Chuzhoy 2023-01-06 14:17:09 UTC
Versions:
openshift-local-storage local-storage-operator.4.11.0-202212070335
openshift-storage mcg-operator.v4.11.4
openshift-storage ocs-operator.v4.11.4
openshift-storage odf-csi-addons-operator.v4.11.4
openshift-storage odf-operator.v4.11.4
OCP: 4.12.0-rc.5

Alertname Starts At Summary State
ClusterObjectStoreState 2022-12-26 01:30:40 UTC active

Severity: Critical

Description: Cluster Object Store is in unhealthy state for more than 15s. Please check Ceph cluster health.

Message: Cluster Object Store is in unhealthy state. Please check Ceph cluster health.

But the ceph state is healthy:

ceph status
  cluster:
    id:     0947c6ed-6881-4c2f-8c8c-3ec84b3446a4
    health: HEALTH_OK
 
  services:
    mon: 3 daemons, quorum a,b,c (age 2w)
    mgr: a(active, since 2w)
    mds: 1/1 daemons up, 1 hot standby
    osd: 12 osds: 12 up (since 13d), 12 in (since 13d)
    rgw: 1 daemon active (1 hosts, 1 zones)
 
  data:
    volumes: 1/1 healthy
    pools:   11 pools, 641 pgs
    objects: 7.96k objects, 29 GiB
    usage:   94 GiB used, 17 TiB / 17 TiB avail
    pgs:     641 active+clean
 
  io:
    client:   1.4 KiB/s rd, 220 KiB/s wr, 1 op/s rd, 0 op/s wr

Comment 5 Divyansh Kamboj 2023-04-11 14:06:44 UTC
The alert represents the Phase of CephObjectStore CRD. The alert message is misleading to users while debugging, we need to change the alert message to reflect that to avoid users going in the wrong direction.
Can't provide devel_acks as I don't have permissions.

Comment 14 Mudit Agarwal 2023-06-05 11:32:38 UTC
Not a 4.13 blocker, moving this out

Comment 19 Divyansh Kamboj 2023-07-12 11:30:47 UTC
@vkathole can you take a look at it again? looks like the changes to the description didn't pop up on your side. Just tested it out with a 4.13 build.

Comment 25 errata-xmlrpc 2023-11-08 18:49:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.14.0 security, enhancement & bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:6832