Bug 2172089 - [Tracker for Ceph BZ 2174461] rook-ceph-nfs pod is stuck at status 'CreateContainerError' after enabling NFS in ODF 4.13
Summary: [Tracker for Ceph BZ 2174461] rook-ceph-nfs pod is stuck at status 'CreateCon...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: ceph
Version: 4.13
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: ODF 4.13.0
Assignee: Tamil
QA Contact: Amrita Mahapatra
URL:
Whiteboard:
Depends On: 2174461
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-02-21 13:19 UTC by Amrita Mahapatra
Modified: 2023-08-09 16:37 UTC (History)
16 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
: 2174461 (view as bug list)
Environment:
Last Closed: 2023-06-21 15:24:01 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2023:3742 0 None None None 2023-06-21 15:24:27 UTC

Description Amrita Mahapatra 2023-02-21 13:19:07 UTC
Description of problem (please be detailed as possible and provide log
snippests): rook-ceph-nfs pod is stuck at status 'CreateContainerError' after enabling nfs feature,

$ oc get cephnfs 
NAME                         AGE
ocs-storagecluster-cephnfs   21m

$ oc get pods|grep rook-ceph-nfs
rook-ceph-nfs-ocs-storagecluster-cephnfs-a-dd598659f-bb8xn        1/2     CreateContainerError   0          19m

The error msg is,

Error: container create failed: time="2023-02-23T08:54:29Z" level=error msg="runc create failed: unable to start container process: exec: \"dbus-daemon\": executable file not found in $PATH"

Version of all relevant components (if applicable):
OCP version: 4.13.0-0.nightly-2023-02-23-000625
OCS version: 4.13.0-86

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)? yes


Is there any workaround available to the best of your knowledge? No


Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)? 1


Can this issue reproducible? yes


Can this issue reproduce from the UI? yes


If this is a regression, please provide more details to justify this: NA


Steps to Reproduce:
1. Enable nfs from UI by slecting the checkbox from storagesystem creation page or from CLI by patch request,
$ oc patch -n openshift-storage storageclusters.ocs.openshift.io ocs-storagecluster --patch '{"spec": {"nfs":{"enable": true}}}' --type merge

2. Check cephnfs got created
$ oc get cephnfs

3. Check the status for rook-ceph-nfs pod
$ oc get pods|grep rook-ceph-nfs


Actual results:
rook-ceph-nfs pod is stuck at status 'CreateContainerError' after enabling nfs feature.

Expected results:
rook-ceph-nfs pod should be running after enabling nfs feature.

Additional info:
Have attached rook-ceph-nfs pod log.

Comment 10 Blaine Gardner 2023-02-28 21:32:47 UTC
I see a message from the dashboard screenshot "dbus-daemon not found in $PATH". I think the package that provides dbus-daemon must have been removed from our image. registry.access.redhat.com/ubi8/ubi:8.7 has the `dbus-daemon` app. I think the dbus package should be set as a requirement for the NFS Ganesha RPM as well. Moving this to the build component for them to check that out.

Comment 11 Deepshikha khandelwal 2023-03-01 05:34:50 UTC
The rhceph image(RHEL9 based) `6-105` is used to build the 4.13 rook image. I can see the `dbus` package is listed in the package list for this image.

https://quay.io/repository/rhceph-dev/rhceph/manifest/sha256:dfeceae86b14ab0c349ead487cbae17deebbed579021d5fbdf7b6df9d43e9623?tab=packages

Comment 12 Boris Ranto 2023-03-01 07:34:43 UTC
The dbus packae is available in the package but there is no dbus-daemon package installed.

@Blaine Do we need the dbus-daemon in rook-ceph image only or both rook-ceph as well as rhceph container?

We need to re-assign the bz based on that.

Comment 13 Blaine Gardner 2023-03-01 17:17:42 UTC
@branto the rhceph container will need dbus-daemon. Thanks :)

I will go ahead and create a rhceph BZ to track that.

Comment 14 Blaine Gardner 2023-03-01 17:23:54 UTC
@branto, I created https://bugzilla.redhat.com/show_bug.cgi?id=2172089 to track getting the package into rhceph. I believe that should be rhcs 6.0. If some part of that doesn't sound right, I'd appreciate if you'd update it.

Comment 15 Boris Ranto 2023-03-01 17:32:08 UTC
Looks good to me, thanks for creating the bz!

Comment 17 Mudit Agarwal 2023-03-16 15:54:41 UTC
Fix should be available with RHCS 6.1, its already merged upstream.

Comment 25 errata-xmlrpc 2023-06-21 15:24:01 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenShift Data Foundation 4.13.0 enhancement and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:3742


Note You need to log in before you can comment on or make changes to this bug.