Bug 2209695
Summary: | When collecting Must-gather logs shows /usr/bin/gather_ceph_resources: line 341: jq: command not found | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Pratik Surve <prsurve> | |
Component: | must-gather | Assignee: | yati padia <ypadia> | |
Status: | CLOSED ERRATA | QA Contact: | Pratik Surve <prsurve> | |
Severity: | urgent | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 4.13 | CC: | branto, muagarwa, ocs-bugs, odf-bz-bot, ypadia | |
Target Milestone: | --- | |||
Target Release: | ODF 4.13.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | 4.13.0-207 | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 2210475 | Environment: | ||
Last Closed: | 2023-06-21 15:25:39 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 2210475 |
Description
Pratik Surve
2023-05-24 14:25:54 UTC
Since OCS 4.8, "jq" has been added downstream via the build, see http://pkgs.devel.redhat.com/cgit/containers/rook-ceph/commit/?h=ocs-4.8-rhel-8

Boris, has something changed in 4.13?

I don't see any change around jq here. I compared rook-ceph 4.13 and 4.9: both contain the jq binary, in the same version (1.6) and in exactly the same location (/usr/bin/jq). I can confirm that there was no jq binary in ocs-must-gather in, for example, ODF 4.9 either, so nothing changed there. My guess is that the script is no longer running the jq binary in the rook-ceph pod for some reason. Could it be related to the rhceph image now using ubi-minimal as a base?

I looked at the script and it looks like I'm right. The error comes from this line:

subvolgrp_names=$(timeout 120 oc -n "${ns}" exec "${HOSTNAME}"-helper -- bash -c "${ceph_command}"| jq --raw-output '.[].name')

The escaping is wrong there: the pipe sits outside the string passed to bash -c, so the script tries to run jq in the must-gather container, where it is not available. The line should look like this instead:

subvolgrp_names=$(timeout 120 oc -n "${ns}" exec "${HOSTNAME}"-helper -- bash -c "${ceph_command} | jq --raw-output '.[].name'")

Thanks Boris! Yati, please send a patch asap.

Added the link to the patch, will update once merged.

This has existed since 4.12 (commit https://github.com/red-hat-storage/ocs-operator/commit/b58ba9b8a8d6f5220842e44c210a6b42f2a6466a). Yati, please clone this bug to 4.12 as well. We need to fix it there too; I don't know why this was never discovered.

Yeah sure, will do that.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenShift Data Foundation 4.13.0 enhancement and bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:3742
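The quoting problem discussed in the comments can be reproduced without a cluster. The following is a minimal sketch, not the must-gather script itself: `fake_exec` is a hypothetical stand-in for `oc exec ... --` that runs a command with an extended PATH, and `fakejq` is a hypothetical stand-in for jq that exists only on that "remote" PATH (it merely upper-cases its input). It shows that a pipe placed outside the string given to `bash -c` is run by the calling shell, while a pipe inside the string is run where the tool is installed.

```shell
#!/usr/bin/env bash
# Sketch only: fake_exec and fakejq are illustrative stand-ins, not real
# oc or jq behavior.

# Directory that plays the role of the helper pod's /usr/bin.
remote_dir=$(mktemp -d)
printf '#!/bin/sh\ntr a-z A-Z\n' > "${remote_dir}/fakejq"
chmod +x "${remote_dir}/fakejq"

# Stand-in for `oc -n <ns> exec <pod> --`: the command sees the "remote" PATH.
fake_exec() {
  env PATH="${remote_dir}:${PATH}" "$@"
}

# Stand-in for ${ceph_command} in the original script.
ceph_command='echo name-a'

# Broken form: the pipe is outside the quoted string, so the *calling* shell
# looks up fakejq, does not find it, and the pipeline exits with status 127.
broken=$( { fake_exec bash -c "${ceph_command}" | fakejq; } 2>/dev/null )
broken_status=$?

# Fixed form: the pipe is inside the string, so the "remote" shell runs fakejq.
fixed=$(fake_exec bash -c "${ceph_command} | fakejq")
fixed_status=$?

echo "broken: status=${broken_status} output='${broken}'"
echo "fixed:  status=${fixed_status} output='${fixed}'"

rm -rf "${remote_dir}"
```

The broken form fails with the shell's "command not found" status, which matches the symptom in the summary; the fixed form mirrors the corrected line proposed in the comments.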