Adding a more Ceph commands to ease debugging.
backport PR is merged
I've tested this by trying all the newly added ceph commands They all seem to work except for 'ceph pool autoscale-status' and 'ceph osd drain status' sh-4.4# ceph pool autoscale-status no valid command found; 10 closest matches: pg stat pg getmap pg dump {all|summary|sum|delta|pools|osds|pgs|pgs_brief [all|summary|sum|delta|pools|osds|pgs|pgs_brief...]} pg dump_json {all|summary|sum|pools|osds|pgs [all|summary|sum|pools|osds|pgs...]} pg dump_pools_json pg ls-by-pool <poolstr> {<states> [<states>...]} pg ls-by-primary <osdname (id|osd.id)> {<int>} {<states> [<states>...]} pg ls-by-osd <osdname (id|osd.id)> {<int>} {<states> [<states>...]} pg ls {<int>} {<states> [<states>...]} pg dump_stuck {inactive|unclean|stale|undersized|degraded [inactive|unclean|stale|undersized|degraded...]} {<int>} Error EINVAL: invalid command sh-4.4# ceph osd drain status no valid command found; 10 closest matches: osd perf osd df {plain|tree} {class|name} {<filter>}ceph_commands+=("ceph osd crush weight-set dump") ceph_commands+=("ceph osd crush weight-set dump") osd blocked-by osd pool stats {<poolname>} osd pool scrub <poolname> [<poolname>...] osd pool deep-scrub <poolname> [<poolname>...] osd pool repair <poolname> [<poolname>...] osd pool force-recovery <poolname> [<poolname>...] osd pool force-backfill <poolname> [<poolname>...] osd pool cancel-force-recovery <poolname> [<poolname>...] Error EINVAL: invalid command sh-4.4# Also, in must-gather/collection-scripts/gather_ceph_resources, lines 63 and 64 appear to be the same: ceph_commands+=("ceph osd crush weight-set dump") ceph_commands+=("ceph osd crush weight-set dump") Is one of these lines in error?
Warren, is must-gather failing or is it ignoring the errors? If the command is not available yet (let's say in a newer Ceph version) then if it's fine, but must-gather should ignore the errors and proceed with other commands. Also the double "ceph_commands+=("ceph osd crush weight-set dump")" is unintended but harmless.
oc adm must-gather is working and returning 0. As far as I can tell, it looks like all the information is in the must-gather sub-directory. The ceph pool autoscale-status problem still exists so I assume that this means that the ceph command is not affecting must-gather.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenShift Container Storage 4.5.0 bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:3754