Bug 2007566
| Summary: | [IBM Z] ceph osd heap profiler fails with "not using tcmalloc" error | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | Abdul Kandathil (IBM) <akandath> |
| Component: | ceph | Assignee: | Scott Ostapovicz <sostapov> |
| Status: | CLOSED WORKSFORME | QA Contact: | Raz Tamir <ratamir> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 4.9 | CC: | bhubbard, bniver, madam, muagarwa, ocs-bugs, odf-bz-bot, pbalogh |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | s390x | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-10-14 02:33:16 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Abdul Kandathil (IBM)
2021-09-24 09:41:28 UTC
This looks like a CI build problem to me. Petr, can you please take a look if this is a ci issue? Hello Abdul, is this issue consistently reproducible? Is this error coming from `oc rhs` command itself or this is really returned output from: ceph tell osd.2 heap start_profiler From toolbox pod? If it's constantly reproducible, can you please just RSH to toolbox pod and try to run command locally there in the pod? This is the first time I see such error, so not sure what can be problem, but if this is the output coming from the command (ceph tell osd.2 heap start_profiler) itself, it doesn't look like issue in OCS-CI if this is the valid command. If it's returned from oc command, then it can be some OCP issue to run RSH command on pod. Which can be temporary glitch or bug, not sure. I'd suggest you check whether the 'z' build disables tcmalloc. If so this error is totally expected. https://github.com/ceph/ceph/blob/29bda6fd2aabcb37cf1c46a6edddf004d28bb164/src/osd/OSD.cc#L11509-L11513 With the newer version (odf 4.9.0-164.ci), I am not able to reproduce this issue. sh-4.4$ ceph tell osd.0 heap start_profiler osd.0 started profiler sh-4.4$ I think what probably happened here is the original ceph build you tested for 4.9.0-154.ci had tcmalloc disabled (I remember hearing something about this happening on some earlier builds) but that the ceph build for 4.9.0-164.ci now has tcmalloc enabled. Please reopen if this still exists. |