Bug 2318289
| Summary: | [7.1z backport] osd/TrackedOp: Fix TrackedOp event order | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Prashant Dhange <pdhange> |
| Component: | RADOS | Assignee: | Prashant Dhange <pdhange> |
| Status: | CLOSED ERRATA | QA Contact: | skanta |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 5.3 | CC: | bhubbard, ceph-eng-bugs, cephqe-warriors, dwalveka, ngangadh, nojha, pdhiran, skanta, tserlin, vumrao |
| Target Milestone: | --- | Flags: | pdhange: needinfo- pdhange: needinfo- pdhange: needinfo- pdhange: needinfo- dwalveka: needinfo- |
| Target Release: | 7.1z3 | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | ceph-18.2.1-289.el8cp | Doc Type: | Bug Fix |
| Doc Text: |
Header read times are now reported accurately
Previously, the header read time was calculated based on the throttle timestamp (`throttle_stamp`) instead of the receive timestamp (`recv_stamp`). Although the throttled event preceded the header read event in the event queue, the throttle timestamp was later than the receive timestamp. This indicated a logical error in how the header read time was determined when throttling occurred: the header read time must reflect when the header was *received*, not when the associated operation was *throttled*. As a result, incorrect header read times were reported. This led to inaccurate performance metrics, making it difficult to analyze and optimize system performance. It also triggered false alarms in monitoring systems that relied on accurate header read time data, and it masked actual performance issues when the reported header read times were artificially inflated by throttling delays.
With this fix, the code uses the receive timestamp (`recv_stamp`) as the basis for calculating the header read time, regardless of whether a throttling event occurs. The header read time now represents the time the header was received, not the time the corresponding operation was delayed. As a result, accurate header read times are reported, which provides more reliable performance metrics, improves the accuracy of monitoring systems, and enables better analysis and optimization of system performance.
| Story Points: | --- |
| Clone Of: | 2318288 | Environment: | |
| Last Closed: | 2025-02-24 15:41:43 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 2318288 | ||
| Bug Blocks: | |||
|
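The distinction drawn in the Doc Text between `recv_stamp` and `throttle_stamp` can be illustrated with a minimal sketch. This is not the Ceph implementation: the `MessageStamps` class, the `header_read_time` function, and the `use_recv_stamp` flag are hypothetical names introduced only for illustration, and the timestamp values are arbitrary. Only the two field names come from the Doc Text.

```python
from dataclasses import dataclass


@dataclass
class MessageStamps:
    """Hypothetical container; only the two field names come from the Doc Text."""
    recv_stamp: float      # time the header was received (seconds)
    throttle_stamp: float  # time recorded for the throttle step (seconds)


def header_read_time(stamps: MessageStamps, use_recv_stamp: bool = True) -> float:
    """Return the timestamp reported for the header read event.

    use_recv_stamp=True models the behaviour described after the fix;
    use_recv_stamp=False models the behaviour described before it.
    """
    return stamps.recv_stamp if use_recv_stamp else stamps.throttle_stamp


# Arbitrary illustrative values, not taken from a real cluster.
stamps = MessageStamps(recv_stamp=10.0, throttle_stamp=10.5)
before = header_read_time(stamps, use_recv_stamp=False)
after = header_read_time(stamps)
print(f"before fix: {before}, after fix: {after}, skew: {before - after}")
```

In this sketch, any difference between the two stamps shows up directly as skew in the reported header read time when the throttle timestamp is used, which is the kind of inaccuracy the Doc Text describes.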
Description
Prashant Dhange
2024-10-12 08:21:34 UTC
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat Ceph Storage 7.1 security, bug fix, enhancement, and known issue updates), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2025:1770