| Summary: | It takes about 1 second to get stonith agent metadata using crm_resource | |||
|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 7 | Reporter: | Tomas Jelinek <tojeline> | |
| Component: | pacemaker | Assignee: | Jan Pokorný [poki] <jpokorny> | |
| Status: | CLOSED WONTFIX | QA Contact: | cluster-qe <cluster-qe> | |
| Severity: | low | Docs Contact: | ||
| Priority: | high | |||
| Version: | 7.3 | CC: | cluster-maint, jpokorny, kgaillot, mnovacek, phagara | |
| Target Milestone: | rc | |||
| Target Release: | 7.9 | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | No Doc Update | ||
| Doc Text: |
undefined
|
Story Points: | --- | |
| Clone Of: | ||||
| : | 1552654 (view as bug list) | Environment: | ||
| Last Closed: | 2020-02-21 16:56:40 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Bug Depends On: | ||||
| Bug Blocks: | 1552654 | |||
|
Description
Tomas Jelinek
2016-10-12 14:56:07 UTC
This does seem odd. Reassigning to RHEL7, as RHEL6 is only getting high-priority bugfixes now, and the behavior is present on RHEL7. If I am not mistaken, part of the issue may be that there are two
roundtrips hidden in the stonith query as opposed to the resource
one:
lrmd API client lrmd API client
(crm_resource) (crm_resource)
| ^ | ^
| | | |
v | v |
lrmd API handler lrmd API handler
(lrmd) (lrmd)
| ^
| |
v |
stonith-ng API
handler
(stonithd)
Sorry, there's in fact no message routing round trip at all in the context of pcs' use of crm_resource. Results in RHEL 7.3 VM: - "/usr/sbin/fence_apc -o metadata" takes around 0.084s - "crm_resource --show-metadata stonith:fence_apc" around 1.022s Using strace with timeouts, I can notice that there is a significant pause (750-800 ms) after forked process to exec fence_apc has exited and before WNOHANG wait resumes. Looks like commit 12cf7b901733a96e4a7844e9f596430c5e8c2a3c introduced unnecessary block-for-a-second penalty. Proposed and tested fix (boost by factor of 10): https://github.com/ClusterLabs/pacemaker/pull/1214 Unfortunately, it currently fails at an lrmd test (more investigation pending). This will be not be ready for 7.4, bumping to 7.5 This will not make it in time for 7.5 Due to developer time constraints, this is unlikely to be done in the 7.9 time frame and so will be fixed for RHEL 8 only (Bug 1552654) |