Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1707069

Summary:	A `ping` resource does not use the default timeout value when an operation (monitor, start) is not declared or set
Product:	Red Hat Enterprise Linux 8	Reporter:	Shane Bradley <sbradley>
Component:	pacemaker	Assignee:	Ken Gaillot <kgaillot>
Status:	CLOSED WONTFIX	QA Contact:	cluster-qe <cluster-qe>
Severity:	low	Docs Contact:
Priority:	low
Version:	8.0	CC:	cluster-maint, sbradley, tojeline
Target Milestone:	pre-dev-freeze	Keywords:	FutureFeature, Triaged
Target Release:	---	Flags:	pm-rhel: mirror+
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2023-10-09 16:56:35 UTC	Type:	Story
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Comment 2 Ken Gaillot 2019-05-08 18:15:55 UTC

It's definitely confusing, but pacemaker ignores the timeouts listed in a resource agent's meta-data.

The timeouts (and intervals) in the meta-data are "hints" to the user (and UIs such as pcs) as to what's a reasonable value to use. Desirable values are expected to vary by deployment, so the user should test and adjust them.

By contrast, pacemaker uses (in order of preference): any timeout explicitly set in the operation configuration in the CIB; any timeout set in op_defaults in the CIB (i.e. pcs resource op defaults); or 20 seconds.
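The order of preference above can be sketched roughly as follows. This is only an illustration of the described precedence, not pacemaker's actual code; the function and parameter names are hypothetical, and `None` stands for "not set in the CIB".

```python
from typing import Optional

# Built-in fallback mentioned in the comment above: 20 seconds.
BUILTIN_DEFAULT_TIMEOUT_S = 20


def effective_timeout(op_timeout: Optional[int],
                      op_defaults_timeout: Optional[int]) -> int:
    """Pick a timeout (seconds) using the stated order of preference.

    op_timeout          -- timeout set explicitly on the operation (hypothetical name)
    op_defaults_timeout -- timeout set in op_defaults (hypothetical name)

    The agent's meta-data hint is deliberately not a parameter: per the
    comment above, it is not consulted at all.
    """
    if op_timeout is not None:
        return op_timeout
    if op_defaults_timeout is not None:
        return op_defaults_timeout
    return BUILTIN_DEFAULT_TIMEOUT_S
```

With neither value set, this sketch falls through to 20 seconds regardless of what the agent advertises, which matches the behavior reported in this bug.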

Partly this is imposed by pacemaker's scheduling model -- the scheduler only has access to the CIB, which currently doesn't include agent meta-data. We could consider having pacemaker save agent meta-data hints in the CIB, but that could hurt scalability in clusters with many different resource types. There's also a problem with different versions of the same agent installed on different nodes -- we'd potentially need to store the hints per node, which would be even worse for scalability.

This would be considered a new feature, so it would be RHEL 8 only.

(BTW, the ocf:pacemaker: agents are part of the pacemaker component, not resource-agents.)

Comment 4 Ken Gaillot 2021-07-26 15:48:01 UTC

I have thought of an implementation that could scale: instead of feeding agent meta-data to the scheduler, the controller could keep the default timeout values in its meta-data cache, and the scheduler could add a flag to scheduled actions when the timeout is the "default default" (i.e. not explicitly specified in either the action configuration or op_defaults). The controller would then override the scheduler's timeout value with the one from cache when available.
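A rough sketch of that idea, assuming a simple in-memory cache keyed by agent and action. All names here (`ScheduledAction`, `schedule`, `controller_timeout`, the cache layout) are hypothetical and exist only to illustrate the flag-and-override flow; the cached value in the usage note is made up for the example.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

BUILTIN_DEFAULT_TIMEOUT_S = 20  # the "default default"

# Hypothetical cache layout: (agent, action) -> timeout hint in seconds.
MetadataCache = Dict[Tuple[str, str], int]


@dataclass
class ScheduledAction:
    agent: str
    action: str
    timeout_s: int
    is_default_default: bool  # flag the scheduler would add


def schedule(agent: str, action: str,
             op_timeout: Optional[int] = None,
             op_defaults_timeout: Optional[int] = None) -> ScheduledAction:
    """Scheduler side: sees only CIB values, never agent meta-data."""
    if op_timeout is not None:
        return ScheduledAction(agent, action, op_timeout, False)
    if op_defaults_timeout is not None:
        return ScheduledAction(agent, action, op_defaults_timeout, False)
    # Nothing configured: use the built-in value and flag it.
    return ScheduledAction(agent, action, BUILTIN_DEFAULT_TIMEOUT_S, True)


def controller_timeout(sa: ScheduledAction, cache: MetadataCache) -> int:
    """Controller side: override only flagged actions, only if a hint is cached."""
    if sa.is_default_default:
        hint = cache.get((sa.agent, sa.action))
        if hint is not None:
            return hint
    return sa.timeout_s
```

For example, with a cache of `{("ocf:pacemaker:ping", "monitor"): 60}`, an unconfigured monitor action would run with 60 seconds, while an action with an explicit timeout or an op_defaults timeout would keep its configured value. With no cached hint, the scheduler's 20 seconds still applies.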

Unfortunately there is still a large backlog, so I would not expect a fix in the next couple of point releases.

Comment 6 Ken Gaillot 2023-10-09 16:56:35 UTC

Manually migrated to Jira as RHEL-12304