Bug 2105151

Summary: No node is allowed to use fence_dummy to unfence another node
Product: Red Hat Enterprise Linux 7 Reporter: Reid Wahl <nwahl>
Component: pacemaker Assignee: Ken Gaillot <kgaillot>
Status: CLOSED WONTFIX QA Contact: cluster-qe <cluster-qe>
Severity: low Docs Contact:
Priority: medium    
Version: 7.9 CC: cluster-maint
Target Milestone: rc   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-07-08 09:25:46 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Reid Wahl 2022-07-08 06:29:41 UTC
Description of problem:

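I'm trying to use fence_dummy to unfence another node. I've tried with and without mock_dynamic_hosts. Originally I had only "fastvm-rhel-7-6-51" in mock_dynamic_hosts and pcmk_host_list. I've tried with and without pcmk_host_check, and I've tried setting it to both static-list and dynamic-list. Nothing works: the fencer never treats slowpass_fence as capable of the "on" action for a node other than the local one. I'm wondering if I have a typo somewhere...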

 Resource: slowpass_fence (class=stonith type=fence_dummy)
  Attributes: delay=120 mock_dynamic_hosts="node1 node2 fastvm-rhel-7-6-51" mode=pass monitor_mode=pass pcmk_host_list="node1 node2 fastvm-rhel-7-6-51" pcmk_off_retries=1 pcmk_off_timeout=240s pcmk_on_retries=1 pcmk_on_timeout=240s pcmk_reboot_retries=1 pcmk_reboot_timeout=240s verbose=1 pcmk_host_check="static-list"
  Meta Attrs: provides=unfencing


Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2690  )  notice: handle_request:    Client crmd.3802.16502056 wants to fence (on) 'fastvm-rhel-7-6-51' with device '(any)'
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2478  )   trace: check_alternate_host:      Checking if we (node1) can fence fastvm-rhel-7-6-51
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (    remote.c:1128  )  notice: initiate_remote_stonith_op:        Requesting peer fencing (on) targeting fastvm-rhel-7-6-51 | id=ea747800-b680-456e-a8af-2c78426ec49d state=0
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2898  )   debug: stonith_command:   Processed st_fence from crmd.3802: Operation now in progress (-115)
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2884  )   debug: stonith_command:   Processing st_query 0 from node1 (              40)
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (    remote.c:993   )   debug: create_remote_stonith_op:  ea747800-b680-456e-a8af-2c78426ec49d already exists
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2029  )   debug: stonith_query:     Query   <stonith_command __name__="stonith_command" t="stonith-ng" st_async_id="ea747800-b680-456e-a8af-2c78426ec49d" st_op="st_query" st_callid="21" st_callopt="64" st_remote_op="ea747800-b680-456e-a8af-2c78426ec49d" st_target="fastvm-rhel-7-6-51" st_device_action="on" st_origin="node1" st_clientid="16502056-9756-4ff6-a85f-89d38832f334" st_clientname="crmd.3802" st_timeout="60" src="node1"/>
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:2398  )   trace: stonith_construct_reply:   Creating a basic reply
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1806  )   debug: get_capable_devices:       Searching through 3 devices to see what is capable of action (on) for target fastvm-rhel-7-6-51
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1625  )   trace: localhost_is_eligible:     'on' operation with slowpass_fence can only be executed for localhost not fastvm-rhel-7-6-51
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1053  )   trace: string_in_list:    0: 'fastvm-rhel-7-6-51' != 'fastvm-rhel-7.6-21'
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1053  )   trace: string_in_list:    1: 'fastvm-rhel-7-6-51' != 'fastvm-rhel-7.6-22'
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1053  )   trace: string_in_list:    2: 'fastvm-rhel-7-6-51' != 'fastvm-rhel-7.6-51'
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1726  )  notice: can_fence_host_with_device:        xvm can fence (on) fastvm-rhel-7-6-51 (aka. 'fastvm-rhel-7.6-51'): static-list
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1625  )   trace: localhost_is_eligible:     'on' operation with slowfail_fence can only be executed for localhost not fastvm-rhel-7-6-51
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1595  )   debug: search_devices_record_result:      Finished Search. 1 devices can perform action (on) on node fastvm-rhel-7-6-51
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1987  )   debug: stonith_query_capable_device_cb:   Found 1 matching devices for 'fastvm-rhel-7-6-51'
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1993  )   trace: stonith_query_capable_device_cb:   Add query results   <stonith_query_capable_device_cb st_target="fastvm-rhel-7-6-51" st-available-devices="1">
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1993  )   trace: stonith_query_capable_device_cb:   Add query results     <st_device_id id="xvm" agent="fence_xvm" st_monitor_verified="1"/>
Jul 07 23:18:04 [3798] fastvm-rhel-7-6-21 stonith-ng: (  commands.c:1993  )   trace: stonith_query_capable_device_cb:   Add query results   </stonith_query_capable_device_cb>


Version-Release number of selected component (if applicable):

pacemaker-cts-1.1.23-1.el7_9.1.x86_64


How reproducible:

Always

Comment 3 Reid Wahl 2022-07-08 06:38:05 UTC
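Figured it out. The problem isn't in the config; it's in the agent's metadata, which sets on_target="1" for the "on" action. That is why the fencer logs that the 'on' operation with slowpass_fence "can only be executed for localhost".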
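To illustrate the diagnosis, here is a minimal Python sketch of how an on_target flag in agent metadata can rule out every node except the target itself. The metadata snippet is a trimmed, hypothetical stand-in (not the verbatim output of fence_dummy), and `on_target_actions` and `may_run_action` are illustrative names rather than Pacemaker functions; the eligibility rule is a simplified model of the behavior seen in the trace above, not the fencer's actual code.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed agent metadata for illustration only.
# It is NOT the verbatim metadata emitted by fence_dummy.
SAMPLE_METADATA = """\
<resource-agent name="fence_dummy" shortdesc="Dummy fence agent">
  <parameters/>
  <actions>
    <action name="on" on_target="1"/>
    <action name="off"/>
    <action name="reboot"/>
    <action name="monitor"/>
  </actions>
</resource-agent>
"""


def on_target_actions(metadata_xml):
    """Return the names of actions flagged on_target="1" in the metadata."""
    root = ET.fromstring(metadata_xml)
    return {
        action.get("name")
        for action in root.iter("action")
        if action.get("on_target") == "1"
    }


def may_run_action(local_node, target, action, flagged):
    """Simplified model: a flagged action may only target the local node."""
    if action in flagged:
        return local_node == target
    return True


flagged = on_target_actions(SAMPLE_METADATA)

# A peer asking to run "on" against another node is rejected ...
print(may_run_action("node1", "fastvm-rhel-7-6-51", "on", flagged))
# ... while the target node itself would be allowed.
print(may_run_action("fastvm-rhel-7-6-51", "fastvm-rhel-7-6-51", "on", flagged))
```

Under this model, dropping the on_target attribute from the "on" action in the metadata makes every node eligible again.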

Comment 4 Reid Wahl 2022-07-08 09:25:46 UTC
As kwenninger pointed out, this is fixed upstream by commit 072bbc5 from 2018. This feature is likely to be used only by developers, and it's no big deal to modify the script ad hoc if needed. Closing as WONTFIX.