Bug 2032997
Summary: | When a fence-scsi storage device becomes unavailable, pcs cannot remove it from configuration and add a running one | |||
---|---|---|---|---|
Product: | Red Hat Enterprise Linux 8 | Reporter: | Nina Hostakova <nhostako> | |
Component: | pcs | Assignee: | Miroslav Lisik <mlisik> | |
Status: | CLOSED ERRATA | QA Contact: | cluster-qe <cluster-qe> | |
Severity: | medium | Docs Contact: | ||
Priority: | medium | |||
Version: | 8.5 | CC: | cluster-maint, idevat, kmalyjur, mlisik, mpospisi, omular, tojeline | |
Target Milestone: | rc | Keywords: | Triaged | |
Target Release: | 8.6 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | pcs-0.10.12-3.el8 | Doc Type: | Bug Fix | |
Doc Text: |
If this bug requires documentation, please select an appropriate Doc Type value.
|
Story Points: | --- | |
Clone Of: | ||||
: | 2033248 (view as bug list) | Environment: | ||
Last Closed: | 2022-05-10 14:50:48 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 2033248 |
Description
Nina Hostakova
2021-12-15 16:37:19 UTC
There are 2 cases: 1) adding a new disk and removing the unavailable disk This can be fixed. There is a workaround: Update disk in 2 steps. First remove unavailable disks and then add a new disk. 2) add disk to a cluster with unavailable disks This is ok to fail because it is not possible to check if the node was fenced. Proposed fix: https://github.com/ClusterLabs/pcs/commit/b3332366d280379edfe0dc95e8f789b02aac1166 Test: Case 1: [root@r8-node-01 pcs]# export $disk1=/dev/disk/by-id/scsi-360014052bc36324cf7d4a709a959340b [root@r8-node-01 pcs]# export $disk2=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe [root@r8-node-01 pcs]# export $disk3=/dev/disk/by-id/scsi-360014052f8c6f3de01047c29b72040f4 [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-360014052bc36324cf7d4a709a959340b pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) [root@r8-node-01 pcs]# echo offline > /sys/block/$(basename $(readlink $disk1))/device/state; [root@r8-node-01 pcs]# cat /sys/block/$(basename $(readlink $disk1))/device/state; offline [root@r8-node-01 pcs]# lpcs stonith update-scsi-devices fence-scsi add $disk2 remove $disk1 [root@r8-node-01 pcs]# echo $? 0 [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) Case 2: [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) [root@r8-node-01 pcs]# echo offline > /sys/block/$(basename $(readlink $disk2))/device/state; [root@r8-node-01 pcs]# cat /sys/block/$(basename $(readlink $disk2))/device/state; offline [root@r8-node-01 pcs]# lpcs stonith update-scsi-devices fence-scsi add $disk3 Error: r8-node-01: Unfencing failed, unable to check status of device '/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe': 2022-01-13 14:10:52,938 ERROR: Cannot get registration keys 2022-01-13 14:10:52,939 ERROR: Please use '-h' for usage Error: Errors have occurred, therefore pcs is unable to continue (pcs) [root@r8-node-01 pcs]# echo $? 1 [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) DevTestResults: [root@r8-node-01 pcs]# rpm -q pcs pcs-0.10.12-3.el8.x86_64 [root@r8-node-01 pcs]# export $disk1=/dev/disk/by-id/scsi-360014052bc36324cf7d4a709a959340b [root@r8-node-01 pcs]# export $disk2=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-360014052bc36324cf7d4a709a959340b pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) [root@r8-node-01 pcs]# echo offline > /sys/block/$(basename $(readlink $disk1))/device/state [root@r8-node-01 pcs]# cat /sys/block/$(basename $(readlink $disk1))/device/state offline [root@r8-node-01 pcs]# pcs stonith update-scsi-devices fence-scsi add $disk2 remove $disk1 [root@r8-node-01 pcs]# echo $? 0 [root@r8-node-01 pcs]# pcs stonith config Resource: fence-scsi (class=stonith type=fence_scsi) Attributes: devices=/dev/disk/by-id/scsi-3600140547721f8ee2774aa8bac6d8ebe pcmk_host_check=static-list pcmk_host_list="r8-node-01 r8-node-02 r8-node-03" pcmk_reboot_action=off Meta Attrs: provides=unfencing Operations: monitor interval=60s (fence-scsi-monitor-interval-60s) Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (pcs bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2022:1978 |