Bug 1502795
| Summary: | RFE: Allow starting remote nodes in a instant-standby state | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 8 | Reporter: | Ken Gaillot <kgaillot> |
| Component: | pacemaker | Assignee: | Chris Lumens <clumens> |
| Status: | CLOSED ERRATA | QA Contact: | cluster-qe <cluster-qe> |
| Severity: | low | Docs Contact: | |
| Priority: | high | ||
| Version: | 8.0 | CC: | cluster-maint, jrehova, jruemker, mnovacek, msmazova, sbradley |
| Target Milestone: | pre-dev-freeze | Keywords: | FutureFeature, Reopened, Triaged |
| Target Release: | 8.9 | Flags: | pm-rhel:
mirror+
|
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | pacemaker-2.1.6-2.el8 | Doc Type: | Enhancement |
| Doc Text: |
Feature: Users may specify PCMK_node_start_state in /etc/sysconfig/pacemaker to force a Pacemaker Remote node to start in standby or online mode.
Reason: The PCMK_node_start_state sysconfig option was supported for cluster nodes, but not Pacemaker Remote nodes.
Result: Users can choose to force a Pacemaker Remote node to start in standby mode when joining the cluster.
|
Story Points: | --- |
| Clone Of: | 1376556 | Environment: | |
| Last Closed: | 2023-11-14 15:32:34 UTC | Type: | Feature Request |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1376556 | ||
| Bug Blocks: | 1427246 | ||
|
Description
Ken Gaillot
2017-10-16 17:23:16 UTC
After evaluating this issue, there are no plans to address it further or fix it in an upcoming release. Therefore, it is being closed. If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened. This is still a goal, but it will be tracked via the upstream bz. Support for remote nodes was added in upstream main branch as of commit 76bd508cc Version of pacemaker: > [root@virt-248:~]# rpm -q pacemaker > pacemaker-2.1.6-3.el8.x86_64 Setting of 5-node cluster -> 2 nodes and 3 remote nodes: > [root@virt-248:~]#pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 14:35:21 2023 on virt-248 > * Last change: Mon Jul 17 14:35:04 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * online: [ virt-248 virt-249 ] > * Remoteonline: [ virt-256 virt-257 virt-261 ] > > Full List of Resources: > * fence-virt-248 (stonith:fence_xvm): Started virt-249 > * fence-virt-249 (stonith:fence_xvm): Started virt-248 > * fence-virt-256 (stonith:fence_xvm): Started virt-249 > * fence-virt-257 (stonith:fence_xvm): Started virt-249 > * fence-virt-261 (stonith:fence_xvm): Started virt-248 > * virt-256 (ocf::pacemaker:remote): Started virt-248 > * virt-257 (ocf::pacemaker:remote): Started virt-249 > * virt-261 (ocf::pacemaker:remote): Started virt-248 > > Daemon Status: > corosync: active/disabled > pacemaker: active/disabled > pcsd: active/enabled ___________________________________________________________________________________________________________ OPTION 1: remote node to standby change PCMK_node_start_state expected status after before disable? on remote node? enable remote node ===================== ============================ ======================= yes no connect as standby Setting standby state with pcs node standby: > [root@virt-248:~]# pcs node standby virt-256 virt-257 virt-261 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 22:06:12 2023 on virt-248 > * Last change: Mon Jul 17 22:06:07 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * RemoteNode virt-256: standby > * RemoteNode virt-257: standby > * RemoteNode virt-261: standby > * online: [ virt-248 virt-249 ] Disabling remote nodes: > [root@virt-248:~]# pcs resource disable virt-256 > [root@virt-248:~]# pcs resource disable virt-257 > [root@virt-248:~]# pcs resource disable virt-261 > [root@virt-248:~]# pcs status > ... > Node List: > * online: [ virt-248 virt-249 ] > * RemoteOFFLIno: [ virt-256 virt-257 virt-261 ] > ... Enabling remote nodes: > [root@virt-248:~]# pcs resource enable virt-257 > [root@virt-248:~]# pcs resource enable virt-256 > [root@virt-248:~]# pcs resource enable virt-261 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 22:10:55 2023 on virt-248 > * Last change: Mon Jul 17 22:10:50 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * RemoteNode virt-256: standby > * RemoteNode virt-257: standby > * RemoteNode virt-261: standby > * online: [ virt-248 virt-249 ] RESULT: It is working well for this option -> all remote nodes have standby state. ___________________________________________________________________________________________________________ OPTION 2: remote node to standby change PCMK_node_start_state expected status after before disable? on remote node? enable remote node ===================== ============================ ======================= yes yes, to "online" connect as online yes yes, to "standby" connect as standby yes yes, to "default" connect as standby Setting standby state with pcs node standby: > [root@virt-248:~]# pcs node standby virt-256 virt-257 virt-261 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 23:29:17 2023 on virt-248 > * Last change: Mon Jul 17 23:29:13 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * RemoteNode virt-256: standby > * RemoteNode virt-257: standby > * RemoteNode virt-261: standby > * Online: [ virt-248 virt-249 ] Disabling remote nodes: > [root@virt-248:~]# pcs resource disable virt-261 > [root@virt-248:~]# pcs resource disable virt-257 > [root@virt-248:~]# pcs resource disable virt-256 > [root@virt-248:~]# pcs status > ... > Node List: > * online: [ virt-248 virt-249 ] > * RemoteOFFLIno: [ virt-256 virt-257 virt-261 ] > ... Changing PCMK_node_start_state in /etc/sysconfig/pacemaker for each remote nodes different values - "online", "default" and "standby": > [root@virt-261 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="default" > [root@virt-261 ~]# systemctl restart pacemaker-remote > [root@virt-257 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="online" > [root@virt-257 ~]# systemctl restart pacemaker-remote > [root@virt-256 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="standby" > [root@virt-256 ~]# systemctl restart pacemaker-remote Enabling remote nodes: > [root@virt-248:~]# pcs resource enable virt-261 > [root@virt-248:~]# pcs resource enable virt-256 > [root@virt-248:~]# pcs resource enable virt-257 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 23:33:05 2023 on virt-248 > * Last change: Mon Jul 17 23:33:01 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * RemoteNode virt-256: standby > * RemoteNode virt-261: standby > * Online: [ virt-248 virt-249 ] > * RemoteOnline: [ virt-257 ] RESULT: It is working well for this option -> 2 remote nodes have standby state (RNs with "default" and "standby") and 1 remote node is online (RN with "online"). ___________________________________________________________________________________________________________ OPTION 3: remote node to standby change PCMK_node_start_state expected status after before disable? on remote node? enable remote node ===================== ============================ ======================= no no connect as online Setting of cluster -> all remote nodes = online: > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 22:37:24 2023 on virt-248 > * Last change: Mon Jul 17 22:35:11 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * online: [ virt-248 virt-249 ] > * Remoteonline: [ virt-256 virt-257 virt-261 ] Disabling remote nodes: > [root@virt-248:~]# pcs resource disable virt-261 > [root@virt-248:~]# pcs resource disable virt-257 > [root@virt-248:~]# pcs resource disable virt-256 > [root@virt-248:~]# pcs status > ... > Node List: > * online: [ virt-248 virt-249 ] > * RemoteOFFLIno: [ virt-256 virt-257 virt-261 ] > ... Enabling remote nodes: > [root@virt-248:~]# pcs resource enable virt-256 > [root@virt-248:~]# pcs resource enable virt-257 > [root@virt-248:~]# pcs resource enable virt-261 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 22:39:03 2023 on virt-248 > * Last change: Mon Jul 17 22:39:00 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * online: [ virt-248 virt-249 ] > * Remoteonline: [ virt-256 virt-257 virt-261 ] RESULT: It is working well for this option -> all remote nodes are online. ___________________________________________________________________________________________________________ OPTION 4: remote node to standby change PCMK_node_start_state expected status after before disable? on remote node? enable remote node ===================== ============================ ======================= no yes, to "online" connect as online no yes, to "standby" connect as standby no yes, to "default" connect as online Setting of cluster -> all remote nodes = online: > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Mon Jul 17 22:39:03 2023 on virt-248 > * Last change: Mon Jul 17 22:39:00 2023 by root via cibadmin on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * online: [ virt-248 virt-249 ] > * Remoteonline: [ virt-256 virt-257 virt-261 ] Disabling remote nodes: > [root@virt-248:~]# pcs resource disable virt-256 > [root@virt-248:~]# pcs resource disable virt-257 > [root@virt-248:~]# pcs resource disable virt-261 > [root@virt-248:~]# pcs status > ... > Node List: > * online: [ virt-248 virt-249 ] > * RemoteOFFLIno: [ virt-256 virt-257 virt-261 ] > ... Changing PCMK_node_start_state in /etc/sysconfig/pacemaker for each remote nodes different values - "online", "default" and "standby": > [root@virt-261 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="default" > [root@virt-261 ~]# systemctl restart pacemaker-remote > [root@virt-257 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="online" > [root@virt-257 ~]# systemctl restart pacemaker-remote > [root@virt-256 ~]# vim /etc/sysconfig/pacemaker > VALGRIND_OPTS="--leak-check=full --trace-children=no --vgdb=no --num-callers=25" > VALGRIND_OPTS="$VALGRIND_OPTS --log-file=/var/lib/pacemaker/valgrind-%p" > VALGRIND_OPTS="$VALGRIND_OPTS --suppressions=/usr/share/pacemaker/tests/valgrind-pcmk.suppressions" > VALGRIND_OPTS="$VALGRIND_OPTS --gen-suppressions=all" > PCMK_node_start_state="standby" > [root@virt-256 ~]# systemctl restart pacemaker-remote Enabling remote nodes: > [root@virt-248:~]# pcs resource enable virt-256 > [root@virt-248:~]# pcs resource enable virt-261 > [root@virt-248:~]# pcs resource enable virt-257 > [root@virt-248:~]# pcs status > Cluster name: STSRHTS8954 > Cluster Summary: > * Stack: corosync (Pacemaker is running) > * Current DC: virt-249 (version 2.1.6-3.el8-6fdc9deea29) - partition with quorum > * Last updated: Tue Jul 18 00:10:48 2023 on virt-248 > * Last change: Tue Jul 18 00:10:42 2023 by hacluster via crmd on virt-248 > * 5 nodes configured > * 8 resource instances configured > > Node List: > * RemoteNode virt-256: standby > * Online: [ virt-248 virt-249 ] > * RemoteOnline: [ virt-257 virt-261 ] RESULT: It is working well for this option -> 2 remote nodes are online (RNs with "default" and "online") and 1 remote node has standby state (RN with "standby"). Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (pacemaker bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2023:6970 |