Bug 1493969 - stonith configuration is currently broken
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: puppet-pacemaker
Version: 12.0 (Pike)
Hardware: All
OS: Linux
Priority: high
Severity: high
Target Milestone: beta
Target Release: 12.0 (Pike)
Assignee: Michele Baldessari
QA Contact: nlevinki
URL:
Whiteboard:
Depends On:
Blocks: 1509912 1536067
Reported: 2017-09-21 08:53 UTC by Michele Baldessari
Modified: 2018-02-05 19:15 UTC (History)

Fixed In Version: puppet-pacemaker-0.6.1-0.20171015172907.44d3226.el7ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Clones: 1509912
Environment:
Last Closed: 2017-12-13 22:11:04 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Launchpad | 1717531 | 0 | None | None | None | 2017-09-21 08:54:40 UTC
Launchpad | 1717566 | 0 | None | None | None | 2017-09-21 08:54:24 UTC
OpenStack gerrit | 504425 | 0 | None | MERGED | Fix brokenness when changing the value of an existing property | 2020-07-23 07:16:10 UTC
OpenStack gerrit | 504931 | 0 | None | MERGED | Add pcmk_stonith provider | 2020-07-23 07:16:08 UTC
OpenStack gerrit | 504932 | 0 | None | MERGED | Switch the stonith_agent_generator to pcmk_stonith provider | 2020-07-23 07:16:08 UTC
OpenStack gerrit | 506326 | 0 | None | MERGED | Fix up package relationship in generated stonith manifests | 2020-07-23 07:16:08 UTC
Red Hat Product Errata | RHEA-2017:3462 | 0 | normal | SHIPPED_LIVE | Red Hat OpenStack Platform 12.0 Enhancement Advisory | 2018-02-16 01:43:25 UTC

Description Michele Baldessari 2017-09-21 08:53:40 UTC
There are currently two problems when stonith is configured via TripleO.

A) https://bugs.launchpad.net/tripleo/+bug/1717531
parameter_defaults:
  EnableFencing: true
  FencingConfig:
    devices:
    - agent: fence_ipmilan
      host_mac: 00:92:bb:63:40:3d
      params:
        login: admin
        ipaddr: 192.168.24.1
        ipport: 6230
        passwd: password
        lanplus: 1
....

The deploy did not have stonith enabled:
Cluster Properties:
 cluster-infrastructure: corosync
 cluster-name: tripleo_cluster
 dc-version: 1.1.16-12.13.el7.centos-94ff4df
 have-watchdog: false
 redis_REPL_INFO: overcloud-controller-0
 stonith-enabled: false
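
One way to check for this symptom is to look up the stonith-enabled property in a listing like the one above. The following is a minimal sketch, assuming the text has the "key: value" shape shown above. The helper names parse_cluster_properties and stonith_enabled are hypothetical and are not part of pcs or puppet-pacemaker.

```python
def parse_cluster_properties(text):
    """Parse a 'Cluster Properties:' listing into a dict of strings."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the section header itself.
        if not line or line == "Cluster Properties:":
            continue
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep:
            props[key.strip()] = value.strip()
    return props


def stonith_enabled(text):
    """Return True/False from the listing, or None if the property is absent."""
    value = parse_cluster_properties(text).get("stonith-enabled")
    if value is None:
        return None
    return value.lower() == "true"


if __name__ == "__main__":
    sample = """Cluster Properties:
 cluster-infrastructure: corosync
 cluster-name: tripleo_cluster
 have-watchdog: false
 stonith-enabled: false
"""
    print(stonith_enabled(sample))
```

Returning None for a missing property keeps "not listed" distinct from an explicit "false".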

B) https://bugs.launchpad.net/tripleo/+bug/1717566
Stonith creation is racy and not all stonith resources get created.


So the following might happen:
Sep 15 09:43:51 localhost os-collect-config: "Notice: /Stage[main]/Tripleo::Fencing/Pacemaker::Stonith::Fence_ipmilan[00:92:bb:63:40:9f]/Exec[Create stonith-fence_ipmilan-0092bb63409f]/returns: executed successfully",
Sep 15 09:43:51 localhost os-collect-config: "Notice: /Stage[main]/Tripleo::Fencing/Pacemaker::Stonith::Fence_ipmilan[00:92:bb:63:40:9f]/Exec[Add non-local constraint for stonith-fence_ipmilan-0092bb63409f]/returns: Error: Resource 'stonith-fence_ipmilan-0092bb63409f' does not exist"

The cause is the direct calls to the pcs command.
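
To illustrate the race in (B): the constraint step refers to a resource that the cluster may not yet report as existing. The sketch below is not the fix that was merged (the linked reviews add a pcmk_stonith provider). It only shows the general idea of waiting until a resource is visible before acting on it. The names stonith_resource_name, wait_for_resource, and the exists callback are hypothetical, and the naming scheme is inferred from the log lines above.

```python
import time


def stonith_resource_name(agent, host_mac):
    """Build the resource name seen in the logs: stonith-<agent>-<mac without colons>."""
    return "stonith-{}-{}".format(agent, host_mac.replace(":", ""))


def wait_for_resource(name, exists, tries=5, delay=1.0, sleep=time.sleep):
    """Poll exists(name) up to `tries` times; return True once it reports the resource.

    `exists` is a caller-supplied callable (hypothetical here) that asks the
    cluster whether the resource is present.
    """
    for attempt in range(tries):
        if exists(name):
            return True
        # Do not sleep after the final failed attempt.
        if attempt < tries - 1:
            sleep(delay)
    return False


if __name__ == "__main__":
    name = stonith_resource_name("fence_ipmilan", "00:92:bb:63:40:9f")
    # Simulated cluster view: the resource shows up on the third check.
    answers = iter([False, False, True])
    print(name, wait_for_resource(name, lambda _n: next(answers), sleep=lambda _s: None))
```

A caller would add the constraint only when wait_for_resource returns True, and report an error otherwise.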

Comment 2 Michele Baldessari 2017-09-21 18:55:18 UTC
One more small patch is needed. OpenStack Gerrit is currently down; I will link the review here once it is back up.

Comment 3 Michele Baldessari 2017-09-21 19:08:30 UTC
OK, added the last review. While this one (506326) is not essential for TripleO, it would still be good to have.

Comment 4 Michele Baldessari 2017-09-22 14:40:19 UTC
506326 merged, moving to POST again

Comment 9 errata-xmlrpc 2017-12-13 22:11:04 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2017:3462
