Bug 1336945

Summary: [NFS-Ganesha] : stonith-enabled option not set with new versions of cman,pacemaker,corosync and pcs
Product: [Community] GlusterFS Reporter: Kaleb KEITHLEY <kkeithle>
Component: common-haAssignee: Kaleb KEITHLEY <kkeithle>
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: high Docs Contact:
Priority: unspecified    
Version: mainlineCC: asoman, bugs, jthottan, kgaillot, kkeithle, mzywusko, ndevos, nlevinki, skoduri, storage-qa-internal
Target Milestone: ---Keywords: Triaged
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1334092
: 1336947 (view as bug list) Environment:
Last Closed: 2016-12-06 05:18:51 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1334092    
Bug Blocks: 1336947, 1336948    

Comment 1 Vijay Bellur 2016-05-17 21:33:23 UTC
REVIEW: http://review.gluster.org/14404 (common-ha: stonith-enabled option set error in new pacemaker) posted (#1) for review on master by Kaleb KEITHLEY (kkeithle)

Comment 2 Vijay Bellur 2016-05-19 10:17:10 UTC
COMMIT: http://review.gluster.org/14404 committed in master by Kaleb KEITHLEY (kkeithle) 
------
commit 5bd9d42c997695d0c673ae50c1dbb9f4ab23695d
Author: Kaleb S KEITHLEY <kkeithle>
Date:   Tue May 17 17:27:37 2016 -0400

    common-ha: stonith-enabled option set error in new pacemaker
    
    Setting the option too early results in an error in newer versions
    of pacemaker. Postpone setting the option in order for it to succeed.
    
    N.B. We do not use a fencing agent. Yes, we know this is not supported.
    
    Change-Id: I86953fdd67e6736294dbd2d0795611837188bd9d
    BUG: 1336945
    Signed-off-by: Kaleb S KEITHLEY <kkeithle>
    Reviewed-on: http://review.gluster.org/14404
    Smoke: Gluster Build System <jenkins.com>
    NetBSD-regression: NetBSD Build System <jenkins.org>
    CentOS-regression: Gluster Build System <jenkins.com>
    Reviewed-by: soumya k <skoduri>
    Reviewed-by: jiffin tony Thottan <jthottan>

Comment 3 Vijay Bellur 2016-05-19 17:02:42 UTC
REVIEW: http://review.gluster.org/14426 (common-ha: wait for cluster to elect DC before accessing CIB) posted (#1) for review on master by Kaleb KEITHLEY (kkeithle)

Comment 4 Kaleb KEITHLEY 2016-05-23 11:34:56 UTC
Ken Gaillot 2016-05-19 10:50:42 EDT

Your "too early" comment finally rang a bell: What you're seeing is an unfortunately necessary side effect of a fix for an issue in 6.7. The situation was discussed in BZ#1320740.

The bottom line is, yes, you have to wait a short while after startup before querying or writing to the CIB in RHEL 6.8, so the cluster has time to elect a DC. The easiest way is just to loop if the option change fails, or to loop until "crmadmin --dc_lookup --timeout=5000" exits 0 before doing the option change.

Comment 5 Vijay Bellur 2016-05-23 11:35:51 UTC
REVIEW: http://review.gluster.org/14426 (common-ha: wait for cluster to elect DC before accessing CIB) posted (#2) for review on master by Kaleb KEITHLEY (kkeithle)

Comment 6 Vijay Bellur 2016-05-24 09:36:02 UTC
COMMIT: http://review.gluster.org/14426 committed in master by Kaleb KEITHLEY (kkeithle) 
------
commit 4394291064b68f7c1a8634a00b211154087df1bc
Author: Kaleb S KEITHLEY <kkeithle>
Date:   Thu May 19 12:57:58 2016 -0400

    common-ha: wait for cluster to elect DC before accessing CIB
    
    access attempts, e.g. `pcs property set stonith-enabled=false`
    will fail (or time out) if attempted "too early", i.e. before
    the cluster has elected its DC.
    
    Change-Id: Ifc0aa7ce652c1da339b9eb8fe17e40e8a09b1096
    BUG: 1336945
    Signed-off-by: Kaleb S KEITHLEY <kkeithle>
    Reviewed-on: http://review.gluster.org/14426
    CentOS-regression: Gluster Build System <jenkins.com>
    NetBSD-regression: Gluster Build System <jenkins.com>
    Tested-by: Gluster Build System <jenkins.com>
    Reviewed-by: soumya k <skoduri>
    NetBSD-regression: NetBSD Build System <jenkins.org>
    Smoke: Gluster Build System <jenkins.com>
    Reviewed-by: jiffin tony Thottan <jthottan>