Bug 2149113

Summary: pcs can't create MS SQL Server cluster resources
Product: Red Hat Enterprise Linux 8
Reporter: Daniel Yeisley <dyeisley>
Component: resource-agents
Assignee: Oyvind Albrigtsen <oalbrigt>
Status: CLOSED DUPLICATE
QA Contact: cluster-qe <cluster-qe>
Severity: high
Docs Contact:
Priority: unspecified
Version: 8.8
CC: agk, amitkh, cfeist, cluster-maint, fdinitto, idevat, mlisik, mpospisi, omular, sbradley, smerrow, tojeline
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Last Closed: 2023-02-28 15:23:02 UTC
Type: Bug

Description Daniel Yeisley 2022-11-28 21:29:04 UTC
Description of problem:
I test MS SQL Server in a multi-node availability group configuration. On RHEL 8.8, pcs fails to create the availability group resource.

Version-Release number of selected component (if applicable):
pcs-0.10.15-1.el8.x86_64

How reproducible:



Steps to Reproduce:
1. Install MS SQL Server + SQL Server HA packages.
2. Create a two-node cluster (pcs host auth, pcs cluster setup, etc.).
3. Attempt to create the SQL Server availability group (ag) resource.

Actual results:
[root@isvqe-01 ~]# pcs resource create ag_cluster ocf:mssql:ag ag_name=ag1 meta failure-timeout=80s promotable on-fail="demote" notify=true
Error: Validation result from agent (use --force to override):
  Nov 28 16:08:36 INFO: mssql_validate
  Nov 28 16:08:36 INFO: OCF_RESKEY_CRM_meta_interval value: 0
  Nov 28 16:08:36 INFO: OCF_RESKEY_CRM_meta_timeout value: 20000
  ocf-exit-reason:Resource must be configured with notify=true
  Nov 28 16:08:36 INFO: Resource agent invoked with: validate-all

Downgraded the pcs package to 0.10.14-5.el8 and tried again; this time the resource was created.

[root@isvqe-01 ~]# rpm -Uvh --force pcs-0.10.14-5.el8.x86_64.rpm 
warning: pcs-0.10.14-5.el8.x86_64.rpm: Header V3 RSA/SHA256 Signature, key ID fd431d51: NOKEY
Verifying...                          ################################# [100%]
Preparing...                          ################################# [100%]
Updating / installing...
   1:pcs-0.10.14-5.el8                ################################# [ 50%]
Cleaning up / removing...
   2:pcs-0.10.15-1.el8                ################################# [100%]
/sbin/ldconfig: /usr/lib64/llvm14/lib/libclang.so.13 is not a symbolic link

[root@isvqe-01 ~]# pcs status
Cluster name: isvqe-cluster

WARNINGS:
Following resources have been moved and their move constraints are still in place: 'virtualip'
Run 'pcs constraint location' or 'pcs resource clear <resource id>' to view or remove the constraints, respectively

Status of pacemakerd: 'Pacemaker is running' (last updated 2022-11-28 16:16:27 -05:00)
Cluster Summary:
  * Stack: corosync
  * Current DC: isvqe-01 (version 2.1.5-2.el8-631339ca5aa) - partition with quorum
  * Last updated: Mon Nov 28 16:16:27 2022
  * Last change:  Mon Nov 28 16:07:36 2022 by hacluster via crmd on isvqe-01
  * 2 nodes configured
  * 1 resource instance configured

Node List:
  * Online: [ isvqe-01 isvqe-02 ]

Full List of Resources:
  * virtualip	(ocf::heartbeat:IPaddr2):	 Started isvqe-01

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/disabled

[root@isvqe-01 ~]# sudo pcs resource create ag_cluster ocf:mssql:ag ag_name=ag1 meta failure-timeout=60s promotable notify=true

[root@isvqe-01 ~]# pcs status
Cluster name: isvqe-cluster

WARNINGS:
Following resources have been moved and their move constraints are still in place: 'virtualip'
Run 'pcs constraint location' or 'pcs resource clear <resource id>' to view or remove the constraints, respectively

Status of pacemakerd: 'Pacemaker is running' (last updated 2022-11-28 16:16:39 -05:00)
Cluster Summary:
  * Stack: corosync
  * Current DC: isvqe-01 (version 2.1.5-2.el8-631339ca5aa) - partition with quorum
  * Last updated: Mon Nov 28 16:16:40 2022
  * Last change:  Mon Nov 28 16:16:33 2022 by root via cibadmin on isvqe-01
  * 2 nodes configured
  * 3 resource instances configured

Node List:
  * Online: [ isvqe-01 isvqe-02 ]

Full List of Resources:
  * virtualip	(ocf::heartbeat:IPaddr2):	 Started isvqe-01
  * Clone Set: ag_cluster-clone [ag_cluster] (promotable):
    * ag_cluster	(ocf::mssql:ag):	 Master isvqe-01 (Monitoring)
    * ag_cluster	(ocf::mssql:ag):	 Slave isvqe-02 (Monitoring)

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/disabled
[root@isvqe-01 ~]# 
[root@isvqe-01 ~]# cat /etc/redhat-release 
Red Hat Enterprise Linux release 8.8 Beta (Ootpa)


Expected results:


Additional info:
MS SQL Server packages installed:
[root@isvqe-01 ~]# rpm -qa | grep ^mssql 
mssql-server-15.0.4261.1-2.x86_64
mssql-tools-17.10.1.1-1.x86_64
mssql-server-ha-15.0.4261.1-2.x86_64
mssql-server-fts-15.0.4261.1-2.x86_64

Comment 1 Tomas Jelinek 2022-11-29 08:26:21 UTC
This is related to the new pcs feature that runs the validate-all action of resource agents (bz1816852). Moving to resource-agents for investigation.
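
For illustration, the suspected failure mode can be sketched in shell. Pacemaker passes meta attributes to an agent as OCF_RESKEY_CRM_meta_<name> environment variables, so a validate-all check on notify fails whenever the caller does not put that variable in the agent's environment. The function validate_notify below is hypothetical and is not the ocf:mssql:ag code; the accepted truthy values and the return code 6 (OCF_ERR_CONFIGURED) are assumptions, and so is the idea that the variable was missing when pcs ran validate-all here.

```shell
#!/bin/sh
# Hypothetical stand-in for an agent's validate-all check on notify.
# This is NOT the real ocf:mssql:ag implementation.
validate_notify() {
    # Pacemaker exports meta attributes as OCF_RESKEY_CRM_meta_<name>.
    case "${OCF_RESKEY_CRM_meta_notify:-false}" in
        true|on|yes|1)
            return 0 ;;   # OCF_SUCCESS
        *)
            echo "ocf-exit-reason:Resource must be configured with notify=true" >&2
            return 6 ;;   # OCF_ERR_CONFIGURED (assumed choice of code)
    esac
}

# Standalone call without the meta attribute in the environment: rejected.
rc=0
( unset OCF_RESKEY_CRM_meta_notify; validate_notify ) || rc=$?
echo "without meta variable: rc=$rc"

# Same check with the variable set, as Pacemaker does for notify=true.
rc=0
( OCF_RESKEY_CRM_meta_notify=true; validate_notify ) || rc=$?
echo "with meta variable: rc=$rc"
```

Under these assumptions the first call reports the same ocf-exit-reason as in the description, even though notify=true was given on the pcs command line, and the second call passes.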

Comment 10 Chris Feist 2023-02-28 15:23:02 UTC
Closing this issue as it has been resolved by pcs in bz1816852.

*** This bug has been marked as a duplicate of bug 1816852 ***