1499217 – Cleanup of bundle resource is incomplete

Summary: Cleanup of bundle resource is incomplete

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 7
Classification:	Red Hat
Component:	pacemaker
Sub Component:
Version:	7.4
Hardware:	Unspecified
OS:	Unspecified
Priority:	urgent
Severity:	urgent
Target Milestone:	rc
Target Release:	7.5
Assignee:	Andrew Beekhof
QA Contact:	pkomarov
Docs Contact:
URL:
Whiteboard:
Duplicates (1):	1505909
Depends On:
Blocks:	1494455 1509874 1514520

Reported:	2017-10-06 11:48 UTC by Damien Ciabrini
Modified:	2018-04-10 15:34 UTC
CC List:	9 users
Fixed In Version:	pacemaker-1.1.18-4.el7
Doc Type:	No Doc Update
Doc Text:	Previously, the "pcs resource cleanup" command ignored stopped child clone resources of a bundle. Consequently, it was not possible to erase the state of the resources. With this update, Pacemaker now recognizes stopped clone resources. As a result, the pcs tool now works correctly with bundles when cleaning up.
Clone Of:
Clones:	1509874 1514520
Environment:
Last Closed:	2018-04-10 15:32:51 UTC
Target Upstream Version:
Embargoed:

Attachments
CIB before pcs cleanup resource galera (26.11 KB, text/plain) 2017-10-06 11:51 UTC, Damien Ciabrini
CIB after pcs resource cleanup galera (27.26 KB, text/plain) 2017-10-06 11:52 UTC, Damien Ciabrini
output of pcs resource cleanup galera (268 bytes, text/plain) 2017-10-06 11:52 UTC, Damien Ciabrini
galera configuration (882 bytes, text/plain) 2017-10-06 12:01 UTC, Damien Ciabrini
crm_report of the unexpected restart (213.20 KB, application/x-bzip) 2017-11-07 20:18 UTC, Damien Ciabrini

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHEA-2018:0860	0	None	None	None	2018-04-10 15:34:11 UTC

Description Damien Ciabrini 2017-10-06 11:48:57 UTC

Description of problem:
I'm running a galera resource in a bundle with "op promote on-fail=block".
I'm forcing the galera server to fail during promotion, which correctly leaves the resource in FAILED state on one node and "blocked" from restarting.

Now when doing "pcs resource cleanup galera", I can see that the failcount on the resource is correctly cleaned up:

[root@centos2 ~]# pcs resource cleanup galera
Cleaning up galera:0 on galera-bundle-0, removing fail-count-galera
Cleaning up galera:1 on galera-bundle-1, removing fail-count-galera
Cleaning up galera:2 on galera-bundle-2, removing fail-count-galera

but the resource still shows up as "FAILED (blocked)" in pcs status.

Attached are two dumps of the CIB before the cleanup, and after the cleanup.

Running diff -u on the two files shows the following:
       <transient_attributes id="galera-bundle-2">
         <instance_attributes id="status-galera-bundle-2">
-          <nvpair id="status-galera-bundle-2-fail-count-galera" name="fail-count-galera" value="1"/>
           <nvpair id="status-galera-bundle-2-last-failure-galera" name="last-failure-galera" value="1507289301"/>
         </instance_attributes>
       </transient_attributes>

This shows that the fail-count is cleaned up, but apparently the last-failure is not.
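
For anyone who wants to check a CIB dump for leftover failure attributes without eyeballing a diff, here is a minimal Python sketch. It is not part of pacemaker or pcs. The function name leftover_failure_attrs is hypothetical, the <fragment> root exists only so the snippet parses as one document, and the attribute names are matched exactly as they appear in this report (other Pacemaker versions may name them differently).

```python
import xml.etree.ElementTree as ET

# Trimmed post-cleanup fragment taken from the diff in this report.
# The <fragment> wrapper is an assumption added only for parsing.
CIB_AFTER = """
<fragment>
  <transient_attributes id="galera-bundle-2">
    <instance_attributes id="status-galera-bundle-2">
      <nvpair id="status-galera-bundle-2-last-failure-galera"
              name="last-failure-galera" value="1507289301"/>
    </instance_attributes>
  </transient_attributes>
</fragment>
"""


def leftover_failure_attrs(xml_text, resource):
    """Return {transient_attributes id: {attribute name: value}} for the
    fail-count and last-failure attributes of `resource` that are still
    present in the given XML text."""
    root = ET.fromstring(xml_text)
    wanted = {f"fail-count-{resource}", f"last-failure-{resource}"}
    found = {}
    for ta in root.iter("transient_attributes"):
        for nv in ta.iter("nvpair"):
            if nv.get("name") in wanted:
                found.setdefault(ta.get("id"), {})[nv.get("name")] = nv.get("value")
    return found


if __name__ == "__main__":
    # Lists whatever failure attributes survived the cleanup.
    print(leftover_failure_attrs(CIB_AFTER, "galera"))
```

Run against the full "after" attachment instead of the trimmed fragment, the same function would list every bundle node that still carries a failure attribute for galera.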


Version-Release number of selected component (if applicable):
pacemaker-1.1.16-12.el7_4.3

How reproducible:
Always

Steps to Reproduce:
There might be a simpler reproducer, but here's the procedure with equivalent packages from RDO and the galera resource.

On a three node cluster centos1, centos2, centos3

1. pull the container image on all nodes
  docker pull docker.io/tripleomaster/centos-binary-mariadb:passed-ci-test

2. prepare the hosts
  touch /foo # create an empty file on the host
  # install the attached galera.cnf in /etc/my.cnf.d/galera.cnf and adapt the host names 

3. create a bundle
  pcs resource bundle create galera-bundle container docker image=docker.io/tripleomaster/centos-binary-mariadb:passed-ci-test replicas=3 masters=3 network=host options="--user=root --log-driver=journald" run-command="/usr/sbin/pacemaker_remoted" network control-port=3123 storage-map id=map1 source-dir=/foo target-dir=/etc/libqb/force-filesystem-sockets options=ro storage-map id=map2 source-dir=/etc/my.cnf.d/galera.cnf target-dir=/etc/my.cnf.d/galera.cnf options=ro storage-map id=map3 source-dir=/var/lib/mysql target-dir=/var/lib/mysql options=rw --disabled

4. create the galera resource inside the bundle (adapt the host names)
  pcs resource create galera galera enable_creation=true wsrep_cluster_address='gcomm://centos1,centos2,centos3' cluster_host_map='centos1:centos1;centos2:centos2;centos3:centos3' op promote on-fail=block meta container-attribute-target=host bundle galera-bundle

5. start the bundle a first time to bootstrap the galera cluster
  pcs resource enable galera-bundle

6. once all nodes are in master, stop the bundle to stop the galera cluster
  pcs resource disable galera-bundle

7. on third node, break galera internals to force a failure at next restart
  dd if=/dev/null of=/var/lib/mysql/gvwstate.dat

8. restart the bundle and wait for the galera resource on centos3 to FAIL
  pcs resource enable galera-bundle
  

Actual results:
The resource is still blocked and out of pacemaker's control after cleanup.

Expected results:
The resource should be managed by pacemaker again (in "Slave" state after the cleanup, with pacemaker resuming its scheduling).

Additional info:

Comment 2 Damien Ciabrini 2017-10-06 11:51:42 UTC

Created attachment 1335243 [details]
CIB before pcs cleanup resource galera

Comment 3 Damien Ciabrini 2017-10-06 11:52:24 UTC

Created attachment 1335244 [details]
CIB after pcs resource cleanup galera

Comment 4 Damien Ciabrini 2017-10-06 11:52:56 UTC

Created attachment 1335245 [details]
output of pcs resource cleanup galera

Comment 5 Damien Ciabrini 2017-10-06 12:01:37 UTC

Created attachment 1335281 [details]
galera configuration

Comment 6 Ken Gaillot 2017-10-06 14:20:47 UTC

As with clones, the upstream recommendation is to always operate on the bundle resource, never its primitive. I think pcs automatically translates it for clones, and it might be a good idea to do that with bundles, too. But I agree this is an odd outcome worth looking into.

Comment 7 Andrew Beekhof 2017-10-09 10:40:52 UTC

(In reply to Ken Gaillot from comment #6)
> As with clones, the upstream recommendation is to always operate on the
> bundle resource, never its primitive. I think pcs automatically translates
> it for clones, and it might be a good idea to do that with bundles, too. But
> I agree this is an odd outcome worth looking into.

I don't buy this.
crm_resource/pcs automatically escalates the request from the primitive to the clone.
The only difference here is that it doesn't go all the way up to the bundle.
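
To make the difference concrete, here is a toy Python model of the two escalation behaviours. This is an illustration of the reasoning above, not the crm_resource or pcs code. The Resource class, both function names, and the clone name "galera-bundle-master" are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Resource:
    name: str
    kind: str  # "primitive", "clone" or "bundle" in this toy model
    parent: Optional["Resource"] = None


def escalate_to_clone(rsc):
    """Behaviour described in this comment: walk up from the primitive
    and stop at the enclosing clone, never reaching the bundle."""
    while rsc.parent is not None and rsc.kind != "clone":
        rsc = rsc.parent
    return rsc


def escalate_to_top(rsc):
    """Walk up until there is no parent left, so a request made on the
    primitive ends up applying to the bundle."""
    while rsc.parent is not None:
        rsc = rsc.parent
    return rsc


# Hypothetical hierarchy mirroring the setup in this report.
bundle = Resource("galera-bundle", "bundle")
clone = Resource("galera-bundle-master", "clone", bundle)
primitive = Resource("galera", "primitive", clone)

if __name__ == "__main__":
    print(escalate_to_clone(primitive).name)
    print(escalate_to_top(primitive).name)
```

In this model a cleanup requested on "galera" stops one level short of the bundle with the first function and reaches it with the second.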

Comment 8 Andrew Beekhof 2017-10-17 22:49:35 UTC

Fixed by the following:

https://github.com/beekhof/pacemaker/commit/a6466923875cb752cb68ad412cfc8296191e62ac

https://github.com/beekhof/pacemaker/commit/b0ca9a11581e3ec62429e41899f76fe3afc8b294

https://github.com/beekhof/pacemaker/commit/c3d4ec0377a5e742a7aca5b129139f1ad970e4f7

Comment 11 Ken Gaillot 2017-11-06 17:11:03 UTC

*** Bug 1505909 has been marked as a duplicate of this bug. ***

Comment 12 Damien Ciabrini 2017-11-07 20:12:16 UTC

As noted in https://bugzilla.redhat.com/show_bug.cgi?id=1505909, comment #7, I tested a scratch build with the provided patch and I can now clean errors by doing "pcs resource cleanup galera-bundle". I can also reprobe the state of an unmanaged resource.

However, I now face another issue: when I run "pcs resource manage galera-bundle" after the cleanup, a restart operation is triggered, which is unexpected and breaks the idiomatic way of "reprobing the current state of a resource before giving control back to pacemaker".

Comment 13 Damien Ciabrini 2017-11-07 20:18:08 UTC

Created attachment 1349106 [details]
crm_report of the unexpected restart

Attached crm_report of the unexpected restart:
Nov 07 21:01:15 ra1 crmd[5111]:   notice: State transition S_IDLE -> S_POLICY_ENGINE
Nov 07 21:01:15 ra1 pengine[5110]:   notice:  * Restart    galera:2                   (          Master galera-bundle-2 )
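
When scanning a longer log for such unexpected actions, a small Python sketch like the following can pull the scheduled action out of each line. The regular expression is fitted only to the pengine line quoted above and is an assumption about the format; other Pacemaker versions may log these lines differently. The names ACTION_RE and parse_action are hypothetical.

```python
import re

# Fitted to the single scheduler line quoted in this comment; not a
# general parser for Pacemaker logs.
ACTION_RE = re.compile(
    r"pengine\[\d+\]:\s+notice:\s+\*\s+(?P<action>\S+)\s+(?P<resource>\S+)"
    r"\s+\(\s*(?P<detail>.*?)\s*\)"
)


def parse_action(line):
    """Return a dict with action, resource and detail, or None when the
    line is not a scheduler action line in the format shown above."""
    m = ACTION_RE.search(line)
    return m.groupdict() if m else None


# The two log lines from this comment, copied verbatim.
LOG = [
    "Nov 07 21:01:15 ra1 crmd[5111]:   notice: State transition S_IDLE -> S_POLICY_ENGINE",
    "Nov 07 21:01:15 ra1 pengine[5110]:   notice:  * Restart    galera:2"
    "                   (          Master galera-bundle-2 )",
]

if __name__ == "__main__":
    for line in LOG:
        print(parse_action(line))
```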

Comment 14 Ken Gaillot 2017-11-07 21:37:23 UTC

(In reply to Damien Ciabrini from comment #12)
> As noted in https://bugzilla.redhat.com/show_bug.cgi?id=1505909, comment #7,
> I tested a scratch build with the provided patch and I can now clean errors
> by doing "pcs resource cleanup galera-bundle". I can also reprobe the state
> of an unmanaged resource.
> 
> However, I now face another issue: when I run "pcs resource manage
> galera-bundle" after the cleanup, a restart operation is triggered, which is
> unexpected and breaks the idiomatic way of "reprobing the current state of a
> resource before giving control back to pacemaker".

To clarify, the scratch build is for the z-stream Bug 1509874. Will comment there.

Comment 15 Artem Hrechanychenko 2017-11-17 16:11:06 UTC

Move to POST because in latest puddle - 
http://download.lab.bos.redhat.com/rcm-guest/puddles/OpenStack/12.0-RHEL-7/2017-11-16.4/

pacemaker-1.1.16-12.el7_4.4.x86_64

Comment 16 Omri Hochman 2017-11-17 16:25:45 UTC

(In reply to Artem Hrechanychenko from comment #15)
> Move to POST because in latest puddle - 
> http://download.lab.bos.redhat.com/rcm-guest/puddles/OpenStack/12.0-RHEL-7/
> 2017-11-16.4/
> 
> pacemaker-1.1.16-12.el7_4.4.x86_64

Switching back to ON_QA, as this is a RHEL BZ.
I'm cloning this bug to be verified on OSP12, as it blocks the replace-controller scenario.

Comment 17 pkomarov 2018-01-11 13:02:14 UTC

Resolved: the cluster retains active control after the galera node resumes its active status.

After the steps in the Description, the galera resource is active on all nodes:

Full list of resources:

   galera-bundle-0	(ocf::heartbeat:galera):	Master controller-0
   galera-bundle-1	(ocf::heartbeat:galera):	Master controller-1
   galera-bundle-2	(ocf::heartbeat:galera):	Master controller-2

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled


As indicated by the logs:

process_lrm_event:  Result of monitor operation for galera-bundle-docker-0 on controller-0: 0 (ok)
remote_node_up:     Announcing pacemaker_remote node galera-bundle-0
erase_status_tag:   Deleting lrm status entries for galera-bundle-0 | xpath=//node_state[@uname='galera-bundle-0']/lrm
erase_status_tag:   Deleting transient_attributes status entries for galera-bundle-0 | xpath=//node_state[@uname='galera-bundle-0']/transient_attributes
crm_update_peer_state_iter: Node galera-bundle-0 state is now member | nodeid=0 previous=lost source=remote_node_up
peer_update_callback:       Remote node galera-bundle-0 is now member (was lost)
send_remote_state_message:  Notifying DC controller-2 of pacemaker_remote node galera-bundle-0 coming up

Comment 20 errata-xmlrpc 2018-04-10 15:32:51 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2018:0860
