Bug 614421

Summary: clusterfs ras does not umount because of wrong check in the code
Product: Red Hat Enterprise Linux 6
Reporter: Fabio Massimo Di Nitto <fdinitto>
Component: resource-agents
Assignee: Lon Hohberger <lhh>
Status: CLOSED CURRENTRELEASE
QA Contact: Cluster QE <mspqa-list>
Severity: medium
Priority: low
Version: 6.0
CC: cluster-maint, cmarthal, lhh, ssaha
Target Milestone: rc
Hardware: All
OS: Linux
Fixed In Version: resource-agents-3.0.12-13.el6
Doc Type: Bug Fix
Last Closed: 2010-11-10 22:16:08 UTC
Attachments: Fix

Description Fabio Massimo Di Nitto 2010-07-14 13:11:08 UTC
-       if [ -z "$force_umount" ]; then
+       if [ -z "$OCF_RESKEY_force_unmount" ]; then

the check tests a variable that is never set at that point, so it always behaves as if force_unmount were off; even with force_unmount=yes specified in the config, the comparison looks at the wrong variable.

The result is that a clusterfs is never unmounted when rgmanager exits or the service migrates.
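
For context, rgmanager hands resource parameters to the agent as OCF_RESKEY_* environment variables, so the guard has to test OCF_RESKEY_force_unmount (or a local variable derived from it). A minimal sketch of the failure mode; the function and surrounding logic are illustrative, not the actual fs-lib.sh code:

#!/bin/sh
# Parameters arrive from rgmanager in the environment, e.g.:
#   OCF_RESKEY_force_unmount=yes
stop_filesystem() {
        mountpoint="$1"

        # Buggy guard: $force_umount is never assigned here, so -z is
        # always satisfied and the forced unmount below never runs,
        # whatever the config says.
        #if [ -z "$force_umount" ]; then
        #        return 0
        #fi

        # Fixed guard: test the parameter the agent actually receives.
        if [ -z "$OCF_RESKEY_force_unmount" ]; then
                return 0
        fi

        # Forced unmount: kill remaining holders, then unmount.
        fuser -km "$mountpoint"
        umount "$mountpoint"
}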

Comment 1 Lon Hohberger 2010-07-14 13:56:08 UTC
This is because in the test, force_unmount="on".  utils/fs-lib.sh checks for "yes" and "1", but not "on".

So either we need to change it in fs-lib.sh or fix it in luci.
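
One tolerant approach on the fs-lib.sh side is to normalize the common truthy spellings before the guard runs; a sketch of the idea, not the exact patch in the attached fix:

# Accept the spellings luci and hand-edited configs actually use.
case "$OCF_RESKEY_force_unmount" in
[Yy][Ee][Ss]|[Oo][Nn]|[Tt][Rr][Uu][Ee]|1)
        force_umount=1
        ;;
*)
        force_umount=""
        ;;
esac

Accepting "on" and "true" alongside "yes" and "1" avoids a luci-side change; the test run in comment 3 exercises exactly those spellings.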

Comment 2 Lon Hohberger 2010-07-14 14:21:48 UTC
Created attachment 431787
Fix

Comment 3 Lon Hohberger 2010-07-14 14:27:07 UTC
[root@localhost resources]# fs-test/clusterfs-test.sh
Creating holder program...OK
Creating file system image /tmp/test-23154/fs-image (50MB)...OK
Creating gfs2 filesystem...OK
Basic tests:
- Status:OK
- Start:OK
- Status:OK
- Status (depth 10):OK
- Status (depth 20):OK
- Stopping (no force_unmount; should stay mounted):OK
- Stopping (force_unmount):OK
Basic Tests PASSED
Testing service-level reference count handling...OK
Clearing service-level reference count and unmounting...OK
Testing clearing a reference with force_unmount set...OK
Testing clearing a reference on subdirectory with force_unmount set...OK
Testing clearing a reference on subdirectory with force_unmount set to 'on'...OK
Testing clearing a reference on subdirectory with force_unmount set to 'true'...OK
Testing clearing a reference on subdirectory with force_unmount set to 'yes'...OK
Testing a known reference with a near-match: [/tmp/test-23154/mountfoo]...OK (holder was not killed)
Testing a known reference with a near-match: [/tmp/test-23154/mount foo]...OK (holder was not killed)
Testing a known reference with a near-match: [/tmp/test-23154/mount foo]...OK (holder was not killed)
Testing a known reference with a near-match: [/tmp/test-23154/mount /tmp/test-23154/mount]...OK (holder was not killed)
Testing a known reference with a near-match: [/tmp/test-23154/tmp/test-23154/mount]...OK (holder was not killed)
Testing clearing a process with SIGTERM blocked with force_unmount set...OK
Testing clearing a process living in the mountpoint...OK
Testing clearing a process living in the mountpoint with SIGTERM blocked...OK

Comment 7 Corey Marthaler 2010-08-27 20:47:05 UTC
<clusterfs device="/dev/grant/gfs1" force_unmount="1" fstype="gfs2" mountpoint="/mnt/gfs1" name="gfs1" options="" nfslock="1"/>

I was successfully able to relocate this fs resource with the force_unmount on and with both server and client references to it.

Aug 27 15:37:43 grant-01 qarshd[8134]: Running cmdline: clusvcadm -r HA_GFS -m grant-02
Aug 27 15:37:43 grant-01 rgmanager[2214]: Stopping service service:HA_GFS
Aug 27 15:37:43 grant-01 rgmanager[8168]: Removing IPv4 address 10.15.89.209/24 from eth0
Aug 27 15:37:53 grant-01 rgmanager[8213]: Removing export: *:/mnt/gfs1
Aug 27 15:37:54 grant-01 rgmanager[8270]: Dropping node-wide NFS locks
Aug 27 15:37:54 grant-01 rgmanager[8303]: Asking lockd to drop locks (pid 3054)
Aug 27 15:37:54 grant-01 rgmanager[8347]: Sending reclaim notifications via grant-01.lab.msp.redhat.com
Aug 27 15:37:54 grant-01 sm-notify[8369]: Version 1.2.2 starting
Aug 27 15:37:54 grant-01 sm-notify[8369]: Already notifying clients; Exiting!
Aug 27 15:37:58 grant-01 rgmanager[8425]: unmounting /mnt/gfs1
Aug 27 15:37:58 grant-01 rgmanager[8470]: Sending SIGTERM to processes on /mnt/gfs1
Aug 27 15:38:03 grant-01 rgmanager[8532]: unmounting /mnt/gfs1
Aug 27 15:38:04 grant-01 rgmanager[8577]: Sending SIGKILL to processes on /mnt/gfs1
Aug 27 15:38:09 grant-01 rgmanager[8640]: unmounting /mnt/gfs1
Aug 27 15:38:09 grant-01 rgmanager[2214]: Service service:HA_GFS is stopped
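
For anyone re-running this check by hand, a minimal sketch (only the clusvcadm invocation appears in the log above; the sleep and mountpoint probe are my additions):

clusvcadm -r HA_GFS -m grant-02   # relocate the service
sleep 30                          # allow rgmanager to finish the stop
if mountpoint -q /mnt/gfs1; then
        echo "FAIL: /mnt/gfs1 still mounted on the old owner"
else
        echo "OK: force_unmount released /mnt/gfs1"
fi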

Marking verified with the latest build.
2.6.32-70.el6.x86_64
rgmanager-3.0.12-10.el6.x86_64

Comment 8 releng-rhel@redhat.com 2010-11-10 22:16:08 UTC
Red Hat Enterprise Linux 6.0 is now available and should resolve
the problem described in this bug report. This report is therefore being closed
with a resolution of CURRENTRELEASE. You may reopen this bug report if the
solution does not work for you.