Bug 206470 - gfs recovery mixed with unmounting can hang
gfs recovery mixed with unmounting can hang
Product: Red Hat Cluster Suite
Classification: Red Hat
Component: gfs (Show other bugs)
All Linux
medium Severity medium
: ---
: ---
Assigned To: David Teigland
GFS Bugs
Depends On:
  Show dependency treegraph
Reported: 2006-09-14 12:16 EDT by David Teigland
Modified: 2010-01-11 22:13 EST (History)
0 users

See Also:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Last Closed: 2007-08-13 16:14:29 EDT
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---

Attachments (Terms of Use)

  None (edit)
Description David Teigland 2006-09-14 12:16:12 EDT
Description of problem:

gfs/lock_dlm get a callback to do recovery at the same time that
a local gfs unmount happens.

lock_dlm prints "pr_start 31060 skip for umount/wd"
and tries to do a kcl_service_leave() which won't work
because the service (in SM) is still in recovery state 2
and needs a start_done() ack from lock_dlm.  In this case,
the node that got the unmount and recovery callback at
the same time was the only node with the fs mounted.

Version-Release number of selected component (if applicable):

How reproducible:

Do a test with lots of mounting/unmounting and throw in some node
failures and you'll run into this.

Steps to Reproduce:
Actual results:

Expected results:

Additional info:
Comment 2 David Teigland 2006-10-17 12:48:42 EDT
This will require a lot of work, will put it off until it becomes
an issue for someone.
Comment 3 Kiersten (Kerri) Anderson 2006-11-10 11:24:46 EST
Moving out for consideration for 4.6
Comment 4 David Teigland 2007-08-13 16:14:29 EDT
This issue has never actually been seen, so not planning on changing it
(which would be a high regression risk).

Note You need to log in before you can comment on or make changes to this bug.