Bug 773423 - HA LVM mirror repair can fail due to service relocation
Summary: HA LVM mirror repair can fail due to service relocation
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: rgmanager
Version: 5.8
Hardware: x86_64
OS: Linux
medium
medium
Target Milestone: rc
: ---
Assignee: Jonathan Earl Brassow
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks: 730141 807971
TreeView+ depends on / blocked
 
Reported: 2012-01-11 19:35 UTC by Corey Marthaler
Modified: 2014-05-30 21:41 UTC (History)
13 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of: 692186
Environment:
Last Closed: 2012-04-18 21:08:57 UTC
Target Upstream Version:


Attachments (Terms of Use)

Comment 1 Corey Marthaler 2012-01-11 19:38:33 UTC
Jan 11 11:39:56 taft-03 lvm[8381]: Mirror status: 1 of 2 images failed.
Jan 11 11:39:56 taft-03 lvm[8381]: Trying to up-convert to 2 images, 1 logs.
Jan 11 11:40:05 taft-03 lvm[8381]: TAFT/ha1: Converted: 0.9%
Jan 11 11:40:10 taft-03 clurgmgrd: [11296]: <notice> Deactivating TAFT/ha1
Jan 11 11:40:10 taft-03 clurgmgrd: [11296]: <notice> Making resilient : lvchange -an TAFT/ha1
Jan 11 11:40:12 taft-03 clurgmgrd: [11296]: <notice> Resilient command: lvchange -an TAFT/ha1 --config devices{filter=["a|/dev/sda2|","a|/dev/sdb1|","a|/dev/sdc1|","a|/dev/sdd1|","a|/dev/sdf1|","a|/dev/sdg1|","a|/dev/sdh1|","a|unknown|","a|device|","r|.*|"]}
Jan 11 11:40:15 taft-03 clurgmgrd: [11296]: <notice> Removing ownership tag (taft-03) from TAFT/ha1
Jan 11 11:40:16 taft-03 clurgmgrd: [11296]: <err> Unable to delete tag from TAFT/ha1
Jan 11 11:40:16 taft-03 clurgmgrd: [11296]: <err> Attempting volume group clean-up and retry
Jan 11 11:40:18 taft-03 clurgmgrd[11296]: <notice> Service service:halvm is stopped
Jan 11 11:40:20 taft-03 lvm[8381]: ABORTING: Mirror percentage check failed.
Jan 11 11:40:20 taft-03 lvm[8381]: Repair of mirrored LV TAFT/ha1 failed.
Jan 11 11:40:20 taft-03 lvm[8381]: Failed to remove faulty devices in TAFT-ha1.
Jan 11 11:40:20 taft-03 lvm[8381]: dm_task_run failed, errno = 6, No such device or address
Jan 11 11:40:20 taft-03 lvm[8381]: TAFT-ha1 disappeared, detaching
Jan 11 11:40:20 taft-03 lvm[8381]: No longer monitoring mirror device TAFT-ha1 for events.




2.6.18-301.el5

lvm2-2.02.88-5.el5    BUILT: Fri Dec  2 12:25:45 CST 2011
lvm2-cluster-2.02.88-5.el5    BUILT: Fri Dec  2 12:48:37 CST 2011
device-mapper-1.02.67-2.el5    BUILT: Mon Oct 17 08:31:56 CDT 2011
device-mapper-event-1.02.67-2.el5    BUILT: Mon Oct 17 08:31:56 CDT 2011
cmirror-1.1.39-14.el5    BUILT: Wed Nov  2 17:25:33 CDT 2011
kmod-cmirror-0.1.22-3.el5    BUILT: Tue Dec 22 13:39:47 CST 2009
rgmanager-2.0.52-28.el5      BUILT:  Wed Jan 11 08:44:24 AM CST 2012

Comment 2 RHEL Program Management 2012-04-02 10:42:31 UTC
This request was evaluated by Red Hat Product Management for inclusion
in a Red Hat Enterprise Linux release.  Product Management has
requested further review of this request by Red Hat Engineering, for
potential inclusion in a Red Hat Enterprise Linux release for currently
deployed products.  This request is not yet committed for inclusion in
a release.

Comment 3 Jonathan Earl Brassow 2012-04-10 16:41:06 UTC
Comment #1 should maybe have been posted in 756180?

This bug concerns the problem of the 'lvconvert' poll daemon trying to report on an LV that has been deactivated in another terminal.  Here is some select editing from the original bug:

"There is no failure here, just scary messages.  Here's what's happening:

1) Test kills one of mirror legs
2) Mirror is down-converted to linear to remove faulty device
3) Mirror is up-converted to replace faulty device (causes new log device too)
4) Poll daemon is launched to track the progress of the up-convert
5) Test forces relocation of service
6) Poll daemon fails to track sync progress b/c mirror has moved.
7) Mirror starts safely and cleanly on alternate machine
*) Scary messages remain on original node.

This analysis is verifiably correct.  Simply try the following:
terminal1~> lvcreate -L 1G -n lv vg
terminal1~> lvconvert -m1 vg/lv
terminal2~> lvchange -an vg/lv

<output from terminal1>
[root@bp-01 ~]# lvcreate -L 1G -n lv vg
  Logical volume "lv" created
[root@bp-01 ~]# lvconvert -m1 vg/lv
  vg/lv: Converted: 0.0%
  ABORTING: Mirror percentage check failed."

This bug is very low priority.


Note You need to log in before you can comment on or make changes to this bug.