Bug 198821
Summary: | logical volume corruption during mirror creation/deletion loop | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Corey Marthaler <cmarthal> |
Component: | lvm2 | Assignee: | Jonathan Earl Brassow <jbrassow> |
Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 4.4 | CC: | agk, dwysocha, jbrassow, k.georgiou, mbroz |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | RHBA-2007-0287 | Doc Type: | Bug Fix |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2007-05-08 00:03:10 UTC | Type: | --- |
Description
Corey Marthaler
2006-07-13 21:25:08 UTC
Reproduced this situation with the newly built cmirror-kernel and the 2.6.9-42.BRsmp kernel.

Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/dm-2: stat failed: No such file or directory
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/dm-2 no longer valid for device(253,2)
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mlog: stat failed: No such fily
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mlog no longer valid for )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/g
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/dm-3: stat failed: No such file or directory
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/dm-3 no longer valid for device(253,3)
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_0: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_0 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/0
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/dm-4: stat failed: No such file or directory
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/dm-4 no longer valid for device(253,4)
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_1: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_1 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/dm-5: stat failed: No such file or directory
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/dm-5 no longer valid for device(253,5)
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1: stat failed: No such file or y
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1 no longer valid for devic)
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mirror_1_1917/mirror1: stat failed: No such file or directoy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mirror_1_1917/mirror1 no longer valid for device(253,5)
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mlog: stat failed: No such fily
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mlog no longer valid for )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/g
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_0: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_0 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/0
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_1: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_1 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mirror_1_1917/mirror1: stat failed: No such file or directoy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mirror_1_1917/mirror1 no longer valid for device(253,5)
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mlog: stat failed: No such fily
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mlog no longer valid for )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/g
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_0: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_0 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/0
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mapper/mirror_1_1917-mirror1_mimage_1: stat failed: No suchy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mapper/mirror_1_1917-mirror1_mimage_1 no longer valid )
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: /dev/mirror_1_1917/mirror1: stat failed: No such file or directoy
Jul 18 11:28:02 taft-03 lvm[4437]: Path /dev/mirror_1_1917/mirror1 no longer valid for device(253,5)
Jul 18 11:28:02 taft-03 lvm[4437]: Aborting - please provide new pathname for what used to be /dev/1
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0.
Jul 18 11:28:02 taft-03 lvm[4437]: Couldn't read all logical volumes for volume group mirror_1_1917.
Jul 18 11:28:02 taft-03 lvm[4437]: Volume group for uuid not found: 6LhjIRXJhdmCWuDAvqPCOCSjQTKTHwiA

[root@taft-03 cluster]# lvs
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  LV mirror3_mlog invalid: segment 2 should begin at LE 1 (found 0).
  Couldn't read all logical volumes for volume group mirror_1_1917.
  Volume group "mirror_1_1917" not found

Reproduced this while creating/deleting mirrors during recovery.

[root@link-01 ~]# rpm -q cmirror-kernel
cmirror-kernel-2.6.9-13.0

I assume cluster mirrors, and "recovery" == "resync"? Or does "recovery" mean you were killing nodes/disks?

recovery == killed a node in the cluster

Were you creating/deleting mirrors on all machines? Was the machine that was killed the one that was doing the operations?

This has been attributed to duplicate entries of the log device making their way into the LVM metadata. A fix has been put in to prevent such invalid data from being written to disk. However, the root cause is still unknown, so you will likely see the command fail (but no corruption).

The short-term fix is already upstream; it is targeted at RHEL4U5 for Red Hat packages.

This bug was a side effect of not activating the sub-LVs in the mirror before deleting them. As a result, the other members of the cluster did not know of the existence of the dm device, because it was not linked to by an LVM volume.

lvm_patches-rhel4-09082006/patches/activate_mirror_orphans_before_deactivate.patch

This bug could also be hit when device-mapper devices could not be removed because dmeventd was still waiting on them.

lvm_patches-rhel4-09082006/patches/device_registered.patch

Patches are awaiting integration.

This issue appears to be fixed with the latest rpms.

[root@link-08 ~]# rpm -q device-mapper
device-mapper-1.02.12-3
[root@link-08 ~]# rpm -q lvm2
lvm2-2.02.13-1
[root@link-08 ~]# rpm -q lvm2-cluster
lvm2-cluster-2.02.13-1
[root@link-08 ~]# rpm -q cmirror-kernel
cmirror-kernel-2.6.9-13.0

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2007-0287.html
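
For readers unfamiliar with the "segment 2 should begin at LE 1 (found 0)" message, the following is a minimal Python sketch of the kind of contiguity check that would produce it, and of how a duplicate log-device entry would trip it. It is an illustration only, not the lvm2 code or the actual patch: `Segment`, `check_segments`, and `refuse_if_invalid` are hypothetical names, and the assumption that each segment must start at the logical extent (LE) where the previous one ended is inferred from the wording of the error.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """One contiguous run of logical extents (LEs) in a logical volume."""
    start_le: int  # first logical extent covered by this segment
    length: int    # number of extents in this segment


def check_segments(lv_name: str, segments: List[Segment]) -> List[str]:
    """Return one message per segment that does not start where the
    previous segment ended (hypothetical helper, not an lvm2 function)."""
    errors = []
    expected_le = 0
    for number, seg in enumerate(segments, start=1):
        if seg.start_le != expected_le:
            errors.append(
                f"LV {lv_name} invalid: segment {number} should begin at "
                f"LE {expected_le} (found {seg.start_le})."
            )
        expected_le += seg.length
    return errors


def refuse_if_invalid(lv_name: str, segments: List[Segment]) -> None:
    """Raise instead of letting inconsistent metadata be written out.
    This mirrors the idea of the fix described above: the command
    fails, but nothing invalid reaches the disk."""
    errors = check_segments(lv_name, segments)
    if errors:
        raise ValueError("; ".join(errors))


# A one-extent mirror log entered twice: both entries claim LE 0.
duplicated_log = [Segment(start_le=0, length=1), Segment(start_le=0, length=1)]
for message in check_segments("mirror3_mlog", duplicated_log):
    print(message)
```

With a guard like `refuse_if_invalid` in front of the metadata write, the duplicate entry surfaces as a failed command rather than as a volume group that can no longer be read.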