Bug 844493
| Summary: | Various dm_task_run failures cause many continuing problems in LVM | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | Jonathan Earl Brassow <jbrassow> |
| Component: | lvm2 | Assignee: | Peter Rajnoha <prajnoha> |
| Status: | CLOSED ERRATA | QA Contact: | cluster-qe <cluster-qe> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 6.3 | CC: | agk, cmarthal, dwysocha, heinzm, jbrassow, mcsontos, msnitzer, prajnoha, prockai, thornber, zkabelac |
| Target Milestone: | rc | Keywords: | Reopened |
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | lvm2-2.02.140-1.el6 | Doc Type: | Bug Fix |
| Doc Text: | Story Points: | --- | |
| Clone Of: | 813954 | Environment: | |
| Last Closed: | 2016-05-11 01:19:39 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 619574, 624148 | ||
|
Description
Jonathan Earl Brassow
2012-07-30 22:19:15 UTC
This bug is the RHEL6 equivalent to bug 813954 - a bug designed to encapsulate all the issues that revolve around 'dm_task_run' errors. Again, the 'dm_task_run' issues should be solved before these other dependent bugs. This request was not resolved in time for the current release. Red Hat invites you to ask your support representative to propose this request, if still desired, for consideration in the next release of Red Hat Enterprise Linux. I believe there are no random dm_task_run errors anymore with current code. So I'm closing this BZ - if they appear - reopen this BZ with trace. This bug is not fixed: https://bugzilla.redhat.com/show_bug.cgi?id=624148#c27 This doesn't look like failure of dm_task_run() - but rather lvm2 mirror code is doing bad things here. ioctl is getting signal to wakeup from some kernel event - so not really a fault of dm_task_run. These don't just happen during mirror operations though. Lots of lvm operations can cause these messages to show up. May 30 16:52:54 qalvm-01 qarshd[1908]: Running cmdline: lvremove -f /dev/snapper_thinp/snap1 May 30 16:54:12 qalvm-01 lvm[1149]: Monitoring snapshot snapper_thinp-snap2 May 30 16:54:12 qalvm-01 systemd-udevd[1916]: inotify_add_watch(7, /dev/dm-14, 10) failed: No such file or directory May 30 16:54:12 qalvm-01 systemd-udevd[1919]: inotify_add_watch(7, /dev/dm-12, 10) failed: No such file or directory May 30 16:54:12 qalvm-01 qarshd[1953]: Running cmdline: dmsetup ls May 30 16:54:12 qalvm-01 qarshd[1955]: Running cmdline: ls /dev/snapper_thinp/snap1 May 30 16:54:12 qalvm-01 lvm[1149]: Logical volume snap1 not found in volume group snapper_thinp May 30 16:54:12 qalvm-01 lvm[1149]: Failed to extend snapshot snapper_thinp-snap1. May 30 16:54:12 qalvm-01 lvm[1149]: dm_task_run failed, errno = 6, No such device or address May 30 16:54:12 qalvm-01 lvm[1149]: snapper_thinp-snap1 disappeared, detaching May 30 16:54:12 qalvm-01 lvm[1149]: No longer monitoring snapshot snapper_thinp-snap1 May 30 16:54:14 qalvm-01 lvm[1149]: No longer monitoring snapshot snapper_thinp-snap2 I believe another one related to Bug #1108540 - where dmeventd was incorrectly unmonitoring devices. This is fixed with lvm2 2.02.112. (In reply to Zdenek Kabelac from comment #8) > I believe another one related to Bug #1108540 - where dmeventd was > incorrectly unmonitoring devices. > > This is fixed with lvm2 2.02.112. Well, the bug #1108540 had "Device or resource busy" error, while this bug (and bug #813954) has "Invalid argument" error issued. So it seems a bit different. Anyway, would be good to see if the other error (Invalid argument) is still reproducible... (In reply to Peter Rajnoha from comment #9) > (In reply to Zdenek Kabelac from comment #8) > > I believe another one related to Bug #1108540 - where dmeventd was > > incorrectly unmonitoring devices. > > > > This is fixed with lvm2 2.02.112. > > Well, the bug #1108540 had "Device or resource busy" error, while this bug > (and bug #813954) has "Invalid argument" error issued. So it seems a bit > different. (moving back to NEW as this is not yet resolved - the error described in this report differs) Can we insert an Internal Error to dmeventd which is failing here? Then QEs will have to run tests with abort_on_internal_errors set and provide the coredump should it happen again. We don't have a concrete reproducer for now, marking with "devel cond nak reproducer" for now. Also, there were lots of fixes in dmeventd - it's probably worth trying the new lvm2 release first once we do the new build for 6.8. AFAIK this bug should be closed and reopened when something new appears. We've fixed large list of bugs - e.g. dmeventd was breaking its own processed lvm2 command with SIGALRM causing unexpected ioctl failures if the timer expired while command has been processed. I'm considering this BZ as solved with release 2.02.133. Marking verified (SanityOnly) in the latest rpms. 2.6.32-615.el6.x86_64 lvm2-2.02.141-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 lvm2-libs-2.02.141-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 lvm2-cluster-2.02.141-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 udev-147-2.71.el6 BUILT: Wed Feb 10 07:07:17 CST 2016 device-mapper-1.02.115-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 device-mapper-libs-1.02.115-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 device-mapper-event-1.02.115-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 device-mapper-event-libs-1.02.115-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 device-mapper-persistent-data-0.6.2-0.1.rc1.el6 BUILT: Wed Feb 10 09:52:15 CST 2016 cmirror-2.02.141-2.el6 BUILT: Wed Feb 10 07:49:03 CST 2016 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2016-0964.html |