Bug 681144 - [NetApp 6.0.z Bug] DM-Multipath fails to update paths during IO with fabric faults
Summary: [NetApp 6.0.z Bug] DM-Multipath fails to update paths during IO with fabric faults
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: device-mapper-multipath
Version: 6.0
Hardware: All
OS: All
Priority: high
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: Ben Marzinski
QA Contact: Storage QE
URL:
Whiteboard:
Depends On:
Blocks: 684684
 
Reported: 2011-03-01 09:34 UTC by Rajashekhar M A
Modified: 2013-01-11 03:50 UTC
CC List: 15 users

Fixed In Version: device-mapper-multipath-0.4.9-39.el6
Doc Type: Bug Fix
Doc Text:
When a path was removed, the multipathd daemon did not always remove the path sysfs device from its cache. The daemon kept searching the cache for the device and created sysfs devices without the vecs lock held. Because of this, paths could have pointed to invalid sysfs devices and caused multipathd to crash. The multipathd daemon now always removes the sysfs device from cache when deleting a path and accesses the cache only with the vecs lock held.
Clone Of:
Environment:
Last Closed: 2011-05-19 14:13:01 UTC
Target Upstream Version:
Embargoed:


Attachments
Tarball with messages and multipath.conf file. (160.83 KB, application/x-gzip), uploaded 2011-03-01 09:38 UTC by Rajashekhar M A
syslog with test rpm messages (52.81 KB, application/x-gzip), uploaded 2011-03-07 09:04 UTC by Rajashekhar M A
syslog and command logs with 681144.2 test rpms and directio (312.20 KB, application/x-gzip), uploaded 2011-03-08 12:32 UTC by Rajashekhar M A


Links
Red Hat Product Errata RHBA-2011:0725 (normal, SHIPPED_LIVE): device-mapper-multipath bug fix and enhancement update, last updated 2011-05-19 09:37:12 UTC

Description Rajashekhar M A 2011-03-01 09:34:50 UTC
Description of problem:

When fabric faults are run along with IO on a RHEL 6.0 Errata host, the multipathd daemon intermittently fails to update paths.

The multipath -ll output looks like the following in such a scenario:

# multipath -ll /dev/sdce
360a98000486e2f65686f6246516e6859 dm-22 NETAPP,LUN
size=5.0G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
|-+- policy='round-robin 0' prio=50 status=active
| |- 3:0:1:19 sdce 69:32  active ready running
| `- 2:0:1:19 sdca 68:224 active ready running
|-+- policy='round-robin 0' prio=10 status=enabled
| `- 2:0:0:19 sdaj 66:48  active ready running
`-+- policy='round-robin 0' prio=10 status=enabled
  `- 3:0:0:19 sdao 66:128 failed ready running
#


Version-Release number of selected component (if applicable):

device-mapper-multipath: 0.4.9-31.el6_0.2
device-mapper: 1.02.53-8.el6_0.4
kernel: 2.6.32-71.18.1.el6

How reproducible:
Frequent.

Steps to Reproduce:
1. Map 20 LUNs with 4 paths each.
2. Create 7 LVs striped across the 20 dm-multipath devices.
3. Create an fs and start IO to the LVs.
4. Run fabric/controller faults repeatedly.
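
A rough sketch of what steps 2 and 3 could look like (the volume group name, LV size, mount point, and IO load below are placeholders; the WWID glob assumes the 20 LUNs share the 360a98000486e2f65686f6246 prefix seen in this report; the step 4 faults are injected on the fabric/controller side):

# Create a striped LV across the 20 dm-multipath devices, put a
# filesystem on it, and start direct IO (repeat for all 7 LVs).
pvcreate /dev/mapper/360a98000486e2f65686f6246*
vgcreate netapp_vg /dev/mapper/360a98000486e2f65686f6246*
lvcreate -i 20 -I 64 -L 5G -n stripe_lv1 netapp_vg
mkfs.ext4 /dev/netapp_vg/stripe_lv1
mkdir -p /mnt/stripe_lv1
mount /dev/netapp_vg/stripe_lv1 /mnt/stripe_lv1
dd if=/dev/zero of=/mnt/stripe_lv1/io.dat bs=1M count=4096 oflag=direct &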


Additional info:

I tried with the tur path checker as well, and hit a different issue where one of the paths was not added to the map. The multipath -ll output looks like the following:

# multipath -ll 360a98000486e2f65686f6246516f4468
360a98000486e2f65686f6246516f4468 dm-22 NETAPP,LUN
size=5.0G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
|-+- policy='round-robin 0' prio=50 status=active
| |- 2:0:1:20 sdcf 69:48  active ready running
| `- 3:0:1:20 sdcd 69:16  active ready running
`-+- policy='round-robin 0' prio=10 status=enabled
  `- 3:0:0:20 sdbi 67:192 active ready running

In syslog, we see messages like the following:

Mar  1 05:54:45 IBMx3250-200-178 multipathd: sdbk: add path (uevent)
Mar  1 05:54:45 IBMx3250-200-178 multipathd: sdbk: failed to get parent
Mar  1 05:54:45 IBMx3250-200-178 multipathd: sdbk: failed to store path info
Mar  1 05:54:45 IBMx3250-200-178 multipathd: uevent trigger error

But the device /dev/sdbk seems to be fine:

#
# ls /dev/sdbk
/dev/sdbk
# sg_turs -v /dev/sdbk
    test unit ready cdb: 00 00 00 00 00 00
#

Comment 1 Rajashekhar M A 2011-03-01 09:38:16 UTC
Created attachment 481576 [details]
Tarball with messages and multipath.conf file.

Attached is a tarball with the full /var/log/messages files (for both the tur and directio path_checker scenarios). The tarball also contains the multipath.conf file.

Comment 3 Ben Marzinski 2011-03-03 18:37:00 UTC
With directio, are you sure that you don't see this on RHEL-6.0 installs?  Also, do you know if the path issue clears up within a couple of minutes? directio is an asynchronous checker; it won't actually fail a path until the IO returns an error or times out.

Could you try adding something like

fast_io_fail_tmo 5

to the defaults section? This will make sure that IO to a failed device is returned to directio after 5 seconds of waiting, which should make directio a lot more responsive.
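
For reference, a minimal sketch of what that could look like in /etc/multipath.conf (only the new setting is shown; your existing defaults entries stay as they are):

defaults {
        # Fail outstanding IO to a lost remote port after 5 seconds instead
        # of waiting for dev_loss_tmo, so the asynchronous directio checker
        # gets its IO back quickly.
        fast_io_fail_tmo 5
}

Then reload the configuration, for example with multipathd -k"reconfigure".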

The tur issue makes much more sense as a regression.  I did add something that could make you more likely to see "failed to get parent" messages.  Multipath was caching that sysfs information forever, even if the device was deleted. This was causing a memory leak.  Now multipath frees up that information when a device is removed.

However, that information should be available when you are adding a path. The important thing to look for is whether the proper sysfs directories exist.

To get the sysfs device, multipathd checks

/sys/block/<devname>

so for your setup it would check /sys/block/sdbk

This should be a symlink to a directory. When you reproduce this, can you please post what /sys/block/<devname> is a symlink to? This directory appears to exist; otherwise, you would have failed in common_sysfs_pathinfo() before you ever had a chance to try to get the parent device.

For example, on my system I see:

[root@ask-07 ~]# ls -l /sys/block/sdb
lrwxrwxrwx. 1 root root 0 Jan 19 06:21 /sys/block/sdb -> ../devices/pci0000:00/0000:00:0a.0/0000:06:00.0/host8/rport-8:0-0/target8:0:0/8:0:0:0/block/sdb

and

/sys/devices/pci0000:00/0000:00:0a.0/0000:06:00.0/host8/rport-8:0-0/target8:0:0/8:0:0:0/block/sdb

exists. As long as this directory starts with "/sys/devices", the parent should simply be the sysfs device path with the last directory chopped off. So on my system, it is:

/sys/devices/pci0000:00/0000:00:0a.0/0000:06:00.0/host8/rport-8:0-0/target8:0:0/8:0:0:0/block

or, if that is a link (it's not for me), whatever that link points to. If the last element is "block" (which, as you can see above, it is for me), multipath grabs the parent of this directory. For me this is:

/sys/devices/pci0000:00/0000:00:0a.0/0000:06:00.0/host8/rport-8:0-0/target8:0:0/8:0:0:0

So unless your scsi sysfs devices are set up a lot differently than mine, it appears that you are able to access a sysfs directory but not its parent, unless, of course, the directory has actually been removed while this is happening. I will work on some test packages that print out all of these paths when multipathd fails to get a sysfs device. Please check whether those sysfs directories exist when you see this issue.
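
If it helps, here is a quick sketch of checking that chain by hand when the error appears (assuming sdbk is the affected device, as in the logs above):

# Resolve the /sys/block symlink to the real sysfs device directory, then
# check that the directory itself, its ".../block" parent, and the scsi
# device directory above that all still exist.
DEV=$(readlink -f /sys/block/sdbk)
echo "$DEV"
ls -d "$DEV" "$(dirname "$DEV")" "$(dirname "$(dirname "$DEV")")"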

Comment 4 Ben Marzinski 2011-03-03 21:38:46 UTC
There are debug packages available at:

http://people.redhat.com/bmarzins/device-mapper-multipath/rpms/RHEL6/x86_64/

and

http://people.redhat.com/bmarzins/device-mapper-multipath/rpms/RHEL6/i686/

These will print out a lot more information when you get the "failed to get parent" messages.

Comment 5 Rajashekhar M A 2011-03-07 09:04:39 UTC
Created attachment 482629 [details]
syslog with test rpm messages

I could reproduce the bug with the test rpms (used tur as path_checker):

# multipath -ll /dev/sdbk
360a98000486e2f65686f6246516f4468 dm-9 NETAPP,LUN
size=5.0G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
|-+- policy='round-robin 0' prio=50 status=active
| |- 2:0:1:20 sdca 68:224 active ready running
| `- 3:0:1:20 sdcf 69:48  active ready running
`-+- policy='round-robin 0' prio=10 status=enabled
  `- 2:0:0:20 sdbh 67:176 active ready running
#

Here are the messages which I see in syslog:

Mar  6 02:25:42 IBMx3250-200-178 multipathd: sdbk: add path (uevent)
Mar  6 02:25:42 IBMx3250-200-178 multipathd: sysfs dev  has no parent
Mar  6 02:25:42 IBMx3250-200-178 multipathd: sdbk: failed to get parent
Mar  6 02:25:42 IBMx3250-200-178 multipathd: device: /devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block/sdbk
Mar  6 02:25:42 IBMx3250-200-178 multipathd: device:
Mar  6 02:25:42 IBMx3250-200-178 multipathd: sdbk: failed to store path info
Mar  6 02:25:42 IBMx3250-200-178 multipathd: uevent trigger error

But I checked the directories, and they do exist:

# ls -l /sys/block/sdbk
lrwxrwxrwx. 1 root root 0 Mar  6 02:36 /sys/block/sdbk -> ../devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block/sdbk
# cd /sys/devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block/sdbk
# ls
alignment_offset  capability  device             ext_range  inflight  queue  removable  size    stat       trace
bdi               dev         discard_alignment  holders    power     range  ro         slaves  subsystem  uevent
# pwd
/sys/devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block/sdbk
# cd ../
# ls
sdbk
# pwd
/sys/devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block
#

The other map where we see three paths is 360a98000486e2f65686f6246516f4168:

# multipath -ll 360a98000486e2f65686f6246516f4168
360a98000486e2f65686f6246516f4168 dm-7 NETAPP,LUN
size=5.0G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
|-+- policy='round-robin 0' prio=50 status=active
| |- 2:0:1:19 sdby 68:192 active ready running
| `- 3:0:1:19 sdce 69:32  active ready running
`-+- policy='round-robin 0' prio=10 status=enabled
  `- 2:0:0:19 sdbf 67:144 active ready running
#

Attached is the full syslog when the bug got reproduced.

Comment 6 Ben Marzinski 2011-03-08 07:11:16 UTC
The issue is that:
"sysfs dev  has no parent" should have a path after dev.  For some reason, the path has been cleared out. Same with the second device line in

Mar  6 02:25:42 IBMx3250-200-178 multipathd: device:
/devices/pci0000:00/0000:00:03.0/0000:06:00.1/host3/rport-3:0-2/target3:0:0/3:0:0:20/block/sdbk
Mar  6 02:25:42 IBMx3250-200-178 multipathd: device:

Somehow, the path part of your sysfs device structure is being cleared out. This seems pretty odd. My best guess is that the sysfs device structure is getting deleted while it is still in the cache, and part of the memory is getting reused. But I can't find anywhere that could happen.

There are two new sets of packages.  They are at the same location as before

bz681144.2 just adds some more printout messages, so I can see better how the sysfs device is getting set up.

bz681144.3 removes the part of the last zstream commit that had to do with sysfs devices.  I can't see why it should cause this to fail, but if removing it fixes the problem, then that's where the problem must be.

It would be great if you could try both: see whether bz681144.3 fixes the issue, and post the output from bz681144.2.

Comment 7 Rajashekhar M A 2011-03-08 08:06:22 UTC
We will update the bz once we collect more data from new test RPMs (bz681144.2).

We have already tested with the GA version of device-mapper-multipath (0.4.9-31.el6) and also the first errata release (0.4.9-31.el6_01). We did not hit the issue with either.

The problem seems to be only with the latest package, i.e., 0.4.9-31.el6_02.

Comment 8 Rajashekhar M A 2011-03-08 12:32:58 UTC
Created attachment 482890 [details]
syslog and command logs with 681144.2 test rpms and directio

Attached is a zip file with full syslog messages and a few command outputs when the bug got reproduced (this time with directio) with the test rpms.

Comment 9 Martin George 2011-03-08 12:34:54 UTC
(In reply to comment #3)
> With directio, are you sure that you don't see this on RHEL-6.0 installs? 

The [failed][ready][running] problem with directio is seen only with the latest multipath errata package, i.e., 0.4.9-31.el6_02, and is not seen with previous packages such as 0.4.9-31.el6 and 0.4.9-31.el6_01. So it does look like a regression affecting directio as well.

> Also, do you know if the path issues clears up in a couple of minutes. 

No. The path remains in the same [failed][ready][running] status and never recovers, even though it is actually online (i.e., TUR is successful on this path).

Comment 11 Ben Marzinski 2011-03-08 14:29:31 UTC
I'd still really like you to test with 681144.3, even though you already tested with the earlier zstream packages, since there were two memory leaks that got closed in the latest package, and most of the code was for one that didn't affect the sysfs parent devices. If you can still reproduce the tur checker error with 681144.3, then somehow the code to fix the other memory leak is messing with the parent sysfs device cache. Also, I assume that the directio bug has to do with the other memory leak, and if you can still reproduce it with 681144.3, then I'll know for sure.

Thanks for getting the 681144.2 test data back so quickly.

Comment 12 Ben Marzinski 2011-03-08 16:47:08 UTC
I'm working on a test package that should hopefully fix the tur issue. I don't think it will fix the directio issue, but I don't understand exactly what's happening with that issue yet, so it's possible.

Comment 13 Ben Marzinski 2011-03-08 21:14:30 UTC
There are new packages available that should fix the tur issue.  They may fix the directio issue as well. These are the 681144.4 packages, available at the same location as before.

Comment 16 Ben Marzinski 2011-03-09 18:20:15 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
Multipathd was not always removing a path's sysfs device from cache when the path was removed. Also, it was searching the cache and creating sysfs devices without the vecs lock held. Because of this, paths would occasionally have invalid sysfs devices, causing multipathd crashes and other errors. Multipathd now always removes the sysfs device from cache when deleting the path, and it only accesses the cache with the vecs lock held.

Comment 18 Rajashekhar M A 2011-03-10 11:29:58 UTC
> I'd still really like you to test with 681144.3...

We have tested with the 681144.3 rpms, and we did not hit the issue.

> There are new packages available that should fix the tur issue.  They may fix
> the directio issue as well. These are the 681144.4 packages, available at the
> same location as before.

We are currently testing the 681144.4 set of rpms and will update the bugzilla with our results once we are done.

Comment 19 Rajashekhar M A 2011-03-11 11:02:14 UTC
Our tests showed that the 681144.4 set of RPMs fixes the issue.

Comment 21 Chris Ward 2011-04-06 11:05:49 UTC
~~ Partners and Customers ~~

This bug was included in RHEL 6.1 Beta. Please confirm the status of this request as soon as possible.

If you're having problems accessing 6.1 bits, are delayed in your test execution or find in testing that the request was not addressed adequately, please let us know.

Thanks!

Comment 22 Eva Kopalova 2011-05-02 13:53:58 UTC
    Technical note updated. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    Diffed Contents:
@@ -1 +1 @@
-Multipathd was not always removing a path's sysfs device from cache when the path was removed. Also, it was searching the the cache and creating sysfs devices without the vecs lock held. Because of this paths would occasionally have invalid sysfs devices, causing multipathd crashes and other errors.  Multipathd now always removes the sysfs device from cache when deleting the path, and it only accesses the cache with the vecs lock held.+When a path was removed, the multipathd daemon did not always remove the path sysfs device from its cache. The daemon kept searching the cache for the device and created sysfs devices without the vecs lock held. Because of this, paths could have pointed to invalid sysfs devices and caused multipathd to crash. The multipathd daemon now always removes the sysfs device from cache when deleting a path and accesses the cache only with the vecs lock held.

Comment 23 errata-xmlrpc 2011-05-19 14:13:01 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2011-0725.html

