Note: This bug is displayed in read-only format because
the product is no longer active in Red Hat Bugzilla.
RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Cause: Multipath wasn't checking if a pointer was NULL before dereferencing it.
Consequence: Occasionally, when the scsi layer deleted failed path devices, multipathd would crash
Fix: Multipath now checks if the pointer is NULL before dereferencing it.
Result: Multipath no longer crashes when the scsi layer removes path devices.
Description of problem:
When paths go down and are removed multipathd is crashing in find_slot()
Version-Release number of selected component (if applicable):
device-mapper-multipath-0.4.9-56.el6.x86_64
How reproducible:
Don't know.
Steps to Reproduce:
1. Don't know
2.
3.
Actual results:
segfault in find_slot()
Expected results:
No segfault
Additional info:
This report is coming in from Mike Christie at Fusion-io
Created attachment 641787[details]
check if the vector exists before dereferencing it.
This patch makes sure the the vector in find_slot is not NULL before dereferencing it.
We hit this bug by forcing paths to be added/deleted.
- Set dev_loss_tmo relativately low, so we can replciate it faster. Maybe 15-20 secs.
- Run IO test to dm-multipath device. Have dm multipath device setup with queue_if_no_path.
- Inject transport problem for dev_loss_tmo seconds, so the paths (/dev/sdXs) are deleted by the scsi layer, and so multipathd handles the removal by removing the path.
- Correct transport problem, so paths are added back by the scsi layer and multipathd.
- Repeat. We run this test for a several hours.
Comment 4RHEL Program Management
2012-12-14 08:50:18 UTC
This request was not resolved in time for the current release.
Red Hat invites you to ask your support representative to
propose this request, if still desired, for consideration in
the next release of Red Hat Enterprise Linux.
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory, and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.
http://rhn.redhat.com/errata/RHBA-2013-1574.html