Bug 2053642 - [RFE] Add support in multipathd to listen for FPIN-Li events and mark affected paths as marginal
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 9
Classification: Red Hat
Component: device-mapper-multipath
Version: 9.0
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: unspecified
Target Milestone: rc
Target Release: ---
Assignee: Ben Marzinski
QA Contact: Lin Li
Docs Contact: Kristina Slaveykova
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2022-02-11 16:59 UTC by Ben Marzinski
Modified: 2023-05-18 12:07 UTC

Fixed In Version: device-mapper-multipath-0.8.7-6.el9
Doc Type: Enhancement
Doc Text:
.`multipathd` now supports detecting FPIN-Li events
When you set the `marginal_pathgroups` config option to the new value `fpin`, you enable `multipathd` to monitor Link Integrity Fabric Performance Impact Notification (FPIN-Li) events and move paths with link integrity issues to a marginal pathgroup. With the `fpin` value set, `multipathd` overrides its existing marginal path detection methods and relies on the Fibre Channel fabric to identify link integrity issues. With this enhancement, `multipathd` becomes more robust in detecting marginal paths on Fibre Channel fabrics that can issue FPIN-Li events.
Clone Of:
Environment:
Last Closed: 2022-05-17 15:56:11 UTC
Type: Bug
Target Upstream Version:
Embargoed:
pm-rhel: mirror+




Links
System                  ID               Private  Priority  Status  Summary  Last Updated
Red Hat Issue Tracker   RHELPLAN-112056  0        None      None    None     2022-02-11 17:01:33 UTC
Red Hat Product Errata  RHBA-2022:3971   0        None      None    None     2022-05-17 15:56:30 UTC

Description Ben Marzinski 2022-02-11 16:59:24 UTC
Description of problem:
When link integrity issues are detected on a Fibre Channel fabric, a Link Integrity Fabric Performance Impact Notification (FPIN-Li) can be sent to a node. If multipathd listens for these events, it can use them to control a path's marginal status, instead of trying to detect marginal paths internally. The path will remain marginal until a registered state change notification (RSCN) or Link Up event is received.
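
For illustration only, the event handling described above can be modeled as a small state machine. This is a Python sketch of the described behavior, not multipathd code; the `Path` and `FabricEvent` names are hypothetical and exist only for this example.

```python
from dataclasses import dataclass
from enum import Enum, auto


class FabricEvent(Enum):
    """Events named in the description (hypothetical model, not a real API)."""
    FPIN_LI = auto()   # link integrity notification from the fabric
    RSCN = auto()      # registered state change notification
    LINK_UP = auto()   # port link came up


@dataclass
class Path:
    name: str
    marginal: bool = False

    def handle(self, event: FabricEvent) -> None:
        """Update the marginal flag according to the received event."""
        if event is FabricEvent.FPIN_LI:
            # The fabric reported a link integrity issue: mark the path marginal.
            self.marginal = True
        elif event in (FabricEvent.RSCN, FabricEvent.LINK_UP):
            # The path stays marginal until one of these events arrives.
            self.marginal = False


path = Path("sdb")
path.handle(FabricEvent.FPIN_LI)
print(path.marginal)  # True
path.handle(FabricEvent.RSCN)
print(path.marginal)  # False
```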

Comment 4 Ben Marzinski 2022-02-15 18:39:43 UTC
RHEL-9 packages with this fix are available for testing at:

http://people.redhat.com/~bmarzins/device-mapper-multipath/rpms/RHEL9/2053642/

Muneendra, it would be really helpful for getting this into RHEL-9.0 if you could test these before the end of next week.

Comment 8 Ben Marzinski 2022-02-21 16:35:24 UTC
Muneendra, just a ping to remind you that it would be very helpful to get this tested before the end of this week.

Comment 10 MUNEENDRA (Broadcom) 2022-02-22 10:44:15 UTC
Hi Benjamin,
I have installed the packages and tested them on RHEL9 Beta.
We injected the FPIN from the switch while the host was running traffic.
On the host, the affected paths and port_states were marked as marginal, and the traffic shifted to the active paths.
Things are working as expected.
From this testing we can conclude that the packages you sent work fine and are good to go.

Comment 13 Lin Li 2022-02-23 03:03:08 UTC
Move to verified according to comment 10 and comment 12.

Comment 15 errata-xmlrpc 2022-05-17 15:56:11 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (new packages: device-mapper-multipath), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:3971

Comment 17 Ewan D. Milne 2023-02-09 16:29:30 UTC
# man multipath.conf
defaults section 
     marginal_pathgroups
                        If set to off, the delay_*_checks, marginal_path_*, and san_path_err_* options will keep
                        marginal, or "shaky", paths from being reinstated until they  have  been  monitored  for
                        some time. This can cause situations where all non-marginal paths are down, and no paths
                        are usable until multipathd detects this and reinstates a marginal path. If  the  multi‐
                        path device is not configured to queue IO in this case, it can cause IO errors to occur,
                        even though there are marginal paths available.  However, if this option is set  to  on,
                        when  one  of the marginal path detecting methods determines that a path is marginal, it
                        will be reinstated and placed in a separate pathgroup that will only be used  after  all
                        the  non-marginal  pathgroups have been tried first. This prevents the possibility of IO
                        errors occurring while marginal paths are still usable. After the path has been monitored
                        for  the  configured  time,  and  is declared healthy, it will be returned to its normal
                        pathgroup.  If this option is set to fpin, multipathd will receive  fpin  notifications,
                        set  path  states to "marginal" accordingly, and regroup paths as described for on. This
                        option can't be used in combination with other options for "Shaky path  detection"  (see
                        below). Note: If this is set to fpin, the marginal_path_* and san_path_err_* options are
                        implicitly set to no. Also, this option cannot be switched either to or from fpin  on  a
                        multipathd reconfigure. multipathd must be restarted for the change to take effect.  See
                        "Shaky paths detection" below for more information.

                        The default is: off


# cat /etc/multipath.conf
defaults {
	user_friendly_names yes
	find_multipaths yes
	marginal_pathgroups fpin
}


# multipathd show config
defaults {
	marginal_pathgroups "off"
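
The two outputs above show `fpin` in /etc/multipath.conf but "off" in the running configuration. A minimal Python sketch for spotting such a mismatch follows. It assumes you pass in the text of the config file and the text printed by `multipathd show config`; the helper name `marginal_pathgroups` is hypothetical, and it only looks at the first occurrence of the option, so per-device overrides are ignored.

```python
import re
from typing import Optional

# Matches a line such as:  marginal_pathgroups fpin   or   marginal_pathgroups "off"
_OPTION = re.compile(r'^\s*marginal_pathgroups\s+"?([^"\s]+)"?\s*$', re.MULTILINE)


def marginal_pathgroups(text: str) -> Optional[str]:
    """Return the first marginal_pathgroups value found in text, or None."""
    match = _OPTION.search(text)
    return match.group(1) if match else None


# Sample inputs copied from the outputs in this comment.
configured = marginal_pathgroups("""defaults {
    user_friendly_names yes
    find_multipaths yes
    marginal_pathgroups fpin
}
""")
running = marginal_pathgroups("""defaults {
    marginal_pathgroups "off"
""")

if configured != running:
    print(f"mismatch: configured={configured} running={running}")
```

Note that the man page text quoted above says the option cannot be switched to or from fpin on a reconfigure, so a check like this is only meaningful after multipathd has been restarted.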

