Bug 476941

Summary: Failure to restore the failed paths on restoring the array controller
Product: Red Hat Enterprise Linux 5
Reporter: vijay <vijayakumar>
Component: device-mapper-multipath
Assignee: Ben Marzinski <bmarzins>
Status: CLOSED DUPLICATE
QA Contact: Cluster QE <mspqa-list>
Severity: medium
Priority: low
Version: 5.1.z
CC: agk, bmarzins, bmr, bzeranski, christophe.varoqui, dwysocha, egoggin, heinzm, iannis, junichi.nomura, kueda, lmb, mbroz, prockai, senthil-kumar.veluswamy, tranlan
Target Milestone: rc   
Target Release: ---   
Hardware: All   
OS: Linux   
Doc Type: Bug Fix
Last Closed: 2010-07-27 19:27:45 UTC

Description vijay 2008-12-18 06:46:42 UTC
Description of problem:
On controller failure, I/O fails over to the available paths and the
SAS devices corresponding to the failed paths are removed. On re-enabling the
controller, not all of the failed paths fail back.
A subsequent failure of the other controller then results in an all-paths-failed
condition (since the paths that failed in the previous step were never restored),
and read/write errors are observed for the LUNs.

Version-Release number of selected component (if applicable):
 Array: MSA 2000 with SAS connectivity (MSA2012sa), dual controller
 Array firmware: J300p10
 mptsas driver version: mptlinux-3.12.14.00-2
 Number of luns presented to host: 15

How reproducible:
Frequently

Steps to Reproduce:
1. Fail one of the array controllers. I/O fails over to the available paths.
2. Re-enable the failed array controller. Not all of the failed paths are restored.
3. Fail the other array controller.
4. Read/write errors on the SAS devices are observed.
  
Actual results:
On enabling the failed array controller, not all of the failed paths fail back.

Expected results:
On enabling the failed array controller, all of the failed paths should be re-enabled.

Additional info:
Under heavy load, the uevents that DM generates to add/remove the maps/paths are
delayed. A few of the uevents are lost, which results in the failure to fail back
the failed paths.
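
To make the sequence above concrete, here is a rough toy model of the four reproduction steps. It is only a sketch and not how multipathd or the kernel is implemented: the path names, the two-paths-per-controller layout, and the `lost_events` parameter are all hypothetical and were not taken from the affected system. It only shows why a path whose "add" event never arrives leaves the LUN with no usable path once the second controller fails.

```python
# Toy model only: path names, controller labels and the "lost event"
# mechanism are hypothetical and are not taken from the bug report.
from dataclasses import dataclass


@dataclass
class Path:
    name: str
    controller: str
    active: bool = True


def fail_controller(paths, controller):
    """Mark every path behind the given controller as failed."""
    for p in paths:
        if p.controller == controller:
            p.active = False


def restore_controller(paths, controller, lost_events=()):
    """Reinstate the paths behind a controller, except those whose
    'add' event is modelled as lost (names listed in lost_events)."""
    for p in paths:
        if p.controller == controller and p.name not in lost_events:
            p.active = True


def active_paths(paths):
    """Return the names of the paths that can currently carry I/O."""
    return [p.name for p in paths if p.active]


def run_scenario(lost_events=()):
    # One LUN with two paths through each controller (assumed layout).
    paths = [Path("a1", "A"), Path("a2", "A"),
             Path("b1", "B"), Path("b2", "B")]
    fail_controller(paths, "A")                  # step 1: fail controller A
    restore_controller(paths, "A", lost_events)  # step 2: re-enable A
    fail_controller(paths, "B")                  # step 3: fail controller B
    return active_paths(paths)                   # step 4: empty list means I/O errors
```

Calling `run_scenario()` with no lost events leaves the controller A paths available after step 3, while `run_scenario(lost_events=("a1", "a2"))` returns an empty list, which corresponds to the read/write errors in step 4.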


Thanks!

Comment 2 Ben Marzinski 2010-07-27 19:27:45 UTC
This bug appears to have been solved by bz 500729.  If you are still able to reproduce this using kernel-2.6.18-162.el5 or a more recent kernel, please reopen this bug.

*** This bug has been marked as a duplicate of bug 500729 ***

Comment 3 Red Hat Bugzilla 2023-09-14 01:14:54 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days