Bug 1669403

Summary: With one target node down, on respinning the app pod, block device fails to mount
Product: [Red Hat Storage] Red Hat Gluster Storage Reporter: Rachael <rgeorge>
Component: kubernetes Assignee: Humble Chirammal <hchiramm>
Status: CLOSED CURRENTRELEASE QA Contact: Rachael <rgeorge>
Severity: high Docs Contact:
Priority: unspecified    
Version: ocs-3.11 CC: hchiramm, jmulligan, knarra, kramdoss, madam, ndevos, pasik, pkarampu, pprakash, prasanna.kalever, puebele, rcyriac, rhs-bugs, vbellur, xiubli
Target Milestone: --- Keywords: ZStream
Target Release: OCS 3.11.z Batch Update 3   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: oc v3.11.95 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 1669560 (view as bug list) Environment:
Last Closed: 2020-02-19 11:19:52 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1669560, 1680012    
Bug Blocks:    

Comment 6 Niels de Vos 2019-01-25 11:14:27 UTC
(In reply to Xiubo Li from comment #4)
...
> read
> /sys/class/iscsi_host/host415/device/session383/connection383:0/
> iscsi_connection/connection383:0/address: transport endpoint is not connected
> [...]
...

I do not have much experience with the Kubernetes iSCSI code, but it seems that the above error is not handled correctly at https://github.com/kubernetes/kubernetes/blob/master/pkg/volume/util/device_util_linux.go#L163

From my brief reading of the code, it should not return an error when reading the 'address' file (or one of the other attribute files) fails. Instead, it should continue with the loop and look at the next connection.

Humble, what do you think?

Comment 19 RamaKasturi 2019-03-28 14:09:52 UTC
Acking this for 3.11.3