Bug 804784 - RHEL 6 online storage administration guide's section on remove_on_dev_loss is incorrect
Summary: RHEL 6 online storage administration guide's section on remove_on_dev_loss is...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Deadline: 2012-08-21
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: doc-Storage_Admin_Guide
Version: 6.2
Hardware: Unspecified
OS: Linux
unspecified
medium
Target Milestone: rc
: ---
Assignee: Jacquelynn East
QA Contact: ecs-bugs
URL:
Whiteboard:
Depends On:
Blocks: 1095007
TreeView+ depends on / blocked
 
Reported: 2012-03-19 18:50 UTC by dale_fowle
Modified: 2018-11-30 20:03 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 1095007 (view as bug list)
Environment:
Last Closed: 2013-02-26 05:04:28 UTC
Target Upstream Version:


Attachments (Terms of Use)
Screenshot of RHEL 6 chapter 24 remove_on_dev_loss statement (289.82 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2012-03-19 18:50 UTC, dale_fowle
no flags Details

Description dale_fowle 2012-03-19 18:50:02 UTC
Created attachment 571210 [details]
Screenshot of RHEL 6 chapter 24 remove_on_dev_loss statement

Description of problem:
RHEL 6 online storage administration guide's section on remove_on_dev_loss is incorrect. Chapter 24 on modifying link loss behavior seems to be incorrect. The documentation says that the remove_on_dev_loss is not recommended and is disabled by default, but it is in fact enabled by default.

Here's the proof that it is enabled:
According to a comment within the scsi_transport_fc kernel module, the dev_loss_tmo value now determines the device removal.

[root@cos-qa-tc06s08 scsi]# strings scsi_transport_fc.ko | grep remove
parm=dev_loss_tmo:Maximum number of seconds that the FC transport should insulate the loss of a remote port. Once this value is exceeded, the scsi target is removed. Value should be between 1 and SCSI_DEVICE_BLOCK_MAX_TIMEOUT if fast_io_fail_tmo is not set.
fc_remove_host
scsi_remove_target
sysfs_remove_link
transport_remove_device

I have a screenshot of the document in question that I should be able to attach to this bug.


Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 3 Jacquelynn East 2012-08-09 04:47:23 UTC
Hi Dale

Can you please indicate exactly what part says that it is disabled by default? I had a look at the section in the screenshot provided but could find nothing there to suggest that remove_on_dev_loss was disabled by default.

Comment 4 dale_fowle 2012-08-09 15:59:45 UTC
So, the default behavior within the code is to remove a device on dev_loss. 

The issue with the documentation is that it makes it seem like this is NOT the desirable nor recommended behavior. Please see the warning section in the screenshot that states the use of remove_on_dev_loss is not recommended.

The documentation conflicts with the actual behavior of the RHEL 6.2 OS.

In simple terms:
1. The OS does use remove_on_dev_loss without having to change anything. We can demonstrate this by pulling a cable on the target side of the fabric and waiting until the dev_loss_tmo timer has expired. At that time, the device(s) in question will be removed.
2. The documentation says using remove_on_dev_loss is not recommended. 

What I'm asking for is that the documentation be changed to reflect the actual behavior of the OS. 

I hope this helps.

Thanks,
Dale

Comment 7 Dominic Cleal 2012-08-23 11:40:58 UTC
(In reply to comment #3)
> Can you please indicate exactly what part says that it is disabled by
> default? I had a look at the section in the screenshot provided but could
> find nothing there to suggest that remove_on_dev_loss was disabled by
> default.

The docs don't explicitly say "disabled", but seem to indicate this behaviour is not the default and you can optionally choose it:

https://access.redhat.com/knowledge/docs/en-US/Red_Hat_Enterprise_Linux/6/html/Storage_Administration_Guide/modifying-link-loss-behavior.html

In RHEL 6, this module parameter isn't present (confirm with a "modinfo scsi_transport_fc" on RHEL 5 vs. 6) and this device removal behaviour is now the default as far as I can tell.  The huge warning therefore doesn't make sense given that this is on and the parameter is no longer available.

(not clearing NEEDINFO, I can't speak authoritatively on this)


Note You need to log in before you can comment on or make changes to this bug.