Bug 169774 - fence_wti status check fails
Summary: fence_wti status check fails
Keywords:
Status: CLOSED WORKSFORME
Alias: None
Product: Red Hat Cluster Suite
Classification: Retired
Component: fence
Version: 4
Hardware: i386
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Ryan O'Hara
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2005-10-03 14:37 UTC by Lucas Tepper
Modified: 2009-04-16 20:27 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2007-01-05 21:02:50 UTC
Embargoed:


Attachments (Terms of Use)

Description Lucas Tepper 2005-10-03 14:37:00 UTC
Description of problem:
Problem with fence_wti

Version-Release number of selected component (if applicable):
fence-1.32.1-0

How reproducible:
After removing the network from one of the nodes, the other node is trying to
fence the failed node. This succeeds, after that the status of the fence is
checked and the following appears in messages file:

Sep 29 14:59:45 dbn42 fenced[5280]: fencing node "dbn41.mz.local"
Sep 29 14:59:48 dbn42 fenced[5280]: agent "fence_wti" reports: failed: plug not
off ()
Sep 29 14:59:48 dbn42 fenced[5280]: fence "dbn41.mz.local" failed

We already applied the change mentioned in bugzilla#162805. It looks like there
goes something wrong in the loop that checks the status of the ports (seems to
be running this loop only one time!). Only one line is of the status output is
analysed. We are using a WTI NPS-230.

Steps to Reproduce:
1.
2.
3.
  
Actual results:
Status check of the port fails and keeps running in a loop. The failed cluster
resources are not brought online on the other node.

Expected results:


Additional info:

Comment 2 Jim Parsons 2005-11-22 13:33:08 UTC
looking for hardware to test on.

Comment 6 Jim Parsons 2006-12-20 14:09:40 UTC
Ryan - can you try and reproduce this, please? I have never been able to, with
the wti switch here in Westford...I don't know if you guys have this exact model
of WTI product in msp...

Comment 7 Ryan O'Hara 2007-01-05 18:33:17 UTC
I'm attempting to reproduce this with a WTI NPS-115 that I have setup in the lab
here. It appears to be identical to the NPS-230 except that it is 115 VAC,
wheras the NPS-230 is 230 VAC.

Comment 8 Ryan O'Hara 2007-01-05 21:02:50 UTC
I have not been able to reproduce this on an NPS-115, which should be identical
to the NPS-230 except for VAC (mentioned above).

Two things to mention:

1. The WTI NPS devices have a configurable "boot delay" which is used to set the
time between the off/on of a reboot operation. It seems possible that boot delay
could be set small enough that by the time we check the status (expecting "off")
we see that it is "on". However, I tried this with the lowest possible setting
(1 sec) and it did not occur. Even if this were the case, I would have expected
the log messages to suggest that is fould the state to be "ON". In the log
messages above, it appears the state is unknown.

2. There may be a very slight change in the console format due to different
firmware revisions. It appears that the switch I was testing with has version 2.00.

I am closing this since I am unable to reproduce this bug. If this continues to
be a problem, please reopen this bug with addition information, including the
firmware version, etc.




Note You need to log in before you can comment on or make changes to this bug.