Bug 563923 - Luci times out when managing cluster nodes, "Error reading from mynode.name.com:11111: timeout."
Summary: Luci times out when managing cluster nodes, "Error reading from mynode.name.c...
Keywords:
Status: CLOSED DUPLICATE of bug 564490
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: conga
Version: 5.4
Hardware: All
OS: Linux
low
medium
Target Milestone: rc
: ---
Assignee: Ryan McCabe
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2010-02-11 15:31 UTC by Ben Turner
Modified: 2010-02-15 14:47 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2010-02-15 14:47:27 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
screenshot from luci (2.76 MB, image/png)
2010-02-11 15:31 UTC, Ben Turner
no flags Details

Description Ben Turner 2010-02-11 15:31:03 UTC
Created attachment 390277 [details]
screenshot from luci

Description of problem: When managing my cluster with luci I commonly see the error:

"The ricci agent for this node is unresponsive. Node-specific information is not available at this time."

In the messages file I see:

Feb 11 11:15:37 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:52 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:57 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:38 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:43 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout

Version-Release number of selected component (if applicable):

RHEL 5.4 
luci-0.12.2-6.el5_4.1
ricci-0.12.2-6.el5_4.1

How reproducible: This problem is intermittent but very reproducible.  The most common place I see it is when I add an existing cluster to luci, but it commonly appears when just clicking around the luci web site.

Steps to Reproduce:
1. Add an existing cluster to luci.
2. Click the different attributes of the cluster you just added.
3. Make some changes to the cluster config.
  
Actual results:

We see the error:

"The ricci agent for this node is unresponsive. Node-specific information is not available at this time."

In the messages file we see:

Feb 11 11:15:37 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:52 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:57 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:38 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:43 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout

Expected results:

Normal operation without errors.

Additional info:

This problem is intermittent but is readily reproducible.  I see this problem all over the RHEL/Centos forums and the linux-cluster mailing list.  An example is:

http://www.mail-archive.com/linux-cluster@redhat.com/msg07818.html

I also have several RHEL customers who are hitting this.

Comment 2 Ben Turner 2010-02-15 14:47:27 UTC

*** This bug has been marked as a duplicate of bug 564490 ***


Note You need to log in before you can comment on or make changes to this bug.