Created attachment 390277
screenshot from luci

Description of problem:

When managing my cluster with luci I commonly see the error:

"The ricci agent for this node is unresponsive. Node-specific information is not available at this time."

In the messages file I see:

Feb 11 11:15:37 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:52 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:57 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:38 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:43 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout

Version-Release number of selected component (if applicable):

RHEL 5.4
luci-0.12.2-6.el5_4.1
ricci-0.12.2-6.el5_4.1

How reproducible:

This problem is intermittent but very reproducible. The most common place I see it is when I add an existing cluster to luci, but it commonly appears when just clicking around the luci web site.

Steps to Reproduce:
1. Add an existing cluster to luci.
2. Click the different attributes of the cluster you just added.
3. Make some changes to the cluster config.

Actual results:

We see the error:

"The ricci agent for this node is unresponsive. Node-specific information is not available at this time."

In the messages file we see:

Feb 11 11:15:37 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:52 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:17:57 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:38 cs-rh5-1 luci[27985]: Error reading from cs-rh5-2.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:20:43 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout
Feb 11 11:21:21 cs-rh5-1 luci[27985]: Error connecting to cs-rh5-1.gsslab.rdu.redhat.com: Error reading from cs-rh5-1.gsslab.rdu.redhat.com:11111: timeout

Expected results:

Normal operation without errors.

Additional info:

This problem is intermittent but is readily reproducible. I see this problem all over the RHEL/Centos forums and the linux-cluster mailing list. An example is:

http://www.mail-archive.com/linux-cluster@redhat.com/msg07818.html

I also have several RHEL customers who are hitting this.
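
In case it helps with triage, here is a minimal sketch for tallying the luci timeout entries quoted above per node, so it is easier to see which ricci agent times out most often. The function name `count_ricci_timeouts` and the regular expression are my own and only assume the message format shown in this report (port 11111, "Error reading from ... timeout"); they are not part of luci or ricci.

```python
import re
from collections import Counter

# Assumption: entries look like the ones quoted in this report, e.g.
# "... luci[27985]: Error reading from <host>:11111: timeout"
# An "Error connecting to ..." prefix may precede "Error reading from".
TIMEOUT_RE = re.compile(
    r"luci\[\d+\]: .*?Error reading from (\S+):11111: timeout"
)

def count_ricci_timeouts(lines):
    """Return a Counter mapping node hostname -> number of timeout entries."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

It can be fed any iterable of log lines, for example `count_ricci_timeouts(open("/var/log/messages"))` on the luci host, assuming the messages file is readable by the user running it.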
*** This bug has been marked as a duplicate of bug 564490 ***