Description of problem:
When fence_ilo fires, the server powers off but then remains powered off.

Version-Release number of selected component (if applicable):
This has been seen as recently as the fenced deployed with 5.4.

How reproducible:
We're seeing this on DL385 and DL385 G2. We have not yet tested on the DL580s we have.

Steps to Reproduce:
1. Set up a working cluster
2. Kernel panic the active node
3. Fence will power off the panicked node, but will not power it back on.

Actual results:
System is left powered off. This can be verified by connecting to the iLO via telnet/ssh and viewing the power status.

Expected results:
Server should power back on after power is turned off.

Additional info:
We've been resolving this on our end by editing the /sbin/fence_ilo script to do:

poweroff
poweron
poweron

Running poweron twice has fixed it. I have not tested putting a delay between off and on.

BTW, nice meeting the team at Summit. Like I said, we'll try to keep you in the loop on the bugs we see and work around in RHCS.

-Jason Nelson
Lead Linux Engineer
Rackspace Hosting
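
For illustration only, the workaround described above (power off, then repeat power-on until it takes) could be sketched as below. This is not the fence_ilo code. The `FlakyIlo` class, the `reboot_with_retry` function, and the `retries` and `delay` parameters are all hypothetical names, and the fake device simply assumes the first power-on request is silently ignored, which is one possible reading of the behaviour reported here.

```python
import time


class FlakyIlo:
    """Hypothetical stand-in for an iLO that ignores early power-on requests."""

    def __init__(self, ignored_power_ons=1):
        self.state = "on"
        self.ignored_power_ons = ignored_power_ons

    def power_off(self):
        self.state = "off"

    def power_on(self):
        # Assumption: the device drops the first N power-on requests.
        if self.ignored_power_ons > 0:
            self.ignored_power_ons -= 1
            return
        self.state = "on"

    def power_status(self):
        return self.state


def reboot_with_retry(power_off, power_on, power_status,
                      retries=2, delay=0.0, sleep=time.sleep):
    """Power off, then request power-on until the status reads "on".

    Returns the number of power-on requests that were needed.
    Raises RuntimeError if the node is still off after `retries` requests.
    """
    power_off()
    for attempt in range(1, retries + 1):
        sleep(delay)  # optional pause before each power-on request
        power_on()
        if power_status() == "on":
            return attempt
    raise RuntimeError("node still off after %d power-on requests" % retries)


if __name__ == "__main__":
    ilo = FlakyIlo()
    used = reboot_with_retry(ilo.power_off, ilo.power_on, ilo.power_status)
    print("power-on requests needed:", used, "final state:", ilo.power_status())
```

Checking the power status after each request, rather than blindly issuing poweron twice, would also show whether a delay between off and on is enough on its own, which has not been tested here.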
Can you take a look at bug #545682? I believe it is the same problem.
Ah, it does look like a duplicate. Is this going to be backported into earlier versions of RHEL 4/5?

-Jason Nelson
Lead Linux Engineer
Rackspace Hosting
@Jason: Do you mean:
* the 4.9 update? (yes)
* z-stream? (yes, already in 4.8.z)
* RHEL 5/6? (yes, though not cloned from the same bugs, since timing options were added for all fence agents)

*** This bug has been marked as a duplicate of bug 545682 ***