RHQ 4.0.0.CR I turned off an agent monitoring a platform and see this in the server log: 2011-04-30 22:15:35,232 INFO [org.rhq.enterprise.server.core.AgentManagerBean] Agent with name [i-b21d37dd] just went down It was then 15minutes before any of the resources on the platform started showing red? a) I thought we were back filling after 10mins? b) If we now the agent went down, don't we want to backfill more aggressively than normal
Created attachment 496017 [details] platform monitoring tab
This has basically been implemented in the jshaughn/avail branch. Backfilling has been dropped to 5 minutes and also, graceful agent shutdown no longer depends on suspect job detection, the backfilling will be performed immediately. See: http://rhq-project.org/display/RHQ/Design-Availability+Checking#Design-AvailabilityChecking-DesignandChanges For more on planned avail changes.
This is in Master.
Bulk closing of items that are on_qa and in old RHQ releases, which are out for a long time and where the issue has not been re-opened since.