I am seeing the error below. One more agent (not 172.31.7.7) went down and I stopped an JBossAS in inventory by pressing Ctrl-C on it. Don't recall what else I did to end up in that situation. 16:58:07,897 INFO [DiscoveryServerServiceImpl] Error processing availability report from [172.31.7.7]: javax.ejb.EJBException:javax.persistence.PersistenceException: org.hibernate.PropertyValueException: not-null property references a null or transient value: org.rhq.core.domain.measurement.Availability.resource -> javax.persistence.PersistenceException:org.hibernate.PropertyValueException: not-null property references a null or transient value: org.rhq.core.domain.measurement.Availability.resource -> org.hibernate.PropertyValueException:not-null property references a null or transient value: org.rhq.core.domain.measurement.Availability.resource
Restarting the agent fixed this.
will close RHQ-385 as a duplicate, but this was seen by a customer.
I think this is critical, as I have seen that again in a customer log - unfortunately I did not find this issue while working on the case and now I don't find the case anymore :-/
Crap, raising the priority set the resolution to 'fixed'
Clearing the resolution.
setting this to blocker since its duplicate RHQ-385 was also set to blocker. i have a feeling that this issue is resolved by some of the new sync code. i haven't seen this locally in a few months, and we've only had this reported from customers. if this bug does in fact still exist, the HA testing should shake it out. if it doesn't, then i would consider this issue dated and close as "can not reproduce"
maybe we can make this more reliable by catching the one element with the problem, and letting the other availability elements in the report proceed.
A JON customer was seeing this for a couple of platforms, but not for other platforms in their inventory (Issue 281032). The two problematic platforms were red, not grey. Restarting the corresponding Agent with -lu got the platforms back to green, so at least there appears to be a workaround. The customer says that there were no network, DNS, or clock sync issues between the two Agent machines and the Server machine at the time the PropertyValueExceptions occurred and the platforms went red. The customer is running JON 2.1.2.
This bug was previously known as http://jira.rhq-project.org/browse/RHQ-327 This bug is duplicated by RHQ-385
Mass move to component= Monitoring
I think we can close this as it's stale and there has been a lot of refactoring in the avail handling code.
From the JON perspective the issue was last reported on JON 2.1 and does not appear to have occurred since. I would be willing to say the issue is out-of-date and not reproducible in the latest release. I will defer to ccrouch for his response.
Setting this to ON_QA for further triage. Recommending it be closed.
Bulk closing of BZs that have no target version set, but which are ON_QA for more than a year and thus are in production for a long time.