Hide Forgot
We have no way to expose information that any of the components are broken. When people monitor ovirt based solutions they check whether a host on which engine or vdsm is running is reachable and whether the services are up and running. There is no way to check whether hosts are connected from the engine perspective or storage or other parts works fine from vdsm perspective. We are missing information about any "logical" failures. We need to have a api like healthcheck functionality which would tell monitoring systems or sysadmins whether solution is healthy or not so automated alerts could be triggered.
We could do it via collectd, but I don't see yet a demand for it. Closing for the time being.