Hide Forgot
Description of problem: Intel is investigating RHS and have pointed out some monitoring deficiencies. They would like the ability to grab statistics on self-healing related data such as how long a heal took to complete over a period of time as well as something that can estimate how long it will take to heal. This is in the context of taking a server offline in an active cluster and then bringing it back online. Given an amount of data will have changed during this time, they need a way to see how quickly data is being healed and to estimate how long it will feel to complete a heal-in-progress.