Description of problem (please be detailed as possible and provide a log snippets): [RDR] Alerts should be raised when snapshot scheduling is stopped for a long time A version of all relevant components (if applicable): Does this issue impact your ability to continue to work with the product? (please explain in detail what is the user impact)? Is there any workaround available to the best of your knowledge? Rate from 1 - 5 the complexity of the scenario you performed that caused this bug (1 - very simple, 5 - very complex)? Can this issue be reproducible? Can this issue reproduce from the UI? If this is a regression, please provide more details to justify this: Steps to Reproduce: 1. 2. 3. Actual results: There are no alerts raised when snapshot scheduling is stopped on rbd mirrored images Expected results: Respective alerts should be raised Additional info:
Metric for delay doesn't exist currently, we're planning to add info about scheduling in the metrics for a future release, based on this doc. https://docs.google.com/document/d/11QIuTiK_n4ufIq4rzmERIjGDO3zKu_o_6SCUT4DwsIE/edit
We might be able to do this in 4.14 not 4.13, as the metric required is still not merged. To track it, here's the PR: https://github.com/ceph/ceph/pull/50711
Hi Aman, The metric that might help with formation of the epic is not available in ceph yet. Might need more information regarding the complete picture in order to go ahead of this alert. Thanks, Divyansh