Bug 1918938
Summary: | ocs-operator has Error logs with "unable to deploy Prometheus rules" | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat OpenShift Container Storage | Reporter: | Neha Berry <nberry> |
Component: | ocs-operator | Assignee: | umanga <uchapaga> |
Status: | CLOSED ERRATA | QA Contact: | Neha Berry <nberry> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.7 | CC: | branto, jarrpa, madam, mbukatov, muagarwa, ocs-bugs, sostapov, uchapaga |
Target Milestone: | --- | Keywords: | Regression |
Target Release: | OCS 4.7.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | ocs-registry:4.7.0-241.ci | Doc Type: | No Doc Update |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-05-19 09:18:16 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Neha Berry
2021-01-21 18:10:15 UTC
After quick discussion with Umanga, it seems that this breaks all OCS Alerts (so that no such alert could be raised). This is a valid problem that should be considered a blocker. Acking for OCS 4.7. QE will check that there are no prometheus errors in operator logs and via regression testing that alerting works. We might need to update the downstream Dockerfile to copy the prometheus rules. Do you regularly update the files in ./metrics/deploy/prometheus-ocs-rules-external.yaml ./metrics/deploy/prometheus-ocs-rules.yaml or do we need to generate them manually downstream? i.e. Can we directly use these files downstream? If they need to be generated, we will have to do some big changes to the way we build ocs-operator downstream. Please do provide the steps and how you generate these files if that is the case. (In reply to Boris Ranto from comment #9) > We might need to update the downstream Dockerfile to copy the prometheus > rules. > > Do you regularly update the files in > > ./metrics/deploy/prometheus-ocs-rules-external.yaml > ./metrics/deploy/prometheus-ocs-rules.yaml > Yes these will be updated as required for each release. No need to generate anything. In that case, this should be fixed by http://pkgs.devel.redhat.com/cgit/containers/ocs-operator/commit/Dockerfile?h=ocs-4.7-rhel-8&id=d0099ff5a24df54e011ccba415a0c925b12e1e76 If I understand this correctly, we didn't have to do this in OCS 4.6 since these yamls were somehow built-in in the source code/binary, right? This should be fixed in the latest build: ocs-registry:4.7.0-241.ci Moving the BZ to verified based on Comment#15 Also, verified that OCS, noobaa and ceph alerting rules exist in UI->Monitoring->Alerting->Alerting Rules. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: Red Hat OpenShift Container Storage 4.7.0 security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2041 |