Bug 1881258
Summary: | Prometheus (Community) Operator clashes with cluster-monitoring-config prometheus-k8s PVCs | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Brendan Shirren <bshirren> |
Component: | Monitoring | Assignee: | Sergiusz Urbaniak <surbania> |
Status: | CLOSED UPSTREAM | QA Contact: | Junqi Zhao <juzhao> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 4.5 | CC: | aabhishe, alegrand, anpicker, bshirren, dahernan, dsover, ecordell, erooth, kakkoyun, lcosic, mloibl, naoto30, nhale, pkrupa, spasquie, surbania |
Target Milestone: | --- | Flags: | bshirren:
needinfo-
|
Target Release: | 4.7.0 | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause:
Due to a race of installing prometheus-operator via OLM and the in-cluster prometheus-operator there can be a clash of custom resources because OLM may register a different version of the monitoring CRD.
Consequence:
If you specified custom names for PVCs in the cluster-monitoring-operator configmap then this can cause Prometheus & Alertmanager PVCs configured in "cluster-monitoring-config" configmap (openshift-monitoring) to revert to default naming.
Fix:
The prometheus operator pod in the openshift-monitoring namespace must be killed. This will cause them to be restarted and the CRDs reinitialized to the correct version.
Result:
Custom configured PVC names start working.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-01-07 08:15:10 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Brendan Shirren
2020-09-22 01:53:54 UTC
The Community Prometheus operator is installed in a completely different namespace(i.e "metrics" ns) and that triggered Prometheus and Alertmanager pods in openshift-monitoring to recycle. Age of STS, deployments, etc. show their parent resources weren't replaced. I guess expecting OLM to deny the installation of th community operator is not the way to go, instead ensuring that same resources ids are not used or not affecting rest of namespaces. *** Bug 1889681 has been marked as a duplicate of this bug. *** My opinion is that CMO should use different CRD names like *.monitoring.openshift.io from community PO's name like *.monitoring.coreos.com. So IMO this bug should be filed against CMO, not OLM. |