Bug 1950761
Summary: | Monitoring operator deployments anti-affinity rules prevent their rollout on single-node | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Omer Tuchfeld <otuchfel> | ||||||
Component: | Monitoring | Assignee: | Damien Grisonnet <dgrisonn> | ||||||
Status: | CLOSED ERRATA | QA Contact: | Junqi Zhao <juzhao> | ||||||
Severity: | urgent | Docs Contact: | |||||||
Priority: | urgent | ||||||||
Version: | 4.8 | CC: | alegrand, anpicker, david.karlsen, dgrisonn, erooth, hongyli, juzhao, kakkoyun, lcosic, minmli, pkrupa, rfreiman, wking | ||||||
Target Milestone: | --- | ||||||||
Target Release: | 4.8.0 | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | No Doc Update | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2021-07-27 23:01:42 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Omer Tuchfeld
2021-04-18 12:42:20 UTC
It broke SNO CI. https://testgrid.k8s.io/redhat-single-node#periodic-ci-openshift-release-master-nightly-4.8-e2e-metal-single-node-live-iso I created this WIP PR to verify that this is truly the cause, and it seems to confirm it: https://github.com/openshift/cluster-monitoring-operator/pull/1121 tested with 4.8.0-0.nightly-2021-04-19-121657, don't have rollout issue now, attach the prometheus-adapter/thanos-querier deployment files Created attachment 1773611 [details]
thanos-querier deployment file
Created attachment 1773612 [details]
prometheus-adapter deployment file
Behavior is not as expected, deployment prometheus-adapter should have affinity #oc -n openshift-monitoring get deployment prometheus-adapter -oyaml|grep -A10 affinity #oc -n openshift-monitoring get deployment thanos-querier -oyaml|grep -A10 affinity -- affinity: podAntiAffinity: preferredDuringSchedulingIgnoredDuringExecution: - podAffinityTerm: labelSelector: matchExpressions: - key: app.kubernetes.io/name operator: In values: - thanos-query namespaces: (In reply to hongyan li from comment #7) > Behavior is not as expected, deployment prometheus-adapter should have > affinity > > #oc -n openshift-monitoring get deployment prometheus-adapter -oyaml|grep > -A10 affinity > #oc -n openshift-monitoring get deployment thanos-querier -oyaml|grep -A10 > affinity > -- > affinity: > podAntiAffinity: > preferredDuringSchedulingIgnoredDuringExecution: > - podAffinityTerm: > labelSelector: > matchExpressions: > - key: app.kubernetes.io/name > operator: In > values: > - thanos-query > namespaces: Confirmed with Damien, this is an expected behavior for now and the fix is temporary. *** Bug 1950911 has been marked as a duplicate of this bug. *** *** Bug 1952762 has been marked as a duplicate of this bug. *** Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2438 |