Bug 2010375
Summary: | OpenShift Alerting Rules Style-Guide Compliance | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Brad Ison <brad.ison> |
Component: | OLM | Assignee: | Anik <anbhatta> |
OLM sub component: | OLM | QA Contact: | xzha |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | low | ||
Priority: | low | CC: | anbhatta, ankithom, cchantse, spasquie |
Version: | 4.10 | ||
Target Milestone: | --- | ||
Target Release: | 4.12.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2023-01-17 19:46:45 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Brad Ison
2021-10-04 14:06:25 UTC
Any progress on this issue? The monitoring team could help if needed. verify: 1) install cluster with this PR zhaoxia@xzha-mac openshift-tests-private % oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.11.0-0.ci.test-2022-06-13-054554-ci-ln-zn13fyk-latest True False 39m Cluster version is 4.11.0-0.ci.test-2022-06-13-054554-ci-ln-zn13fyk-latest zhaoxia@xzha-mac openshift-tests-private % oc get catsrc NAME DISPLAY TYPE PUBLISHER AGE certified-operators Certified Operators grpc Red Hat 61m community-operators Community Operators grpc Red Hat 61m redhat-marketplace Red Hat Marketplace grpc Red Hat 61m redhat-operators Red Hat Operators grpc Red Hat 61m 2) make catsrc certified-operators pod is pending oc patch catsrc certified-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge zhaoxia@xzha-mac openshift-tests-private % oc get pod NAME READY STATUS RESTARTS AGE certified-operators-7sns9 0/1 Pending 0 8s community-operators-8kqfv 1/1 Running 0 40m marketplace-operator-85d9b67789-g7tjw 1/1 Running 0 43m redhat-marketplace-bcc47 1/1 Running 0 40m redhat-operators-w6cbn 1/1 Running 0 40m 3) check alert zhaoxia@xzha-mac openshift-tests-private % curl -k -H "Authorization: Bearer $(oc create token prometheus-k8s -n openshift-monitoring)" https://$(oc get route prometheus-k8s -n openshift-monitoring -o=jsonpath='{.spec.host}')/api/v1/alerts| jq -r '.data.alerts[] | select (.labels.alertname == "OperatorHubSourceError")' { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.25:8443", "job": "catalog-operator-metrics", "name": "certified-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-6468cdd79f-4gnqb", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the certified-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by certified-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=certified-operators) to diagnose and repair.", "summary": "The certified-operators source is in non-ready state for more than 10 minutes." }, "state": "firing", "activeAt": "2022-06-13T06:38:33.120882761Z", "value": "0e+00" } 4) stop other catalogs oc patch catsrc community-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge oc patch catsrc redhat-marketplace -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge oc patch catsrc redhat-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge 5) check alert zhaoxia@xzha-mac openshift-tests-private % curl -k -H "Authorization: Bearer $(oc create token prometheus-k8s -n openshift-monitoring)" https://$(oc get route prometheus-k8s -n openshift-monitoring -o=jsonpath='{.spec.host}')/api/v1/alerts| jq -r '.data.alerts[] | select (.labels.alertname == "OperatorHubSourceError")' % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 10566 0 10566 0 0 7908 0 --:--:-- 0:00:01 --:--:-- 7998 { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.25:8443", "job": "catalog-operator-metrics", "name": "certified-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-6468cdd79f-4gnqb", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the certified-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by certified-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=certified-operators) to diagnose and repair.", "summary": "The certified-operators source is in non-ready state for more than 10 minutes." }, "state": "firing", "activeAt": "2022-06-13T06:38:33.120882761Z", "value": "0e+00" } { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.25:8443", "job": "catalog-operator-metrics", "name": "community-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-6468cdd79f-4gnqb", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the community-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by community-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=community-operators) to diagnose and repair.", "summary": "The community-operators source is in non-ready state for more than 10 minutes." }, "state": "firing", "activeAt": "2022-06-13T06:51:33.120882761Z", "value": "0e+00" } { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.25:8443", "job": "catalog-operator-metrics", "name": "redhat-marketplace", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-6468cdd79f-4gnqb", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the redhat-marketplace source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by redhat-marketplace source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=redhat-marketplace) to diagnose and repair.", "summary": "The redhat-marketplace source is in non-ready state for more than 10 minutes." }, "state": "pending", "activeAt": "2022-06-13T06:53:33.120882761Z", "value": "0e+00" } { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.25:8443", "job": "catalog-operator-metrics", "name": "redhat-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-6468cdd79f-4gnqb", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the redhat-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by redhat-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=redhat-operators) to diagnose and repair.", "summary": "The redhat-operators source is in non-ready state for more than 10 minutes." }, "state": "pending", "activeAt": "2022-06-13T06:54:03.120882761Z", "value": "0e+00" } LGTM, verified. verify: zhaoxia@xzha-mac ~ % oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.12.0-0.nightly-2022-07-24-180529 True False 38m Cluster version is 4.12.0-0.nightly-2022-07-24-180529 zhaoxia@xzha-mac ~ % oc get catsrc -A NAMESPACE NAME DISPLAY TYPE PUBLISHER AGE openshift-marketplace certified-operators Certified Operators grpc Red Hat 56m openshift-marketplace community-operators Community Operators grpc Red Hat 56m openshift-marketplace redhat-marketplace Red Hat Marketplace grpc Red Hat 56m openshift-marketplace redhat-operators Red Hat Operators grpc Red Hat 56m 1) make catsrc pod is pending zhaoxia@xzha-mac ~ % oc patch catsrc certified-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge catalogsource.operators.coreos.com/certified-operators patched zhaoxia@xzha-mac ~ % oc patch catsrc community-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge catalogsource.operators.coreos.com/community-operators patched zhaoxia@xzha-mac ~ % oc patch catsrc redhat-marketplace -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge catalogsource.operators.coreos.com/redhat-marketplace patched zhaoxia@xzha-mac ~ % oc patch catsrc redhat-operators -p='{"spec":{"grpcPodConfig":{"nodeSelector":{"fake43642":"fake"}}}}' --type=merge catalogsource.operators.coreos.com/redhat-operators patched zhaoxia@xzha-mac ~ % oc get pod NAME READY STATUS RESTARTS AGE certified-operators-szdsx 0/1 Pending 0 2m28s community-operators-q5hn5 0/1 Pending 0 2m18s marketplace-operator-bbbc9755c-lpkpr 1/1 Running 5 (44m ago) 62m redhat-marketplace-dhzx4 0/1 Pending 0 2m7s redhat-operators-lhdw7 0/1 Pending 0 117s 2) check alert zhaoxia@xzha-mac ~ % curl -k -H "Authorization: Bearer $(oc create token prometheus-k8s -n openshift-monitoring)" https://$(oc get route prometheus-k8s -n openshift-monitoring -o=jsonpath='{.spec.host}')/api/v1/alerts| jq -r '.data.alerts[] | select (.labels.alertname == "OperatorHubSourceError")' % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 4305 0 4305 0 0 4527 0 --:--:-- --:--:-- --:--:-- 4565 { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.20:8443", "job": "catalog-operator-metrics", "name": "certified-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-98ccdfbfc-d9bqf", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the certified-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by certified-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=certified-operators) to diagnose and repair.", "summary": "The certified-operators source is in non-ready state for more than 10 minutes." }, "state": "pending", "activeAt": "2022-07-25T05:39:47.911111372Z", "value": "0e+00" } { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.20:8443", "job": "catalog-operator-metrics", "name": "community-operators", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-98ccdfbfc-d9bqf", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the community-operators source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by community-operators source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=community-operators) to diagnose and repair.", "summary": "The community-operators source is in non-ready state for more than 10 minutes." }, "state": "pending", "activeAt": "2022-07-25T05:39:47.911111372Z", "value": "0e+00" } { "labels": { "alertname": "OperatorHubSourceError", "container": "catalog-operator", "endpoint": "https-metrics", "exported_namespace": "openshift-marketplace", "instance": "10.128.0.20:8443", "job": "catalog-operator-metrics", "name": "redhat-marketplace", "namespace": "openshift-operator-lifecycle-manager", "pod": "catalog-operator-98ccdfbfc-d9bqf", "service": "catalog-operator-metrics", "severity": "warning" }, "annotations": { "description": "Operators shipped via the redhat-marketplace source are not available for installation until the issue is fixed. Operators already installed from this source will not receive updates until issue is fixed. Inspect the status of the pod owned by redhat-marketplace source in the openshift-marketplace namespace (oc -n openshift-marketplace get pods -l olm.catalogSource=redhat-marketplace) to diagnose and repair.", "summary": "The redhat-marketplace source is in non-ready state for more than 10 minutes." }, "state": "pending", "activeAt": "2022-07-25T05:39:47.911111372Z", "value": "0e+00" } LGTM, verified. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.12.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:7399 |