Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2074890

Summary: [4.10] Prometheus CI failure: 503 Service Unavailable
Product: OpenShift Container Platform Reporter: Junqi Zhao <juzhao>
Component: MonitoringAssignee: Simon Pasquier <spasquie>
Status: CLOSED DUPLICATE QA Contact: Junqi Zhao <juzhao>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.10CC: amuller, anpicker, aos-bugs, erooth, janantha
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-04-21 13:47:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Junqi Zhao 2022-04-13 09:38:08 UTC
Description of problem:
the bug is found when verify 4.11 bug 2066457
searched 4.10 CI jobs
https://search.ci.openshift.org/?search=Prometheus%3A+request+failed+with+code+503+Service+Unavailable%3B&maxAge=48h&context=1&type=bug%2Bjunit&name=4.10.*&excludeName=&maxMatches=5&maxBytes=20971520&groupBy=job

there are similar errors in 4.10, example:
https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-ci-4.10-upgrade-from-stable-4.9-e2e-azure-upgrade/1514090767277101056
: [sig-arch][Feature:ClusterUpgrade] Cluster should remain functional during upgrade [Disruptive] [Serial] expand_less 	1h12m15s
{  fail [github.com/openshift/origin/test/extended/prometheus/upgrade.go:57]: Unexpected error:
    <errors.aggregate | len:1, cap:1>: [
        {
            s: "receive series from Addr: 10.131.0.7:10901 LabelSets:  Mint: -62167219200000 Maxt: 9223372036854775807: rpc error: code = Unknown desc = query Prometheus: request failed with code 503 Service Unavailable; msg Service Unavailable",
        },
    ]
    receive series from Addr: 10.131.0.7:10901 LabelSets:  Mint: -62167219200000 Maxt: 9223372036854775807: rpc error: code = Unknown desc = query Prometheus: request failed with code 503 Service Unavailable; msg Service Unavailable
occurred}


Version-Release number of selected component (if applicable):
searched prow jobs

How reproducible:


Steps to Reproduce:
1. see the description
2.
3.

Actual results:


Expected results:


Additional info:

Comment 1 Simon Pasquier 2022-04-21 13:47:56 UTC

*** This bug has been marked as a duplicate of bug 2077516 ***