Bug 2018965
Summary: | e2e-metal-ipi-upgrade is permafailing in 4.10 | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Stephen Benjamin <stbenjam> |
Component: | Installer | Assignee: | Arda Guclu <aguclu> |
Installer sub component: | OpenShift on Bare Metal IPI | QA Contact: | Amit Ugol <augol> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | urgent | ||
Priority: | high | CC: | aguclu, bfournie, sippy, wking |
Version: | 4.10 | Keywords: | OtherQA, Triaged |
Target Milestone: | --- | ||
Target Release: | 4.10.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | job=periodic-ci-openshift-release-master-nightly-4.10-e2e-metal-ipi-upgrade=all |
Last Closed: | 2022-03-10 16:23:41 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Stephen Benjamin
2021-11-01 11:34:29 UTC
Arda has a fix to change the webhook port number being used (https://github.com/openshift/cluster-baremetal-operator/pull/213), which was thought to be the source of the problem. However, the PR's upgrade job still failed with the same error. So, although that fix is necessary, there may be something else going on.

It does not look like all the references were fixed:

~ git cluster-baremetal-operator $ grep -r 9443 .
./config/profiles/default/manager_webhook_patch.yaml: - containerPort: 9443
./config/webhook/service.yaml: targetPort: 9443
./manifests/0000_31_cluster-baremetal-operator_03_webhookservice.yaml: targetPort: 9443
./manifests/0000_31_cluster-baremetal-operator_06_deployment.yaml: - containerPort: 9443
./vendor/github.com/prometheus/procfs/fixtures.ttar:trans 706 944304 0
./vendor/sigs.k8s.io/controller-runtime/pkg/webhook/server.go:var DefaultPort = 9443
./vendor/sigs.k8s.io/controller-runtime/pkg/webhook/server.go: // It will be defaulted to 9443 if unspecified.

BMO uses the masters' IP addresses, but cluster-baremetal-operator uses 10.*.*.* IP addresses and runs for a long time (maybe that is why it does not cause a port conflict). I think there is no need to change the configurations above to fix this bug, but in the long term we should change to a different port number.

According to the latest metrics at https://deck-ci.apps.ci.l2s4.p1.openshiftapps.com/job-history/gs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.10-e2e-metal-ipi-upgrade, upgrade jobs are passing after the fix. I'm closing this bug.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056
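For anyone repeating the `grep -r 9443 .` audit from the description: that output mixes the operator's own manifests with vendored code and with an unrelated number (`944304` in `fixtures.ttar`). The following is a minimal Python sketch of a narrower search, assuming that skipping `vendor` and `.git` and matching the port only as a standalone number is what is wanted. The names `find_port_references` and `demo` are hypothetical helpers written for this illustration and are not part of cluster-baremetal-operator. The demo tree is made up and only mirrors the shape of the grep output.

```python
import os
import re
import tempfile


def find_port_references(root, port, skip_dirs=("vendor", ".git")):
    """Return sorted (relative_path, line_number, stripped_line) tuples
    for lines under `root` that contain `port` as a standalone number."""
    # Reject matches that are part of a longer digit run, e.g. 944304.
    pattern = re.compile(r"(?<!\d)" + re.escape(str(port)) + r"(?!\d)")
    hits = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune skipped directories in place so os.walk does not descend into them.
        dirnames[:] = [d for d in dirnames if d not in skip_dirs]
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                with open(path, encoding="utf-8", errors="ignore") as fh:
                    for lineno, line in enumerate(fh, start=1):
                        if pattern.search(line):
                            rel = os.path.relpath(path, root)
                            hits.append((rel, lineno, line.strip()))
            except OSError:
                # Unreadable files are skipped rather than aborting the scan.
                continue
    return sorted(hits)


def demo():
    """Build a small made-up tree and scan it for port 9443."""
    samples = {
        os.path.join("config", "webhook", "service.yaml"): "  targetPort: 9443\n",
        os.path.join("vendor", "example", "server.go"): "var DefaultPort = 9443\n",
        os.path.join("testdata", "fixtures.ttar"): "trans 706 944304 0\n",
    }
    with tempfile.TemporaryDirectory() as root:
        for rel, text in samples.items():
            full = os.path.join(root, rel)
            os.makedirs(os.path.dirname(full), exist_ok=True)
            with open(full, "w", encoding="utf-8") as fh:
                fh.write(text)
        return find_port_references(root, 9443)


if __name__ == "__main__":
    for rel, lineno, line in demo():
        print(f"{rel}:{lineno}: {line}")
```

With GNU grep, `grep -rw 9443 --exclude-dir=vendor .` should give a similar result. The Python version is only meant to make the two filters explicit.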