Bug 1931033
Summary: | Upgrade from 4.3 to 4.4 kube-apiserver operator Degraded due to endpoints/host-etcd-2.openshift-etcd: not found | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Ke Wang <kewang> |
Component: | Etcd | Assignee: | Sam Batschelet <sbatsche> |
Status: | CLOSED EOL | QA Contact: | ge liu <geliu> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 4.4 | ||
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-03-05 19:42:08 UTC | Type: | Bug |
Description
Ke Wang
2021-02-20 07:48:54 UTC
We have been hitting this problem for several days in our upgrade tests from 4.3 to 4.4: the kube-apiserver operator goes Degraded, although the upgrade does finish in the end.

Looks like the operator started at 01:08:11:

> 2021-02-09T01:08:11.431099654Z I0209 01:08:11.430946 1 cmd.go:196] Using service-serving-cert provided certificates

It created the endpoint resource at 01:10:30:

> namespaces/openshift-etcd-operator/pods/etcd-operator-7bccfd8865-6kjh5/operator/operator/logs/current.log:1540:2021-02-09T01:10:30.644284521Z I0209 01:10:30.643124 1 event.go:278] Event(v1.ObjectReference{Kind:"Deployment", Namespace:"openshift-etcd-operator", Name:"etcd-operator", UID:"e9d55a70-c20a-40ac-9b2c-a973fdee0bab", APIVersion:"apps/v1", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'EndpointsCreated' Created endpoints/host-etcd-2 -n openshift-etcd because it was missing

It created it again at 01:17:31:

> namespaces/openshift-etcd-operator/pods/etcd-operator-7bccfd8865-6kjh5/operator/operator/logs/current.log:3654:2021-02-09T01:17:31.509572929Z I0209 01:17:31.500276 1 event.go:278] Event(v1.ObjectReference{Kind:"Deployment", Namespace:"openshift-etcd-operator", Name:"etcd-operator", UID:"e9d55a70-c20a-40ac-9b2c-a973fdee0bab", APIVersion:"apps/v1", ResourceVersion:"", FieldPath:""}): type: 'Warning' reason: 'EndpointsCreated' Created endpoints/host-etcd-2 -n openshift-etcd because it was missing

There is an issue with endpoint resources being removed automatically if they don't have a service [2], [3]. We resolved this in later versions (4.5+) by moving from an endpoint resource to a configmap.

```
- apiVersion: v1
  kind: Endpoints
  metadata:
    creationTimestamp: "2021-02-09T01:22:58Z"
    name: host-etcd-2
    namespace: openshift-etcd
[..]
```

As 4.4 goes EOL at 4.7 GA, I am not sure this will be resolved in 4.4.
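To check how often the operator had to recreate an endpoint in a must-gather, the `EndpointsCreated` events can be pulled out of the etcd-operator log. The following is only a rough sketch: the function name `endpoint_recreations` and the regular expression are my own, written against the event format quoted above, and the sample lines abbreviate the `ObjectReference` payload as `Event(...)`.

```python
import re
from collections import defaultdict

# Captures the leading RFC 3339 timestamp (to the second) plus the endpoint
# name and namespace from an 'EndpointsCreated' event line. The pattern is an
# assumption based on the log excerpts in this report, not an official format.
EVENT_RE = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})\.\d+Z .*"
    r"reason: 'EndpointsCreated' "
    r"Created endpoints/(?P<name>\S+) -n (?P<ns>\S+)"
)


def endpoint_recreations(lines):
    """Map (namespace, endpoint name) to the timestamps at which it was created."""
    found = defaultdict(list)
    for line in lines:
        match = EVENT_RE.search(line)
        if match:
            found[(match["ns"], match["name"])].append(match["ts"])
    return dict(found)


# Sample input: the two events from this report, with the ObjectReference
# payload shortened to "Event(...)" for readability.
SAMPLE_LINES = [
    "2021-02-09T01:08:11.431099654Z I0209 01:08:11.430946 1 cmd.go:196] "
    "Using service-serving-cert provided certificates",
    "2021-02-09T01:10:30.644284521Z I0209 01:10:30.643124 1 event.go:278] "
    "Event(...): type: 'Warning' reason: 'EndpointsCreated' "
    "Created endpoints/host-etcd-2 -n openshift-etcd because it was missing",
    "2021-02-09T01:17:31.509572929Z I0209 01:17:31.500276 1 event.go:278] "
    "Event(...): type: 'Warning' reason: 'EndpointsCreated' "
    "Created endpoints/host-etcd-2 -n openshift-etcd because it was missing",
]

if __name__ == "__main__":
    for (ns, name), stamps in endpoint_recreations(SAMPLE_LINES).items():
        print(f"{ns}/{name}: created {len(stamps)} time(s) at {', '.join(stamps)}")
```

An endpoint that shows up more than once in the result was deleted and recreated in between, which matches the removal behaviour described above.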
[1] https://github.com/openshift/cluster-etcd-operator/blob/release-4.4/pkg/operator/hostendpointscontroller2/host_endpoints_controller.go#L165
[2] https://bugzilla.redhat.com/show_bug.cgi?id=1836927
[3] https://github.com/openshift/cluster-etcd-operator/pull/354

4.4 is now EOL