Bug 1811748
Summary: | Resources not rendered are not removed upon CNO recreation | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Maysa Macedo <mdemaced> | |
Component: | Networking | Assignee: | Maysa Macedo <mdemaced> | |
Networking sub component: | kuryr | QA Contact: | GenadiC <gcheresh> | |
Status: | CLOSED ERRATA | Docs Contact: | ||
Severity: | medium | |||
Priority: | unspecified | CC: | rbrattai | |
Version: | 4.4 | |||
Target Milestone: | --- | |||
Target Release: | 4.5.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | No Doc Update | ||
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1811830 (view as bug list) | Environment: | ||
Last Closed: | 2020-07-13 17:19:06 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1811830 |
Description
Maysa Macedo
2020-03-09 16:59:01 UTC
Unable to reproduce orphaned resources on 4.5.0-0.nightly-2020-03-16-101116 with OpenShiftSDN The behaviour of the clusteroperator network relatedObjects on OpenShiftSDN seems to be slightly different, 'relatedObjects' never seems to be nil. When I delete the network-operator pod I do not see relatedObjects changing to nil. The only way I was able to reproduce the original issue on 4.4 SDN was to scale the CNO DaemonSet to 0, then oc edit and change the network config. Orphaned resources reproduction steps on 4.4 OpenShiftSDN 1. Add a new multus network network oc edit networks.operator.openshift.io cluster spec: additionalNetworks: - name: bridge-ipam-dhcp namespace: openshift-multus rawCNIConfig: '{ "name": "bridge-ipam-dhcp", "cniVersion": "0.3.1", "type": "bridge", "master": "ens5", "ipam": { "type": "dhcp" } }' type: Raw 2. Verify dhcp daemon pods are created in multus namespace oc get -n openshift-multus pods -l app=dhcp-daemon 3. scale the CNO to 0 and verify the pod is deleted oc -n openshift-network-operator scale deployment network-operator --replicas=0 4. oc edit networks.operator.openshift.io cluster and delete the additional network we added in step 1 5. oc -n openshift-network-operator scale deployment network-operator --replicas=1 6. verify the dhcp pods are still alive and have not been terminated oc get -n openshift-multus pods -l app=dhcp-daemon With these steps the dhcp-pods are not terminated on 4.4 On 4.5.0-0.nightly-2020-03-16-101116 the dhcp-daemon pods are terminated This seems to suggest something has been resolved on 4.5 With OpenShiftSDN I have never seen clusteroperator network 'relatedObjects' be nil @anusaxen reports that with OVNKubernetes he also has not seen 'relatedObjects' be nil Can the Kuryr team also look and see if the root cause for the 'relatedObjects' nil state can be identified as well? The fix was already in place with 4.5.0-0.nightly-2020-03-16-101116 release image. I could see the 'relatedObjects' fields not present also when using OpenShiftSDN, by constantly checking the field value with: 'oc get co network -o yaml -w' (it keeps the record of changes that happened in the object) The same issue can be seen with Kuryr, as the population of the relatedObjects only happens after the updated of ClusterOperator have happened. The fix solves the issue with Kuryr as well. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:2409 |