Created attachment 1671489
SSP log

Description of problem:
Can't re-deploy HCO; SSP does not report conditions.

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. Deploy KubeVirt + HCO
2. Delete the HCO operator and wait for all the other operators to be removed as well.
3. Create HCO again

Actual results:
HCO condition is always Progressing = True, Available = False

```
- lastHeartbeatTime: '2020-03-18T08:07:18Z'
  lastTransitionTime: '2020-03-18T08:05:28Z'
  message: KubevirtNodeLabellerBundle resource has no conditions
  reason: KubevirtNodeLabellerBundleConditions
  status: 'False'
  type: Available
- lastHeartbeatTime: '2020-03-18T08:07:18Z'
  lastTransitionTime: '2020-03-18T08:05:28Z'
  message: KubevirtNodeLabellerBundle resource has no conditions
  reason: KubevirtNodeLabellerBundleConditions
  status: 'True'
  type: Progressing
```

Expected results:
HCO is Available and Upgradeable

Additional info:

```
oc get KubevirtCommonTemplatesBundle -n openshift -o yaml
apiVersion: v1
items:
- apiVersion: ssp.kubevirt.io/v1
  kind: KubevirtCommonTemplatesBundle
  metadata:
    creationTimestamp: 2020-03-18T13:33:29Z
    generation: 1
    labels:
      app: kubevirt-hyperconverged
    name: common-templates-kubevirt-hyperconverged
    namespace: openshift
    resourceVersion: "604948"
    selfLink: /apis/ssp.kubevirt.io/v1/namespaces/openshift/kubevirtcommontemplatesbundles/common-templates-kubevirt-hyperconverged
    uid: 167ab782-95ad-424e-ac3b-3a31f5326b9a
  spec: {}
kind: List
metadata:
  resourceVersion: ""
  selfLink: ""
```
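For anyone triaging a similar state: the `oc get` output above shows a bundle CR with an empty `spec` and no `status` at all, which matches the "resource has no conditions" message. A minimal sketch of how such CRs could be spotted from `oc get ... -o json` output follows. The helper name `items_without_conditions` is hypothetical, and the `reported` dict is a trimmed copy of the list above, not a new capture.

```python
from typing import Any, Dict, List


def items_without_conditions(resource_list: Dict[str, Any]) -> List[str]:
    """Return the names of list items whose status has no conditions.

    Hypothetical helper: it expects the parsed JSON of a Kubernetes
    "kind: List" object, such as `oc get <kind> -o json` prints.
    """
    missing = []
    for item in resource_list.get("items", []):
        # A CR that the operator never reconciled has no status block.
        conditions = (item.get("status") or {}).get("conditions") or []
        if not conditions:
            missing.append(item["metadata"]["name"])
    return missing


# Trimmed copy of the list from this report (only the relevant fields).
reported = {
    "apiVersion": "v1",
    "kind": "List",
    "items": [
        {
            "apiVersion": "ssp.kubevirt.io/v1",
            "kind": "KubevirtCommonTemplatesBundle",
            "metadata": {
                "name": "common-templates-kubevirt-hyperconverged",
                "namespace": "openshift",
            },
            "spec": {},
        }
    ],
}

print(items_without_conditions(reported))
# prints ['common-templates-kubevirt-hyperconverged']
```

An empty result would mean every listed CR reports at least one condition; it says nothing about whether those conditions are healthy.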
I think the reproduction steps are not quite accurate. The correct path is:

Steps to Reproduce:
1. Deploy CNV and create the HCO CR; wait for it to be running successfully with ready=1
2. Delete the HCO CR and wait for all the other operators' CRs to be removed as well
3. Create the HCO CR again
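The corrected steps could be scripted roughly as below. This is only a sketch under stated assumptions: the CR name `kubevirt-hyperconverged`, the namespace `openshift-cnv`, the manifest path `hco-cr.yaml`, and the plural resource name `kubevirtnodelabellerbundles` are guesses and are not taken from this report. Waiting on the `Available` condition is a stand-in for "ready=1". The script only prints the `oc` commands so they can be reviewed before anything is run.

```python
import shlex
from typing import List

# Assumptions, NOT taken from the report: adjust to the actual cluster.
HCO_NAME = "kubevirt-hyperconverged"
NAMESPACE = "openshift-cnv"
HCO_MANIFEST = "hco-cr.yaml"


def reproduction_commands(
    name: str = HCO_NAME,
    namespace: str = NAMESPACE,
    manifest: str = HCO_MANIFEST,
    timeout: str = "10m",
) -> List[List[str]]:
    """Build the oc commands for the create / delete / re-create flow."""
    target = f"hyperconverged/{name}"
    create = ["oc", "apply", "-n", namespace, "-f", manifest]
    wait_available = [
        "oc", "wait", "-n", namespace, target,
        "--for=condition=Available", f"--timeout={timeout}",
    ]
    return [
        # Step 1: create the HCO CR and wait until it reports Available.
        create,
        wait_available,
        # Step 2: delete the HCO CR, then list the SSP CRs to confirm
        # they are gone before continuing (resource names are assumed).
        ["oc", "delete", "-n", namespace, target, "--wait=true"],
        ["oc", "get",
         "kubevirtcommontemplatesbundles,kubevirtnodelabellerbundles",
         "--all-namespaces"],
        # Step 3: create the HCO CR again and inspect its conditions.
        create,
        ["oc", "get", "-n", namespace, target, "-o", "yaml"],
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for cmd in reproduction_commands():
        print(shlex.join(cmd))
```

Printing rather than executing keeps the sketch safe to run anywhere; piping the output into a shell is left to whoever has confirmed the names for their cluster.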
It looks like SSP is randomly ignoring recreated CRs. I updated SSP with the latest Ansible operator; we will see if that fixes it.
It seems this will block the upgrade flow. Targeting 2.3; we need PM ack.
I found all root causes of this bug (https://github.com/MarSik/kubevirt-ssp-operator/pull/158, https://github.com/MarSik/kubevirt-ssp-operator/pull/157). New containers are being built.
Performed the following steps:
1. Deployed CNV 2.3 with the HCO CR (waited until all pods were running)
2. Deleted HCO and waited for the related pods to be removed
3. Added the HCO CR to the cluster again and waited for all pods to be running

Here is the output of the HCO heartbeats:

- lastHeartbeatTime: '2020-04-01T12:44:30Z'
  lastTransitionTime: '2020-04-01T11:57:50Z'
  message: Reconcile completed successfully
  reason: ReconcileCompleted
  status: 'True'
  type: Available
- lastHeartbeatTime: '2020-04-01T12:44:30Z'
  lastTransitionTime: '2020-04-01T11:57:50Z'
  message: Reconcile completed successfully
  reason: ReconcileCompleted
  status: 'False'
  type: Progressing

Hence verifying this Bugzilla.
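The verification criterion used here (Available = True and Progressing = False) can be written down as a small check. This is a sketch, not part of HCO or SSP: the function names are made up for illustration, and the two condition lists are trimmed copies of the snapshots quoted in this bug (the failing one from 2020-03-18 and the passing one from 2020-04-01).

```python
from typing import Dict, List


def condition_status(conditions: List[Dict[str, str]], cond_type: str) -> str:
    """Return the status string of the given condition type, or 'Unknown'."""
    for cond in conditions:
        if cond.get("type") == cond_type:
            return cond.get("status", "Unknown")
    return "Unknown"


def is_settled(conditions: List[Dict[str, str]]) -> bool:
    """True when the CR is Available and no longer Progressing."""
    return (
        condition_status(conditions, "Available") == "True"
        and condition_status(conditions, "Progressing") == "False"
    )


# Snapshot from the original report (stuck re-deploy).
before_fix = [
    {"type": "Available", "status": "False",
     "reason": "KubevirtNodeLabellerBundleConditions"},
    {"type": "Progressing", "status": "True",
     "reason": "KubevirtNodeLabellerBundleConditions"},
]

# Snapshot from the verification run.
after_fix = [
    {"type": "Available", "status": "True", "reason": "ReconcileCompleted"},
    {"type": "Progressing", "status": "False", "reason": "ReconcileCompleted"},
]

print(is_settled(before_fix))  # prints False
print(is_settled(after_fix))   # prints True
```

A missing condition is treated as "Unknown" and therefore as not settled, which is the conservative choice for a CR that has not been reconciled yet.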
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2020:2011
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days