Bug 2003788
Summary: | CSR reconciler report error constantly when BYOH CSR approved by other Approver | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Joel Speed <jspeed> |
Component: | Cloud Compute | Assignee: | Joel Speed <jspeed> |
Cloud Compute sub component: | Other Providers | QA Contact: | Milind Yadav <miyadav> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | medium | ||
Priority: | medium | CC: | aos-bugs, mankulka, mohashai, sgao, team-winc |
Version: | 4.9 | ||
Target Milestone: | --- | ||
Target Release: | 4.10.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | |
Clone Of: | 2002961 | Environment: | |
Last Closed: | 2022-03-10 16:10:01 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Comment 2
Milind Yadav
2021-10-07 03:14:41 UTC
Adding more logs for comment#2 (windows csrs)- [miyadav@miyadav ~]$ oc get csr NAME AGE SIGNERNAME REQUESTOR REQUESTEDDURATION CONDITION csr-72257 60m kubernetes.io/kubelet-serving system:node:ip-10-0-223-46.us-east-2.compute.internal <none> Approved,Issued csr-778f6 52m kubernetes.io/kubelet-serving system:node:ip-10-0-142-79.us-east-2.compute.internal <none> Approved,Issued csr-kskdc 65m kubernetes.io/kube-apiserver-client-kubelet system:node:ip-10-0-170-225.us-east-2.compute.internal <none> Approved,Issued csr-qkvsm 61m kubernetes.io/kubelet-serving system:node:ip-10-0-159-91.us-east-2.compute.internal <none> Approved,Issued csr-shnn2 77m kubernetes.io/kube-apiserver-client-kubelet system:node:ip-10-0-150-207.us-east-2.compute.internal <none> Approved,Issued csr-wjbnm 78m kubernetes.io/kube-apiserver-client-kubelet system:node:ip-10-0-175-245.us-east-2.compute.internal <none> Approved,Issued oc logs deployment.apps/machine-approver machine-approver-controller -n openshift-cluster-machine-approver https://privatebin-it-iso.int.open.paas.redhat.com/?ee2d69bec6c1c7be#DAju7NuwFJ4tnyEjUVt8mwszsuQStuhuoLfL3aPn1hKk Another windows worker csr logs - https://privatebin-it-iso.int.open.paas.redhat.com/?93211568b74d08f8#HJ6bzDP4SLy1HA7zbifUqAGPLv6hPVHzHs24c7an56uQ I can't see any issues within the logs, but I'm not sure it actually executed the code path. Reproducing this issue is very very difficult as we need to queue the CSR and then have it approved. I'm not sure if it's worth our time trying to hunt an explicit event where this happens. It's likely to be sporadic and needs the WMCO and CSR approver running simultaneously and then it might happen when we add a new windows machine Thanks @jspeed for review , Also , need input on debug messages .. I mean granuality of messages in logs - https://privatebin-it-iso.int.open.paas.redhat.com/?341fab9d51cc2ac9#GdwgbQWWHEosBxcHwEfLEGrLdbuGqpnuF4GJAYWyvq93 . Those logs are part of the WMCO, probably best to check with them about how much they are logging. I don't think that necessarily affects this bug, but might be worth asking on https://bugzilla.redhat.com/show_bug.cgi?id=2002961 in which the WMCO team are solving the same sort of issue as here Thanks Joel and Mansi for comments , moving to VERIFIED. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:0056 |