Bug 1834895
Summary: | MCO e2e-gcp-op tests fail consistently on timeouts | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Yu Qi Zhang <jerzhang> |
Component: | Machine Config Operator | Assignee: | Yu Qi Zhang <jerzhang> |
Status: | CLOSED ERRATA | QA Contact: | Michael Nguyen <mnguyen> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.5 | CC: | kgarriso, nmoraiti, pasik, skumari, wking |
Target Milestone: | --- | ||
Target Release: | 4.5.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2020-07-13 17:37:59 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Yu Qi Zhang
2020-05-12 15:40:44 UTC
Also opened: https://bugzilla.redhat.com/show_bug.cgi?id=1835042 Because something seems to have changed along the way and these weird logs have started appearing. Possibly related: https://bugzilla.redhat.com/show_bug.cgi?id=1835368 Wondering if this is somehow related to https://bugzilla.redhat.com/show_bug.cgi?id=1802534 I don't think this needs a doc update. When we tripped over the bug fixed by mco#1731, the downside would be a kubelet-heartbeat (5 minute?) potential delay noticing the new desiredConfig. But eventually that heartbeat (or other node change) would come through and the MCD would notice and apply the desiredConfig. It only bit us because we have tight timeout limits in the e2e suite that customers are unlikely to have in production clusters. Or at least, any customer limits on desiredConfig application that require <5m latencies are already brittle, so doesn't seem worth a doc callout to say "maybe under some conditions we will exceed your overly-strict desuredConfig latency assumptions" or whatever a doc update would look like ;). *** Bug 1817465 has been marked as a duplicate of this bug. *** CI is not failing consistently anymore due to timeouts. Considering this verified as the original issue was reported in CI. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:2409 |