Bug 1990140
Summary: | Samples operator management Removed failed to contact registry.redhat.io | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Dan Seals <dseals> |
Component: | Samples | Assignee: | Gabe Montero <gmontero> |
Status: | CLOSED ERRATA | QA Contact: | Jitendar Singh <jitsingh> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 4.6 | CC: | adam.kaplan, aos-bugs, xiuwang |
Target Milestone: | --- | ||
Target Release: | 4.9.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: the samples operator performs a quick connection test to registry.redhat.io to help determine if it is in a disconnected environment; it was not setting a connection timeout on that attempt
Consequence: if the underlying environment's socket connection defaults were lengthy enough, it would result in long times for this test to complete, and delay the samples operator completing start up and reporting to the cluster version operator
Fix: samples operator now sets a reasonable connection timeout
Result: There are no longer connection based delays reporting to the cluster version operator
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-10-18 17:44:59 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Dan Seals
2021-08-04 20:28:02 UTC
Full samples operator logs http://pastebin.test.redhat.com/992111 When default set to removed, no connection session again. time="2021-09-07T03:13:44Z" level=info msg="test connection with timeout failed with dial tcp 104.81.144.251:443: i/o timeout" time="2021-09-07T03:14:04Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:14:15Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:14:15Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:14:18Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:14:24Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:14:40Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:14:44Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:15:04Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:15:18Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:15:24Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:15:37Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:15:37Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:15:40Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:15:44Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:16:04Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:16:18Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:16:24Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:16:40Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:16:44Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:17:04Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:17:18Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:17:24Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:17:40Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:17:44Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:18:04Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:18:18Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:18:21Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:18:21Z" level=error msg="unable to sync: config.samples.operator.openshift.io \"cluster\" not found, requeuing" time="2021-09-07T03:18:24Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:18:39Z" level=info msg="test connection with timeout failed with dial tcp 104.105.42.89:443: i/o timeout" time="2021-09-07T03:18:39Z" level=info msg="unable to establish HTTPS connection to registry.redhat.io after 3 minutes, bootstrap to Removed" time="2021-09-07T03:18:39Z" level=info msg="creating default Config" time="2021-09-07T03:18:40Z" level=info msg="metrics sample config retrieval failed with: config.samples.operator.openshift.io \"cluster\" not found" time="2021-09-07T03:18:42Z" level=info msg="Attempting stage 1 Removed management state: RemovePending == true" time="2021-09-07T03:18:42Z" level=info msg="CRDUPDATE process mgmt update spec Removed status " time="2021-09-07T03:18:45Z" level=info msg="management state set to removed so deleting samples" time="2021-09-07T03:18:45Z" level=info msg="Attempting stage 2 Removed management state: Status == Removed" time="2021-09-07T03:18:45Z" level=info msg="CRDUPDATE process mgmt update spec Removed status Removed" time="2021-09-07T03:18:48Z" level=info msg="Attempting stage 3 Removed management state: RemovePending == false" time="2021-09-07T03:18:48Z" level=info msg="CRDUPDATE process mgmt update spec Removed status Removed" Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:3759 |