Bug 1603611
Summary: | [3.10] Could not find an allocated subnet for node | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | N. Harrison Ripps <hripps> |
Component: | Node | Assignee: | Avesh Agarwal <avagarwa> |
Status: | CLOSED ERRATA | QA Contact: | sheng.lao <shlao> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 3.7.1 | CC: | akostadi, aos-bugs, avagarwa, dma, jchaloup, jialiu, jokerman, jupierce, mifiedle, mmccomas, shlao, sjenning, wmeng |
Target Milestone: | --- | Keywords: | Regression |
Target Release: | 3.10.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Recently, cloudResourceSyncManager was implemented which continuously fetched node addresses from cloud providers. Kubelet then received node addresses from the cloudResourceSyncManager. At the time of node registration or kubelet start, kubelet fetches node addresses in a blocking loop from cloudResourceSyncManager. The issue was that cloudResourceSyncManager was not started before kubelet had started fetching node addresses from it for the first time, and due to this, kubelet got stuck in the blocking loop and never returned. It caused node failures at network level, and no node could be registered. Also as kubelet blocked early, the cloudResourceSyncManager never got a chance to start.
Solution: CloudResourceSyncManager is now started early in the kubelet startup process so that kubelet does not get blocked on it and cloudResourceSyncManager is always started.
|
Story Points: | --- |
Clone Of: | 1601378 | Environment: | |
Last Closed: | 2018-08-31 06:18:10 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1601378 | ||
Bug Blocks: | 1538616 |
Comment 1
N. Harrison Ripps
2018-07-19 18:04:24 UTC
The PR is here: https://github.com/openshift/ose/pull/1364 Fix merged in enterprise-3.10 branch: https://github.com/openshift/ose/commit/0f1103b222705421380805acae4c56bb462e9b3f *** Bug 1609056 has been marked as a duplicate of this bug. *** Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:2376 |