Bug 1601749 - [3.6] Could not find an allocated subnet for node
Summary: [3.6] Could not find an allocated subnet for node
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Node
Version: 3.6.1
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 3.6.z
Assignee: Avesh Agarwal
QA Contact: Weihua Meng
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-07-17 07:42 UTC by Weihua Meng
Modified: 2018-08-09 22:10 UTC (History)
12 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Recently, cloudResourceSyncManager was implemented which continuously fetched node addresses from cloud providers. Kubelet then received node addresses from the cloudResourceSyncManager. At the time of node registration or kubelet start, kubelet fetches node addresses in a blocking loop from cloudResourceSyncManager. The issue was that cloudResourceSyncManager was not started before kubelet had started fetching node addresses from it for the first time, and due to this, kubelet got stuck in the blocking loop and never returned. It caused node failures at network level, and no node could be registered. Also as kubelet blocked early, the cloudResourceSyncManager never got a chance to start. Solution: CloudResourceSyncManager is now started early in the kubelet startup process so that kubelet does not get blocked on it and cloudResourceSyncManager is always started.
Clone Of: 1601378
Environment:
Last Closed: 2018-08-09 22:10:03 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1583129 0 urgent CLOSED [3.6] Error creating subnet for node - just error message in logs - OCP on Azure 2021-02-22 00:41:40 UTC
Red Hat Product Errata RHBA-2018:2339 0 None None None 2018-08-09 22:10:33 UTC

Internal Links: 1583129

Comment 1 Weihua Meng 2018-07-17 09:57:46 UTC
FYI.
https://bugzilla.redhat.com/show_bug.cgi?id=1583129#c62

Comment 2 Avesh Agarwal 2018-07-19 16:09:34 UTC
https://github.com/openshift/ose/pull/1360 is merged so moving it to modified.

Comment 6 Weihua Meng 2018-08-01 09:07:24 UTC
Fixed.

atomic-openshift-3.6.173.0.128-1.git.0.8da0828.el7.x86_64

openshift v3.6.173.0.128

Kernel Version: 3.10.0-862.9.1.el7.x86_64
Operating System: Red Hat Enterprise Linux Server 7.5 (Maipo)

Comment 8 errata-xmlrpc 2018-08-09 22:10:03 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2339


Note You need to log in before you can comment on or make changes to this bug.