1949677 – multus is the first pod on a new node and the last to go ready

Bug 1949677 - multus is the first pod on a new node and the last to go ready

Summary: multus is the first pod on a new node and the last to go ready

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Networking
Sub Component:
Version:	4.8
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	medium
Target Milestone:	---
Target Release:	4.8.0
Assignee:	Douglas Smith
QA Contact:	Weibin Liang
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2021-04-14 19:00 UTC by Douglas Smith
Modified:	2021-07-27 23:01 UTC (History)
CC List:	0 users
Fixed In Version:
Doc Type:	Enhancement
Doc Text:	Feature: Splits CNI plugin installation into a separate daemonset, to not block node readiness when pulling a lot of images. Reason: Node readiness takes a long time due to the long image pull times. Result: Node readiness happens faster.
Clone Of:
Environment:
Last Closed:	2021-07-27 23:00:48 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	openshift cluster-network-operator pull 1058	0	None	open	Bug 1949677: The Multus daemonset should be separate other CNI binary installation	2021-04-14 22:18:33 UTC
Red Hat Product Errata	RHSA-2021:2438	0	None	None	None	2021-07-27 23:01:03 UTC

Description Douglas Smith 2021-04-14 19:00:07 UTC

Description of problem: (From Clayton)  multus is the first pod on a new node and the last to go ready because it pulls 5 images.  it's taking about 1m15s.  that means nodes go ready slowly because of multus. (it's the longest pole).  the time is because init container images are pulled sequentially


In order to address this, the daemonset for Multus should be separate from the ancillary CNI binary installation.

Comment 3 Weibin Liang 2021-06-02 15:35:12 UTC

Tested and verified in 4.8.0-0.nightly-2021-06-02-025513

[weliang@weliang ~]$ oc get all
NAME                                      READY   STATUS    RESTARTS   AGE
pod/multus-additional-cni-plugins-5bln8   1/1     Running   0          128m
pod/multus-additional-cni-plugins-6k2qb   1/1     Running   0          120m
pod/multus-additional-cni-plugins-897zr   1/1     Running   0          128m
pod/multus-additional-cni-plugins-f7bnl   1/1     Running   0          119m
pod/multus-additional-cni-plugins-w92k7   1/1     Running   0          128m
pod/multus-additional-cni-plugins-z4r5p   1/1     Running   0          119m
pod/multus-admission-controller-bvzhl     2/2     Running   0          125m
pod/multus-admission-controller-jtvhb     2/2     Running   0          125m
pod/multus-admission-controller-kdt58     2/2     Running   0          125m
pod/multus-cjf52                          1/1     Running   0          128m
pod/multus-g7bsg                          1/1     Running   0          128m
pod/multus-lkzmf                          1/1     Running   0          119m
pod/multus-mbfhr                          1/1     Running   0          120m
pod/multus-q67t5                          1/1     Running   0          119m
pod/multus-rs7sg                          1/1     Running   0          128m
pod/network-metrics-daemon-2qgss          2/2     Running   0          120m
pod/network-metrics-daemon-cwxfj          2/2     Running   0          128m
pod/network-metrics-daemon-ghs92          2/2     Running   0          128m
pod/network-metrics-daemon-p6h2d          2/2     Running   0          128m
pod/network-metrics-daemon-sj6zz          2/2     Running   0          119m
pod/network-metrics-daemon-wnkvk          2/2     Running   0          119m

NAME                                  TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)            AGE
service/multus-admission-controller   ClusterIP   172.30.173.199   <none>        443/TCP,8443/TCP   128m
service/network-metrics-service       ClusterIP   None             <none>        8443/TCP           128m

NAME                                           DESIRED   CURRENT   READY   UP-TO-DATE   AVAILABLE   NODE SELECTOR                     AGE
daemonset.apps/multus                          6         6         6       6            6           kubernetes.io/os=linux            128m
daemonset.apps/multus-additional-cni-plugins   6         6         6       6            6           kubernetes.io/os=linux            128m
daemonset.apps/multus-admission-controller     3         3         3       3            3           node-role.kubernetes.io/master=   128m
daemonset.apps/network-metrics-daemon          6         6         6       6            6           kubernetes.io/os=linux            128m
[weliang@weliang ~]$

Comment 6 errata-xmlrpc 2021-07-27 23:00:48 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438

Note You need to log in before you can comment on or make changes to this bug.