Description of problem:

When OVS hardware offload is enabled, creating an SR-IOV pod fails with the following ovs-vsctl arguments error:

Warning FailedCreatePodSandBox 4m44s kubelet Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_ft-client-pod-sriov-86vld_default_1051f065-e947-411d-961f-5bada3ae965b_0(c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3): error adding pod default_ft-client-pod-sriov-86vld to CNI network "multus-cni-network": plugin type="multus" name="multus-cni-network" failed (add): [default/ft-client-pod-sriov-86vld/1051f065-e947-411d-961f-5bada3ae965b:ftnetattach]: error adding container to network "ftnetattach": CNI request failed with status 400: '[default/ft-client-pod-sriov-86vld c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3] [default/ft-client-pod-sriov-86vld c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3] failed to configure pod interface: failure in plugging pod interface: failed to run 'ovs-vsctl --timeout=30 add-port br-int c56bc6fa336c740 other_config:transient=true -- set interface c56bc6fa336c740 external_ids:attached_mac=0a:58:0a:81:02:55 external_ids:iface-id=default_ft-client-pod-sriov-86vld external_ids:iface-id-ver=1051f065-e947-411d-961f-5bada3ae965b external_ids:ip_addresses=10.129.2.85/23 external_ids:sandbox=c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3 external_ids:vf-netdev-name=%s ens8f0v1': exit status 1 "ovs-vsctl: Interface does not contain a column whose name matches \"ens8f0v1\"\n" "" '

Version-Release number of selected component (if applicable):

4.10.0-0.nightly-2022-01-11-065245
ovn-kubernetes: 822ae6d6

How reproducible:

100%
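The tail of the failing command, external_ids:vf-netdev-name=%s ens8f0v1, contains a literal %s followed by the VF netdev name as a separate token. That pattern suggests the placeholder was never substituted and the value was appended as its own argument, which ovs-vsctl then appears to parse as a column name. The following is a minimal sketch of that failure mode, not the actual ovn-kubernetes code; the function names are hypothetical and exist only for illustration.

```python
def build_set_args_buggy(iface, vf_netdev):
    """Hypothetical reconstruction of the bug: the format string is passed
    through unformatted and the value is appended as a separate argument."""
    return ["set", "interface", iface,
            "external_ids:vf-netdev-name=%s", vf_netdev]


def build_set_args_fixed(iface, vf_netdev):
    """The value is substituted into the key=value token before it is
    handed to ovs-vsctl, so it stays a single argument."""
    return ["set", "interface", iface,
            "external_ids:vf-netdev-name=%s" % vf_netdev]


def unformatted_placeholders(args):
    """Return every argument that still contains a raw %s placeholder."""
    return [a for a in args if "%s" in a]


buggy = build_set_args_buggy("c56bc6fa336c740", "ens8f0v1")
fixed = build_set_args_fixed("c56bc6fa336c740", "ens8f0v1")

print(" ".join(buggy))
print(" ".join(fixed))
print(unformatted_placeholders(buggy))
print(unformatted_placeholders(fixed))
```

A check like unformatted_placeholders could be used in a unit test to catch this class of mistake before the command reaches ovs-vsctl.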
Verified on build 4.10.0-0.nightly-2022-01-25-023600. With offload enabled on the cluster, pods created with a VF start and run successfully.

# oc describe pod test-pod
Name:         test-pod
Namespace:    offload-testing
Priority:     0
Node:         worker001-r640/192.168.216.14
Start Time:   Wed, 26 Jan 2022 11:56:06 +0000
Labels:       <none>
Annotations:  k8s.ovn.org/pod-networks:
                {"default":{"ip_addresses":["10.131.0.47/23"],"mac_address":"0a:58:0a:83:00:2f","gateway_ips":["10.131.0.1"],"ip_address":"10.131.0.47/23"...
              k8s.v1.cni.cncf.io/network-status:
                [{
                    "name": "offload-testing/default",
                    "interface": "eth0",
                    "ips": [
                        "10.131.0.47"
                    ],
                    "mac": "0a:58:0a:83:00:2f",
                    "default": true,
                    "dns": {}
                }]
              k8s.v1.cni.cncf.io/networks-status:
                [{
                    "name": "offload-testing/default",
                    "interface": "eth0",
                    "ips": [
                        "10.131.0.47"
                    ],
                    "mac": "0a:58:0a:83:00:2f",
                    "default": true,
                    "dns": {}
                }]
              openshift.io/scc: privileged
              v1.multus-cni.io/default-network: offload-testing/default
Status:       Running
IP:           10.131.0.47
IPs:
  IP:  10.131.0.47
Containers:
  appcntr1:
    Container ID:   cri-o://22df6adef394f156866e4f06048d88cab7dda1aa8cf686f45900b1d7820f4b31
    Image:          quay.io/zshi/centos:httpd-iperf
    Image ID:       quay.io/zshi/centos@sha256:e9dc69c848aaef8cb8ac266eba90739a6b17f920698394d2e9a24b1a40ea0af0
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Wed, 26 Jan 2022 11:56:14 +0000
    Ready:          True
    Restart Count:  0
    Limits:
      openshift.io/mlxnics:  1
    Requests:
      openshift.io/mlxnics:  1
    Environment:             <none>
    Mounts:
      /etc/podnetinfo from podnetinfo (ro)
      /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-l958z (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             True
  ContainersReady   True
  PodScheduled      True
Volumes:
  kube-api-access-l958z:
    Type:                    Projected (a volume that contains injected data from multiple sources)
    TokenExpirationSeconds:  3607
    ConfigMapName:           kube-root-ca.crt
    ConfigMapOptional:       <nil>
    DownwardAPI:             true
    ConfigMapName:           openshift-service-ca.crt
    ConfigMapOptional:       <nil>
  podnetinfo:
    Type:  DownwardAPI (a volume populated by information about the pod)
    Items:
      metadata.annotations -> annotations
QoS Class:       BestEffort
Node-Selectors:  kubernetes.io/hostname=worker001-r640
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type    Reason          Age    From               Message
  ----    ------          ----   ----               -------
  Normal  Scheduled       5m33s  default-scheduler  Successfully assigned offload-testing/test-pod to worker001-r640
  Normal  AddedInterface  5m32s  multus             Add eth0 [10.131.0.47/23] from offload-testing/default
  Normal  Pulling         5m32s  kubelet            Pulling image "quay.io/zshi/centos:httpd-iperf"
  Normal  Pulled          5m26s  kubelet            Successfully pulled image "quay.io/zshi/centos:httpd-iperf" in 6.391881167s
  Normal  Created         5m26s  kubelet            Created container appcntr1
  Normal  Started         5m25s  kubelet            Started container appcntr1
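For anyone repeating this verification, the k8s.v1.cni.cncf.io/network-status annotation can also be checked programmatically rather than by eye. The sketch below is illustrative only: it parses the annotation value shown in the output above and picks out the default network entry. The helper name default_network_status is hypothetical, and the sketch assumes the annotation value has already been retrieved as a string.

```python
import json


def default_network_status(annotation_value):
    """Return the entry marked "default": true in a network-status
    annotation value, or None if there is no such entry."""
    for entry in json.loads(annotation_value):
        if entry.get("default"):
            return entry
    return None


# Annotation value copied from the oc describe output in this comment.
status_json = """[{
    "name": "offload-testing/default",
    "interface": "eth0",
    "ips": [ "10.131.0.47" ],
    "mac": "0a:58:0a:83:00:2f",
    "default": true,
    "dns": {}
}]"""

entry = default_network_status(status_json)
print(entry["name"], entry["interface"], entry["ips"], entry["mac"])
```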
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:0056