Bug 2040540 - ovs hardware offload: ovsargs format error when adding vf netdev name
Summary: ovs hardware offload: ovsargs format error when adding vf netdev name
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 4.10
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 4.10.0
Assignee: zenghui.shi
QA Contact: Ying Wang
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-01-14 01:36 UTC by zenghui.shi
Modified: 2022-03-10 16:39 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-03-10 16:39:13 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift ovn-kubernetes pull 904 0 None open Bug 2040540: Fix String formatting error 2022-01-14 01:41:20 UTC
Red Hat Product Errata RHSA-2022:0056 0 None None None 2022-03-10 16:39:33 UTC

Description zenghui.shi 2022-01-14 01:36:12 UTC
Description of problem:


When ovs hardware offload is enabled, creating sriov pod results in the following ovsargs error:

  Warning  FailedCreatePodSandBox  4m44s  kubelet            Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_ft-client-pod-sriov-86vld_default_1051f065-e947-411d-961f-5bada3ae965b_0(c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3): error adding pod default_ft-client-pod-sriov-86vld to CNI network "multus-cni-network": plugin type="multus" name="multus-cni-network" failed (add): [default/ft-client-pod-sriov-86vld/1051f065-e947-411d-961f-5bada3ae965b:ftnetattach]: error adding container to network "ftnetattach": CNI request failed with status 400: '[default/ft-client-pod-sriov-86vld c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3] [default/ft-client-pod-sriov-86vld c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3] failed to configure pod interface: failure in plugging pod interface: failed to run 'ovs-vsctl --timeout=30 add-port br-int c56bc6fa336c740 other_config:transient=true -- set interface c56bc6fa336c740 external_ids:attached_mac=0a:58:0a:81:02:55 external_ids:iface-id=default_ft-client-pod-sriov-86vld external_ids:iface-id-ver=1051f065-e947-411d-961f-5bada3ae965b external_ids:ip_addresses=10.129.2.85/23 external_ids:sandbox=c56bc6fa336c74022207d27332822d2cd840e7700e4543402b467262f86e8ab3 external_ids:vf-netdev-name=%s ens8f0v1': exit status 1
  "ovs-vsctl: Interface does not contain a column whose name matches \"ens8f0v1\"\n"
  ""
'

Version-Release number of selected component (if applicable):
4.10.0-0.nightly-2022-01-11-065245

ovn-kubernetes: 822ae6d6

How reproducible:
100%

Comment 4 Ying Wang 2022-01-26 12:03:52 UTC
verified on build 4.10.0-0.nightly-2022-01-25-023600. 
Enable offload on cluster and create pods with VF, VF can run well.


# oc describe pod test-pod
Name:         test-pod
Namespace:    offload-testing
Priority:     0
Node:         worker001-r640/192.168.216.14
Start Time:   Wed, 26 Jan 2022 11:56:06 +0000
Labels:       <none>
Annotations:  k8s.ovn.org/pod-networks:
                {"default":{"ip_addresses":["10.131.0.47/23"],"mac_address":"0a:58:0a:83:00:2f","gateway_ips":["10.131.0.1"],"ip_address":"10.131.0.47/23"...
              k8s.v1.cni.cncf.io/network-status:
                [{
                    "name": "offload-testing/default",
                    "interface": "eth0",
                    "ips": [
                        "10.131.0.47"
                    ],
                    "mac": "0a:58:0a:83:00:2f",
                    "default": true,
                    "dns": {}
                }]
              k8s.v1.cni.cncf.io/networks-status:
                [{
                    "name": "offload-testing/default",
                    "interface": "eth0",
                    "ips": [
                        "10.131.0.47"
                    ],
                    "mac": "0a:58:0a:83:00:2f",
                    "default": true,
                    "dns": {}
                }]
              openshift.io/scc: privileged
              v1.multus-cni.io/default-network: offload-testing/default
Status:       Running
IP:           10.131.0.47
IPs:
  IP:  10.131.0.47
Containers:
  appcntr1:
    Container ID:   cri-o://22df6adef394f156866e4f06048d88cab7dda1aa8cf686f45900b1d7820f4b31
    Image:          quay.io/zshi/centos:httpd-iperf
    Image ID:       quay.io/zshi/centos@sha256:e9dc69c848aaef8cb8ac266eba90739a6b17f920698394d2e9a24b1a40ea0af0
    Port:           <none>
    Host Port:      <none>
    State:          Running
      Started:      Wed, 26 Jan 2022 11:56:14 +0000
    Ready:          True
    Restart Count:  0
    Limits:
      openshift.io/mlxnics:  1
    Requests:
      openshift.io/mlxnics:  1
    Environment:             <none>
    Mounts:
      /etc/podnetinfo from podnetinfo (ro)
      /var/run/secrets/kubernetes.io/serviceaccount from kube-api-access-l958z (ro)
Conditions:
  Type              Status
  Initialized       True 
  Ready             True 
  ContainersReady   True 
  PodScheduled      True 
Volumes:
  kube-api-access-l958z:
    Type:                    Projected (a volume that contains injected data from multiple sources)
    TokenExpirationSeconds:  3607
    ConfigMapName:           kube-root-ca.crt
    ConfigMapOptional:       <nil>
    DownwardAPI:             true
    ConfigMapName:           openshift-service-ca.crt
    ConfigMapOptional:       <nil>
  podnetinfo:
    Type:  DownwardAPI (a volume populated by information about the pod)
    Items:
      metadata.annotations -> annotations
QoS Class:       BestEffort
Node-Selectors:  kubernetes.io/hostname=worker001-r640
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type    Reason          Age    From               Message
  ----    ------          ----   ----               -------
  Normal  Scheduled       5m33s  default-scheduler  Successfully assigned offload-testing/test-pod to worker001-r640
  Normal  AddedInterface  5m32s  multus             Add eth0 [10.131.0.47/23] from offload-testing/default
  Normal  Pulling         5m32s  kubelet            Pulling image "quay.io/zshi/centos:httpd-iperf"
  Normal  Pulled          5m26s  kubelet            Successfully pulled image "quay.io/zshi/centos:httpd-iperf" in 6.391881167s
  Normal  Created         5m26s  kubelet            Created container appcntr1
  Normal  Started         5m25s  kubelet            Started container appcntr1

Comment 7 errata-xmlrpc 2022-03-10 16:39:13 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056


Note You need to log in before you can comment on or make changes to this bug.