Bug 1822202 - vfio-pci driver cannot be loaded to VF if the config daemon is interrupt during draining node
Summary: vfio-pci driver cannot be loaded to VF if the config daemon is interrupt duri...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 4.5
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: 4.5.0
Assignee: Peng Liu
QA Contact: zhaozhanqi
URL:
Whiteboard:
Depends On:
Blocks: 1822543
TreeView+ depends on / blocked
 
Reported: 2020-04-08 13:36 UTC by Peng Liu
Modified: 2020-07-13 17:26 UTC (History)
0 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 1822543 1822546 (view as bug list)
Environment:
Last Closed: 2020-07-13 17:26:16 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift sriov-network-operator pull 179 0 None closed Bug 1822202: Check the /proc/cmdline to see if a reboot is needed for enabling iommu 2020-06-23 09:54:03 UTC
Red Hat Product Errata RHBA-2020:2409 0 None None None 2020-07-13 17:26:47 UTC

Description Peng Liu 2020-04-08 13:36:41 UTC
Description of problem:
vfio-pci driver cannot be loaded to VF if the config daemon is interrupt during draining node

Version-Release number of selected component (if applicable):
4.5

How reproducible:
A freshly installed worker node, which hasn't been configured with any vfio-pci driver.


Steps to Reproduce:
1. Apply following SriovNetworkNodePolicy CR with driverType vfio-pci.
---
apiVersion: sriovnetwork.openshift.io/v1
kind: SriovNetworkNodePolicy
metadata:
  name: policy-mlx-1
spec:
  resourceName: nicmlx1
  nodeSelector:
    feature.node.kubernetes.io/network-sriov.capable: "true"
  priority: 99
  # isRdma: true
  numVfs: 4
  nicSelector:
    vendor: "15b3"
    pfNames: ['ens785f0#0-2']
    rootDevices: ['0000:18:00.0']
  deviceType: "vfio-pci"
2. Kill the sriov-network-config-daemon pod when it starts to drain the node. 
3. 

Actual results:
The node reboot is not triggered. sriov-network-config-daemon pod keeps restarting, and the vfio-pci driver cannot be bind to the VFs.

Expected results:
The node shall reboot, and the vfio-pci driver shall be bind to the VFs successfully.

Additional info:

Comment 4 errata-xmlrpc 2020-07-13 17:26:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:2409


Note You need to log in before you can comment on or make changes to this bug.