Bug 1842733
| Summary: | SR-IOV not working for HPE intel and mellanox NICs | ||||||
|---|---|---|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | vkhanna | ||||
| Component: | Networking | Assignee: | Peng Liu <pliu> | ||||
| Networking sub component: | SR-IOV | QA Contact: | zhaozhanqi <zzhao> | ||||
| Status: | CLOSED DUPLICATE | Docs Contact: | |||||
| Severity: | unspecified | ||||||
| Priority: | unspecified | CC: | bbennett, pliu, vkhanna | ||||
| Version: | 4.4 | ||||||
| Target Milestone: | --- | ||||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2020-06-04 15:19:37 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
vkhanna
2020-06-02 01:21:31 UTC
@Varun Please collect help to collect the logs with script https://github.com/openshift/sriov-network-operator/blob/master/must-gather/collection-scripts/gather. Created attachment 1694615 [details]
must-gather logs
Hi Peng, Please find attached the must-gather output. I noticed that the node did not reboot while I was monitoring the console. Also, the pods get stuck into terminating state. $ oc get pods -n openshift-sriov-network-operator -o wide | grep worker-0 sriov-cni-w6t85 1/1 Terminating 0 10m 10.128.2.5 worker-0.clus0.t5g.lab.eng.rdu2.redhat.com <none> <none> sriov-device-plugin-4d4c7 1/1 Terminating 0 66s 10.1.24.4 worker-0.clus0.t5g.lab.eng.rdu2.redhat.com <none> <none> sriov-network-config-daemon-jbdmx 0/1 Terminating 0 16s 10.1.24.4 worker-0.clus0.t5g.lab.eng.rdu2.redhat.com <none> <none> Hi Varun, From the logs you provided, It looks like you might use a wrong channel when installing the operator. Could you provides the following output? 1. oc get csv -n openshift-sriov-network-operator -o yaml 2. oc get subscription -n openshift-sriov-network-operator -o yaml (The channel shall be "4.4") Maybe you are misled by the 4.4 docs. https://bugzilla.redhat.com/show_bug.cgi?id=1839068 It looks like that I don't have access to the BZ https://bugzilla.redhat.com/show_bug.cgi?id=1839068 As per the instructions in OCP 4.4 doc for sriov operator installation, following command returns a channel value of 4.2 $ oc get packagemanifest sriov-network-operator -n openshift-marketplace -o jsonpath='{.status.channels[].name}' Hardcoding the channel to 4.4 while creating a subscription fixes my issues. Thanks for your time. Setting the target to the current development branch. We can consider backporting a fix once the root cause has been identified. *** This bug has been marked as a duplicate of bug 1839068 *** |