| Summary: | [RHEL-6.8] Determining IP information for ocrdma_roce.43... failed | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | zguo <zguo> | ||||
| Component: | libocrdma | Assignee: | Jarod Wilson <jarod> | ||||
| Status: | CLOSED WONTFIX | QA Contact: | Zhang Yi <yizhan> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 6.8 | CC: | ddutile, honli, jshortt, mschmidt, mstowell | ||||
| Target Milestone: | rc | ||||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2017-12-06 12:05:47 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Attachments: |
|
||||||
|
Description
zguo
2016-03-01 08:51:48 UTC
Hi developers, Would you like to help take a look? Thanks. you attached an rds-tools-qib.log file. I referenced the beaker log file to see relevant errors -- I see ocrdma_roce fail, but I also see rds stack traces in the log as well. Could you (re-)test with the -616 kernel. 4 patches were added to -617 for ocrdma, and I just want to eliminate those 4 separate patches as the source of the errors. Also, did this test pass in 6.7 ? I had already told all of you about this issue. The DHCP/VLAN/REORDER_HDR combination is the culprit. 1) comment VLAN_ID=43, it will take more than 80 seconds for roce.43 to get dhcp IP. 4 out of 5 succeed to get IP. 2) comment REORDER_HDR=0 The first round of "ifup ocrdma_roce.43" got IP in 5 second. However, the second round hang on and never exit. Ctrl+C killed it after 13 minutes. If you ifdown/ifup the vlan interface in a for loop, dhcp will not work at all with different error messages. So, the temporary workaround is to comment REORDER_HDR=0, and never abuse the vlan interface with ifup/ifdown. (In reply to Honggang LI from comment #3) > I had already told all of you about this issue. > > The DHCP/VLAN/REORDER_HDR combination is the culprit. > > 1) comment VLAN_ID=43, it will take more than 80 seconds for roce.43 > to get dhcp IP. 4 out of 5 succeed to get IP. > > 2) comment REORDER_HDR=0 > The first round of "ifup ocrdma_roce.43" got IP in 5 second. However, > the second round hang on and never exit. Ctrl+C killed it after 13 minutes. > If you ifdown/ifup the vlan interface in a for loop, dhcp will not work > at all with different error messages. > > So, the temporary workaround is to comment REORDER_HDR=0, and never > abuse the vlan interface with ifup/ifdown. I digged out the email Honggang sent before. We need "uncomment REORDER_HDR=0", and "uncomment VLAN_ID=43" in the /etc/sysconfig/network-scripts/ifcfg-ocrdma_roce.XX. I just tried this workaround and it works. Red Hat Enterprise Linux 6 is in the Production 3 Phase. During the Production 3 Phase, Critical impact Security Advisories (RHSAs) and selected Urgent Priority Bug Fix Advisories (RHBAs) may be released as they become available. The official life cycle policy can be reviewed here: http://redhat.com/rhel/lifecycle This issue does not meet the inclusion criteria for the Production 3 Phase and will be marked as CLOSED/WONTFIX. If this remains a critical requirement, please contact Red Hat Customer Support to request a re-evaluation of the issue, citing a clear business justification. Note that a strong business justification will be required for re-evaluation. Red Hat Customer Support can be contacted via the Red Hat Customer Portal at the following URL: https://access.redhat.com/ |