Bug 2054896
| Summary: | worker times out during inspection under 4.10 | ||||||
|---|---|---|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | tonyg | ||||
| Component: | Bare Metal Hardware Provisioning | Assignee: | Tomas Sedovic <tsedovic> | ||||
| Bare Metal Hardware Provisioning sub component: | ironic | QA Contact: | Amit Ugol <augol> | ||||
| Status: | CLOSED NOTABUG | Docs Contact: | |||||
| Severity: | unspecified | ||||||
| Priority: | unspecified | CC: | bfournie, josearod, manrodri, nsilla, tkrishto, yliu1 | ||||
| Version: | 4.10 | ||||||
| Target Milestone: | --- | ||||||
| Target Release: | --- | ||||||
| Hardware: | x86_64 | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2022-03-01 17:09:50 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
tonyg
2022-02-15 22:34:40 UTC
Are you able to attach to the console when the error occurs? Is it possible to get a screen shot so we can see what state the host is in? Thanks. Hi, If it helps, we found a mismatch in the MAC address for the provisioning interface configured in the inventory and the actual value. This resulted in some error messages in the console during the bootstrap phase: ``` Attempting to boot from MAC f4-03-43-d0-72-c0 pxelinux.cfg/f4-03-43-d0-72-c0... No such file or directory (http://ipxe.org/2d0c618e) pxelinux.cfg/f4-03-43-d0-72-c0... No such file or directory (http://ipxe.org/2d0c618e) ``` We're currently verifying if setting the right address has any effect on the result. In any case this hasn't been a problem with versions prior to 4.10 and even with older 4.10 RCs. Is f4-03-43-d0-72-c0 the correct mac? Is it the same physical worker that fails all of the time? From the screen shot it looks like it booted off of the disk which would happen if the IPA doesn't boot in the case of an invalid mac, for example. Please let us know the results with this change, its curious that this just starting happening with 4.10 even with the same config. Yes, f4-03-43-d0-72-c0 is the correct MAC. And for now it has been only this node, in previous deployments with version < 4.10 has been working fine even with the wrong MAC. We have not seen this issue anymore once we set the correct MAC, I guess prior versions it was not considered as it is 4.10+. Thanks for checking, I think this can be closed now. Thanks Tony. Its still unexplained as to why this worked < 4.10 as we would have expected a config issue to cause a problem. We'll close this out. Feel free to open a lower priority bug for the 4.9 issue. |