Bug 2061278
Summary: | [IPI] OCP-4.10 baremetal - boot partition is not mounted on temporary directory | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Derek Higgins <derekh> |
Component: | Bare Metal Hardware Provisioning | Assignee: | Derek Higgins <derekh> |
Bare Metal Hardware Provisioning sub component: | ironic | QA Contact: | Lubov <lshilin> |
Status: | CLOSED DUPLICATE | Docs Contact: | |
Severity: | high | ||
Priority: | high | CC: | andbartl, asalvati, augol, bfournie, bmuchiny, eglottma, gvillani, josearod, lshilin, manrodri, mcornea, openshift-bugs-escalate, skrenger, snetting, tonyg |
Version: | 4.10 | Keywords: | AutomationBlocker, Regression, Triaged |
Target Milestone: | --- | ||
Target Release: | 4.10.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | 2053752 | Environment: | |
Last Closed: | 2022-05-25 08:13:20 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 2053752 | ||
Bug Blocks: |
Description
Derek Higgins
2022-03-07 09:13:57 UTC
verification failed on 4.10.0-0.nightly-2022-04-30-165345 msg="Error: cannot go from state 'deploy failed' to state 'manageable' , last error was 'Deploy step deploy.install_coreos failed on node 0e28efd2-6885-4c01-af13-aa3bee3d6031. Could not verify uefi on device /dev/sda, failed with Unexpected error while running command." time="2022-05-01T14:44:22+03:00" level=error msg="Command: mount /dev/sda2 /tmp/tmph8ib8cp0/boot/efi" here is must-gather https://s3.upshift.redhat.com/DH-PROD-OCP-EDGE-QE-CI/Infra/must-gather/1042/index.html bootstrap log bundle: http://rhos-compute-node-10.lab.eng.rdu2.redhat.com/logs/BZ2061278_log-bundle-bootstrap.tar.gz Looking at the mustgather I can see that the patch was used and failed to fix the problem (although It did seem to work on 4.11, maybe due to different hardware), so we need to do more. I had another PR pushed upstream here https://github.com/metal3-io/ironic-agent-image/pull/19 This adds more retries with some sleep time between them So we need to merge this, sync it downstream, test on the same hardware and assuming it fixes the problem then backport it to 4.10 (In reply to Derek Higgins from comment #11) > Looking at the mustgather I can see that the patch was used and failed to > fix the problem > (although It did seem to work on 4.11, maybe due to different hardware), > so we need to do more. > > I had another PR pushed upstream here > https://github.com/metal3-io/ironic-agent-image/pull/19 > This adds more retries with some sleep time between them > > So we need to merge this, sync it downstream, test on the same hardware and > assuming it fixes the problem then backport it to 4.10 we had only one setup, where this problem happens. I tested fix for 4.11 on the same setup. Hope, that this failure is not happens from time to time now after the first fix Both bz 2053752 and 2061278 are refering to the same bug in 4.10 closing this one *** This bug has been marked as a duplicate of bug 2053752 *** |