Bug 2279328

Summary: Error performing deploy_step write_image: Command execution failed: Unable to stat device /dev/dm-0p2 after attempting to verify 3 times
Product: Red Hat OpenStack Reporter: alisci <alisci>
Component: openstack-tripleo-commonAssignee: Nobody <nobody>
Status: CLOSED ERRATA QA Contact: David Rosenfeld <drosenfe>
Severity: medium Docs Contact:
Priority: medium    
Version: 17.1 (Wallaby)CC: dhill, hjensas, jkreger, jparoly, mariel, mburns, rhos-maint, sbaker, slinaber
Target Milestone: z4Keywords: Triaged
Target Release: 17.1Flags: alisci: needinfo-
jparoly: needinfo-
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openstack-tripleo-common-15.4.1-17.1.20240911100820.e5b18f2.el9ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-11-21 09:32:19 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description alisci 2024-05-06 14:41:43 UTC
Description of problem:
after OSP 17.1 FFU, scaling out the the new nodes fails at ironic with: 

ERROR ironic.conductor.utils [-] Agent returned error for deploy step {'step': 'write_image', 'priority': 80, 'argsinfo': None, 'interface': 'deploy'} on node xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx : Error performing deploy_step write_image: Command execution failed: Unable to stat device /dev/dm-0p2 after attempting to verify 3 times..

Version-Release number of selected component (if applicable):
OSP 17.1.2

HW: Cisco UCS-B200-M5 blades

more details will follow as private comment

Comment 36 Steve Baker 2024-06-24 19:51:20 UTC
This is a bit different to bz #2252076 but the would have had an easier path if the proposed change had been applied.

Comment 43 David Hill 2024-10-08 15:38:08 UTC
guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/vg/lv_root /mnt
cat<<EOF>/mnt/etc/multipath.conf
defaults {
 user_friendly_names  yes
  find_multipaths   yes
   max_fds     8192
    polling_interval  5
     queue_without_daemon no
}
EOF
sed -i 's/no_timer_check/rd.multipath=default no_timer_check/' /mnt/etc/default/grub
umount /mnt

guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/sda3 /mnt
sed -i 's/mkfs_boot/mkfs_boot rd.multipath=default/g' /mnt/loader/entries/*
umount /mnt


qemu-img convert -O raw /tmp/overcloud-hardened-uefi-full.qcow2 /tmp/overcloud-hardened-uefi-full.raw
cp /tmp/overcloud-hardened-uefi-full.* /home/stack/images

find /var/lib/ironic -name \*.converted -exec rm -rf {} \;

source /home/stack/stackrc
openstack overcloud image upload --update-existing --image-path /home/stack/images/

Comment 57 errata-xmlrpc 2024-11-21 09:32:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: RHOSP 17.1.4 (openstack-tripleo-common and python-tripleoclient) security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:9990