Description of problem: after OSP 17.1 FFU, scaling out the the new nodes fails at ironic with: ERROR ironic.conductor.utils [-] Agent returned error for deploy step {'step': 'write_image', 'priority': 80, 'argsinfo': None, 'interface': 'deploy'} on node xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx : Error performing deploy_step write_image: Command execution failed: Unable to stat device /dev/dm-0p2 after attempting to verify 3 times.. Version-Release number of selected component (if applicable): OSP 17.1.2 HW: Cisco UCS-B200-M5 blades more details will follow as private comment
This is a bit different to bz #2252076 but the would have had an easier path if the proposed change had been applied.
guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/vg/lv_root /mnt cat<<EOF>/mnt/etc/multipath.conf defaults { user_friendly_names yes find_multipaths yes max_fds 8192 polling_interval 5 queue_without_daemon no } EOF sed -i 's/no_timer_check/rd.multipath=default no_timer_check/' /mnt/etc/default/grub umount /mnt guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/sda3 /mnt sed -i 's/mkfs_boot/mkfs_boot rd.multipath=default/g' /mnt/loader/entries/* umount /mnt qemu-img convert -O raw /tmp/overcloud-hardened-uefi-full.qcow2 /tmp/overcloud-hardened-uefi-full.raw cp /tmp/overcloud-hardened-uefi-full.* /home/stack/images find /var/lib/ironic -name \*.converted -exec rm -rf {} \; source /home/stack/stackrc openstack overcloud image upload --update-existing --image-path /home/stack/images/
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: RHOSP 17.1.4 (openstack-tripleo-common and python-tripleoclient) security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2024:9990