Bug 2279328 - Error performing deploy_step write_image: Command execution failed: Unable to stat device /dev/dm-0p2 after attempting to verify 3 times
Summary: Error performing deploy_step write_image: Command execution failed: Unable to...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-common
Version: 17.1 (Wallaby)
Hardware: Unspecified
OS: Unspecified
Priority: medium
Severity: medium
Target Milestone: z4
Target Release: 17.1
Assignee: Nobody
QA Contact: David Rosenfeld
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2024-05-06 14:41 UTC by alisci
Modified: 2024-11-21 09:32 UTC (History)
CC List: 9 users

Fixed In Version: openstack-tripleo-common-15.4.1-17.1.20240911100820.e5b18f2.el9ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2024-11-21 09:32:19 UTC
Target Upstream Version:
Embargoed:
alisci: needinfo-
jparoly: needinfo-


Links
System                              ID              Private  Priority  Status  Summary  Last Updated
Red Hat Issue Tracker               OSP-32019       0        None      None    None     2024-05-06 14:48:44 UTC
Red Hat Knowledge Base (Solution)   7090774         0        None      None    None     2024-10-09 00:08:07 UTC
Red Hat Product Errata              RHSA-2024:9990  0        None      None    None     2024-11-21 09:32:21 UTC

Description alisci 2024-05-06 14:41:43 UTC
Description of problem:
After the OSP 17.1 FFU, scaling out new nodes fails in Ironic with:

ERROR ironic.conductor.utils [-] Agent returned error for deploy step {'step': 'write_image', 'priority': 80, 'argsinfo': None, 'interface': 'deploy'} on node xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx : Error performing deploy_step write_image: Command execution failed: Unable to stat device /dev/dm-0p2 after attempting to verify 3 times..

Version-Release number of selected component (if applicable):
OSP 17.1.2

HW: Cisco UCS-B200-M5 blades

More details will follow in a private comment.

Comment 36 Steve Baker 2024-06-24 19:51:20 UTC
This is a bit different from bz #2252076, but they would have had an easier path if the proposed change had been applied.

Comment 43 David Hill 2024-10-08 15:38:08 UTC
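# Mount the root logical volume of the image and write a multipath configuration.
guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/vg/lv_root /mnt
cat <<EOF > /mnt/etc/multipath.conf
defaults {
    user_friendly_names  yes
    find_multipaths      yes
    max_fds              8192
    polling_interval     5
    queue_without_daemon no
}
EOF
# Add rd.multipath=default to the kernel arguments in the GRUB defaults.
sed -i 's/no_timer_check/rd.multipath=default no_timer_check/' /mnt/etc/default/grub
umount /mnt

# Mount /dev/sda3 (it holds the boot loader entries) and add the same argument there.
guestmount -a /tmp/overcloud-hardened-uefi-full.qcow2 -m /dev/sda3 /mnt
sed -i 's/mkfs_boot/mkfs_boot rd.multipath=default/g' /mnt/loader/entries/*
umount /mnt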


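# Convert the modified image to raw and copy both formats to the images directory.
qemu-img convert -O raw /tmp/overcloud-hardened-uefi-full.qcow2 /tmp/overcloud-hardened-uefi-full.raw
cp /tmp/overcloud-hardened-uefi-full.* /home/stack/images

# Remove previously converted images under /var/lib/ironic so that stale copies are not reused.
find /var/lib/ironic -name \*.converted -exec rm -rf {} \;

# Upload the updated image, replacing the existing one.
source /home/stack/stackrc
openstack overcloud image upload --update-existing --image-path /home/stack/images/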
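
Note on re-running the steps above: the two sed substitutions are not idempotent, so applying them to an image that was already modified would add rd.multipath=default a second time. A minimal sketch of a guard follows. The helper name add_kernel_arg_once is hypothetical and is not part of the original workaround or of any Red Hat tooling. It assumes GNU sed and that the anchor and argument contain no "/" or "&" characters.

```shell
#!/bin/sh
# add_kernel_arg_once FILE ANCHOR ARG
# Hypothetical helper (an assumption, not from the bug report):
# inserts ARG in front of the first ANCHOR on each matching line of FILE,
# but only when ARG does not already appear anywhere in FILE.
add_kernel_arg_once() {
    file=$1
    anchor=$2
    arg=$3

    # -F treats ARG as a fixed string, so the "." in
    # rd.multipath=default is not interpreted as a regex wildcard.
    if grep -qF -- "$arg" "$file"; then
        return 0
    fi

    # Same substitution shape as the sed on /mnt/etc/default/grub above.
    sed -i "s/$anchor/$arg $anchor/" "$file"
}

# Example (with the image mounted on /mnt as in the steps above):
#   add_kernel_arg_once /mnt/etc/default/grub no_timer_check rd.multipath=default
```

Calling the helper twice on the same file leaves a single copy of the argument. The boot loader entries under /mnt/loader/entries would need the same kind of check, applied once per file.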

Comment 57 errata-xmlrpc 2024-11-21 09:32:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: RHOSP 17.1.4 (openstack-tripleo-common and python-tripleoclient) security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:9990

