Bug 1892573 - RHVH 4.4.2 fails to boot from SAN when using UUID for /boot partition
Summary: RHVH 4.4.2 fails to boot from SAN when using UUID for /boot partition
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: redhat-virtualization-host
Version: 4.4.2
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ovirt-4.4.5
: 4.4.5
Assignee: Sandro Bonazzola
QA Contact: cshao
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-10-29 08:37 UTC by Juan Orti
Modified: 2024-06-13 23:18 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-04-14 11:44:48 UTC
oVirt Team: Node
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Knowledge Base (Solution) 5527191 0 None None None 2020-10-29 08:37:41 UTC
Red Hat Product Errata RHSA-2021:1189 0 None None None 2021-04-14 11:48:55 UTC

Description Juan Orti 2020-10-29 08:37:41 UTC
Description of problem:
A fresh installation of RHVH 4.4.2 on an external FC disk fails to boot in the first reboot after Anaconda.

After switching to the real root, the /boot partition fails to mount and the boot process is then stuck forever after failing to start the kdump service.

We made it to boot by using the multipath device instead of the UUID in /etc/fstab for the /boot and /boot/efi filesystems.

If we interrupt the boot process with 'rd.break=pre-pivot', we correctly see the multipath device with 4 healthy paths. Running blkid displays the UUID of the /boot filesystem duplicated for every path.


Version-Release number of selected component (if applicable):
RHVH 4.4.2 (RHVH-4.4-20200722.1-RHVH-x86_64-dvd1.iso)

How reproducible:
It has to be a race condition, because a 10% of the boots using the UUID succeed. The customer has to install dozens of new servers and it happens in all of them.

Steps to Reproduce:
1. Install RHVH 4.4.2 on a FC disk. In Anaconda select the multipath device.
2. Reboot

Actual results:

~~~
[ TIME ] Timed out waiting for device dev-mapper-rhvh\x2dswap.device.
[DEPEND] Dependency failed for Resume from hibernation using device /dev/mapper/rhvh-swap.

[FAILED] Failed to mount /boot.
See 'systemctl status boot.mount' for details.
[DEPEND] Dependency failed for /boot/efi.
[DEPEND] Dependency failed for Local File Systems.

[FAILED] Failed to start Crash recovery kernel arming.
~~~

Expected results:
A system booting without problems.


Additional info:
- We tried to add the options 'x-systemd.device-timeout=0,x-systemd.mount-timeout=0' to /etc/fstab but the mount process hung forever.
- Storage array is NetApp in C-Mode
- HBA using driver qla2xxx: Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb Fibre Channel to PCIe Adapter [1077:2261] (rev 01)

Comment 4 cshao 2020-11-02 03:15:42 UTC
RHVH QE can't reproduce this issue both on automation and manual testing, could you please provide more detail steps? 

Test version:
RHVH-4.4-20200930.0-RHVH-x86_64-dvd1.iso

Test steps:
1. Install RHVH-4.4-20200930.0-RHVH-x86_64-dvd1.iso on FC machine via anaconda.
2. Select the FC disk as boot device.
3. Finish the installation.
4. Reboot and login RHVH.

Test result:
RHVH can boot from FC lun.

Comment 19 Sandro Bonazzola 2021-03-02 08:41:33 UTC
The issue is not reproducible anymore in 4.4.5 latest builds consuming new kernel.

Comment 32 errata-xmlrpc 2021-04-14 11:44:48 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Virtualization security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:1189

Comment 33 errata-xmlrpc 2021-04-14 11:48:30 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Virtualization security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:1189

Comment 34 Red Hat Bugzilla 2023-09-15 00:50:19 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 500 days


Note You need to log in before you can comment on or make changes to this bug.