Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2091467

Summary: Does bare-metal support multi-path configuration of the boot area?
Product: OpenShift Container Platform Reporter: ryoji noma <rnoma>
Component: RHCOSAssignee: RHCOS Bug Triage <rhcos-triage>
Status: CLOSED NOTABUG QA Contact: Michael Nguyen <mnguyen>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 4.7CC: dornelas, jlebon, jligon, jsafrane, mrussell, nstielau
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-05-30 13:47:50 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description ryoji noma 2022-05-30 02:19:41 UTC
Description of problem:

Does bare-metal support multi-path configuration of the boot area?

The release notes say that multipath can support primary disks from 4.7, but I'm not sure if "primary disk" also refers to the boot device or not.

https://docs.openshift.com/container-platform/4.7/release_notes/ocp-4-7-release-notes.html#ocp-4-7-rhcos-supports-multipath

The following BZ linked in the above release notes appears to be mlutipath for the sysroot (root) area. In other words, multipath support seems to be only for the root area.

Bug 1886229 - Multipath support for RHCOS sysroot
https://bugzilla.redhat.com/show_bug.cgi?id=1886229

On the other hand, RFE shows boot and additional partition.

[RFE] Enable multipath for boot and additional partition using SAN on RHCOS
https://issues.redhat.com/browse/GRPA-1721

The configuration example in the documentation sets the kernel parameter 'root=/dev/disk/by-label/dm-mpath-root', but is it possible to change root to boot, for example?

Enabling multipathing with kernel arguments on RHCOS
https://docs.openshift.com/container-platform/4.7/post_installation_configuration/machine-configuration-tasks.html#rhcos-enabling-multipath_post-install-machine-configuration-tasks

What exactly is meant by 'primary disk' and can multipath be used for the boot area?

Version-Release number of selected component (if applicable):


Expected results:

Please clarify which area's multipath configuration is a supported scenario and include it in the documentation.

Comment 1 Jonathan Lebon 2022-05-30 13:47:50 UTC
The primary disk refers to the boot disk, which includes the boot partition and normally also the root partition (unless the user is using Ignition to move the rootfs onto another disk, which is unusual). So the instructions available in the documentation does cover both the boot and root filesystems.

It is also possible to enable multipath for secondary disks, though we do not yet have official documentation for this. There is an RHBZ somewhere requesting that documentation which I can't find right now.

Comment 2 ryoji noma 2022-06-01 06:24:03 UTC
Thanks, Jonathan

I understand that the documentation setup covers both the boot and root areas. Thank you very much.

I am having trouble with multipath conversion in a customer environment. Currently there is no output in multipath -ll and I assume that the devices via multipath are not visible.

Comment 3 Jonathan Lebon 2022-06-03 17:24:48 UTC
Hi Ryoji,

I had a look at the customer case and noticed references to `/dev/mapper/coreos-luks-root-nocrypt`. This indicates that they're working with hosts installed from RHCOS 4.6 (or earlier) images. Multipathing is supported starting from RHCOS 4.7, where the machines have to be installed from RHCOS 4.7 images. If this is a cluster installation, they need to make sure they're using the right RHCOS 4.7 images. If this is a cluster born before 4.7, multipath cannot be enabled on existing nodes, but it'd be possible to enable it on new nodes if they update the boot installation media used to 4.7 (but then, they'd have a mix of multipathed and non-multipathed nodes, which isn't great).

Comment 4 ryoji noma 2022-06-06 07:05:48 UTC
Thanks, Jonathan

The customer has upgraded OpenShift to 4.7, and the RHCOS version may have been out of date when the cluster was installed.
If that is the case, does that mean that the node must be built anew?

Comment 5 Jonathan Lebon 2022-06-06 14:27:32 UTC
Correct. We do not support turning on multipath day-2 on nodes that have been upgraded from 4.6 or earlier. They can enable it on new nodes (that have been installed with 4.7 installation media) by manually using `rpm-ostree kargs --append rd.multipath=default --append root=/dev/disk/by-label/dm-mpath-root` directly on the machine (see the steps in https://docs.openshift.com/container-platform/4.7/nodes/nodes/nodes-nodes-working.html#nodes-nodes-working-evacuating_nodes-nodes-working to do this safely).

Comment 6 ryoji noma 2022-06-08 09:04:46 UTC
Thanks, Jonathan

> We do not support turning on multipath day-2 on nodes that have been upgraded from 4.6 or earlier. 

I think this should be clarified in the documentation, so I have reported a doc bug.

Bug 2094726 - [DOC]It should be clarified that multipath is not supported in environments upgraded from 4.6 or earlier. 
https://bugzilla.redhat.com/show_bug.cgi?id=2094726

Comment 7 ryoji noma 2022-06-09 06:27:13 UTC
Jonathan

The following document says that when adding a node, we must use the same ISO image when installing a cluster. The latest documents say the same thing too. Is it supported to use a different iso from the installation?

Creating Red Hat Enterprise Linux CoreOS (RHCOS) machines
https://docs.openshift.com/container-platform/4.10/machine_management/user_infra/adding-bare-metal-compute-user-infra.html#creating-rhcos-machines-bare-metal

```	
Note
You must use the same ISO image that you used to install a cluster to deploy all new nodes in a cluster. It is recommended to use the same Ignition config file. The nodes automatically upgrade themselves on the first boot before running the workloads. You can add the nodes before or after the upgrade.
```

Comment 8 Jonathan Lebon 2022-06-09 16:20:16 UTC
In general, it's safer to keep reusing the same installation media. However, in some circumstances it is necessary and supported to use a newer ISO matching the target OCP release. For example, for new hardware enablement or to workaround Ignition versioning issues or situations like we have here.