Bug 2297022

Summary:	ODF nodes require a manual reboot after compute vmotion is performed due to a disk change order on the nodes
Product:	[Red Hat Storage] Red Hat OpenShift Data Foundation	Reporter:	palshure
Component:	ceph	Assignee:	Kotresh HR <khiremat>
ceph sub component:	CephFS	QA Contact:	Elad <ebenahar>
Status:	NEW ---	Docs Contact:
Severity:	medium
Priority:	unspecified	CC:	bniver, etamir, khiremat, muagarwa, palshure, sostapov, vshankar
Version:	4.14	Flags:	vshankar: needinfo? (palshure)
Target Milestone:	---
Target Release:	---
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:		Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description palshure 2024-07-10 05:40:13 UTC

Description of problem (please be detailed as possible and provide log
snippests):
Case 1:
Whether compute vmotion is performed manually or automatically, ODF becomes unhealthy and a disk order change on the nodes can be observed. Rebooting the nodes solves the issue.

Case 2:
Powering off the VM, then doing compute vMotion, then powering it back on (all manual activities) does not cause the issue, but it is not an optimal situation.

Version of all relevant components (if applicable):
4.14.6

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
disabling compute vmotion on storage nodes

Is there any workaround available to the best of your knowledge?
powering the nodes off first

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1

Can this issue reproducible?
yes

Can this issue reproduce from the UI?
n/a

If this is a regression, please provide more details to justify this:
n/a

Steps to Reproduce:
Described earlier

Actual results:
ODF nodes require a manual reboot after compute vmotion is performed due to a disk change order on the nodes

Expected results:
ODF should recover automatically after compute vmotion

Additional info:
disk.EnableUUID parameter is enabled, OSD disks deployed via lso are persistent.