Bug 2029414

Summary: Alibaba Disk snapshots with XFS filesystem cannot be used
Product: OpenShift Container Platform Reporter: Jan Safranek <jsafrane>
Component: StorageAssignee: Jan Safranek <jsafrane>
Storage sub component: Kubernetes External Components QA Contact: Rohit Patil <ropatil>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: unspecified CC: aos-bugs
Version: 4.10   
Target Milestone: ---   
Target Release: 4.10.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-03-10 16:32:18 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jan Safranek 2021-12-06 12:57:42 UTC
It is not possible to restore a snapshot of a XFS volume and use it on the same node as the original volume.

Steps to reproduce:

1. Create a StorageClass with csi.storage.k8s.io/fstype: xfs.
2. Create PVC A + Pod A using it, store some data on the provisioned volume.
3. Stop the Pod A
4. Take a snapshot of the PVC A, restore it into a new volume as PVC B.
5. Run both Pod A (with PVC A) and Pod B (with PVC B) on the same node.

Actual result: one of the pods can't be started:

MountVolume.MountDevice failed for volume "pvc-25a7129e-2cd7-4170-9050-550057ed0e20" : rpc error: code = Internal desc = mount failed: exit status 32
Mounting command: mount
Mounting arguments: -t xfs -o shared,defaults /dev/vde /var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-25a7129e-2cd7-4170-9050-550057ed0e20/globalmount
Output: mount: /var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-25a7129e-2cd7-4170-9050-550057ed0e20/globalmount: wrong fs type, bad option, bad superblock on /dev/vde, missing codepage or helper program, or other error.


Kernel says:

[17567.886004] XFS (vde): Filesystem has duplicate UUID b5820d11-d42d-4fc1-8e4a-ae431b33039d - can't mount


Expected result: both pods can run.

Comment 1 Jan Safranek 2021-12-06 12:58:16 UTC
This has been fixed upstream in https://github.com/kubernetes-sigs/alibaba-cloud-csi-driver/pull/570. We need to backport the PR.

Comment 2 Jan Safranek 2021-12-06 13:04:17 UTC
This needs to be fixed before declaring the CSI driver GA in 4.10.

Comment 9 errata-xmlrpc 2022-03-10 16:32:18 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056