Bug 2135381
| Summary: | Live migration of OpenShift Virtualization VMs with ODF (ceph storage) based disks is failing consistently | ||
|---|---|---|---|
| Product: | Container Native Virtualization (CNV) | Reporter: | pbunev <pbunev> |
| Component: | Virtualization | Assignee: | Jed Lejosne <jlejosne> |
| Status: | VERIFIED --- | QA Contact: | zhe peng <zpeng> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 4.10.5 | CC: | acardace, bdumont, dzilberm, fdeutsch, ipinto, ktenzer, pbunev, pelauter, prince.tcet, sgott, vromanso, yadu, ycui, zpeng |
| Target Milestone: | --- | ||
| Target Release: | 4.14.0 | ||
| Hardware: | Unspecified | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | v4.14.0.rhel9-1569 | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | 2016584 | Environment: | |
| Last Closed: | Type: | --- | |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 2092271, 2016584, 2174226 | ||
| Bug Blocks: | |||
Description
pbunev@redhat.com
2022-10-17 13:03:29 UTC
This bug was reported against 4.10. It is unclear why the target version is set to 4.8.4, so let's re-target it in the bug scrub meeting.
And from the old virt-launcher log:
{"component":"virt-launcher","kind":"","level":"error","msg":"Recevied a live migration error. Will check the latest migration status.","name":"fedora-cephfs","namespace":"vm-testproj","pos":"live-migration-source.go:805","reason":"error encountered during MigrateToURI3 libvirt api call: virError(Code=1, Domain=10, Message='internal error: unable to execute QEMU command 'cont': Failed to get \"write\" lock')","timestamp":"2022-10-17T09:55:42.993889Z","uid":"2a34c2ae-71af-4d4e-a116-5ce9621ce88a"}
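For triage, the log entry above can be checked mechanically. The following is a minimal sketch, not part of any KubeVirt tooling: it parses the JSON line quoted above and tests whether the failure reason is the QEMU "write" lock error. The function name `migration_failure_reason` is hypothetical.

```python
import json

# The virt-launcher log line quoted above, verbatim (raw string keeps the \" escapes).
LOG_LINE = r'''{"component":"virt-launcher","kind":"","level":"error","msg":"Recevied a live migration error. Will check the latest migration status.","name":"fedora-cephfs","namespace":"vm-testproj","pos":"live-migration-source.go:805","reason":"error encountered during MigrateToURI3 libvirt api call: virError(Code=1, Domain=10, Message='internal error: unable to execute QEMU command 'cont': Failed to get \"write\" lock')","timestamp":"2022-10-17T09:55:42.993889Z","uid":"2a34c2ae-71af-4d4e-a116-5ce9621ce88a"}'''


def migration_failure_reason(line):
    """Return the 'reason' field of an error-level log entry, else None."""
    entry = json.loads(line)
    if entry.get("level") != "error":
        return None
    return entry.get("reason")


reason = migration_failure_reason(LOG_LINE)
# True when the migration failed because QEMU could not take the image write lock.
is_lock_error = reason is not None and 'Failed to get "write" lock' in reason
print(is_lock_error)
```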
*** Bug 2152909 has been marked as a duplicate of this bug. ***
Deferring to 4.13.1 due to capacity.
Deferring to 4.14 due to priority.
Verified with build: CNV-v4.14.0.rhel9-1632
Steps:
1. Create a VM with the ocs-storagecluster-cephfs storage class
...
storage:
  resources:
    requests:
      storage: 30Gi
  storageClassName: ocs-storagecluster-cephfs
...
Check the PVC:
...
resources:
  requests:
    storage: "34087042032"
storageClassName: ocs-storagecluster-cephfs
volumeMode: Filesystem
volumeName: pvc-19fd569a-a750-4298-99fc-81e0767ea167
...
2. Start the VM
$ oc get pods
NAME                            READY   STATUS    RESTARTS   AGE
virt-launcher-vm-fedora-9pgkv   1/1     Running   0          2m41s
3. Perform a live migration
$ oc get pods
NAME                            READY   STATUS      RESTARTS   AGE
virt-launcher-vm-fedora-6tgzl   1/1     Running     0          16s
virt-launcher-vm-fedora-9pgkv   0/1     Completed   0          3m22s
$ oc get vm
NAME        AGE     STATUS    READY
vm-fedora   4m31s   Running   True
$ oc get virtualmachineinstancemigrations.kubevirt.io
NAME                        PHASE       VMI
vm-fedora-migration-o514o   Succeeded   vm-fedora
Moving to VERIFIED.
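A side note on the PVC size seen in step 1: the 30Gi request shows up as 34087042032 bytes on the PVC. That number is consistent with the requested size being padded for filesystem overhead on a Filesystem-mode volume. A minimal sketch of the arithmetic, assuming a 5.5% overhead (believed to be the CDI default, not stated anywhere in this bug) and rounding up to a whole byte; the function name is hypothetical:

```python
import math
from fractions import Fraction

# Assumption: 5.5% filesystem overhead; this value is not given in the bug itself.
ASSUMED_FS_OVERHEAD = Fraction("0.055")


def pvc_size_with_overhead(requested_bytes, overhead=ASSUMED_FS_OVERHEAD):
    """Pad a requested disk size so the usable space still covers the request."""
    return math.ceil(Fraction(requested_bytes) / (1 - overhead))


requested = 30 * 2**30  # 30Gi in bytes
padded = pvc_size_with_overhead(requested)
print(requested, padded)
```

Under that assumption the result matches the value reported on the PVC.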