Bug 1396183
| Summary: | [Scale] vms were not recovered, several days, after vdsm restart. | | |
|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | Ilanit Stein <istein> |
| Component: | BLL.Virt | Assignee: | Michal Skrivanek <michal.skrivanek> |
| Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | Ilanit Stein <istein> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.0.5.5 | CC: | bazulay, bugs, istein, lsurette, mperina, s.kieske, srevivo, tjelinek, tnisan, ycui, ykaul |
| Target Milestone: | --- | Flags: | istein: needinfo-, istein: devel_ack? |
| Target Release: | --- | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2017-02-13 18:49:52 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
Description
Ilanit Stein
2016-11-17 17:10:21 UTC
This bug might be related to Bug 1393295. Liron, please have a look at whether it is indeed related.

Ilanit, can you please attach the relevant logs? Thanks, Liron.

I've checked the vdsm code; the code in question relates to libvirt domains (VMs), not storage domains. Moving to virt for further inspection of the issue.

Created attachment 1223921 [details]
initial vdsm log
In this log, note that vdsm was restarted and started at:
MainThread::INFO::2016-11-07 09:10:24,776::vdsm::135::vds::(run) (PID: 1758) I am the actual vdsm 4.18.15-1.el7ev b01-h18-r620.rhev.openstack.engineering.redhat.com (3.10.0-514.el7.x86_64)
Created attachment 1223922 [details]
final vdsm log
See in the log:
clientIFinit::INFO::2016-11-15 07:01:02,512::clientIF::545::vds::(_waitForDomainsUp) recovery: waiting for 2 domains to go up
Seems like 2 VMs never responded when querying them in libvirt during recovery. Can you reproduce the problem or dig out the libvirt logs? If you didn't have debug enabled in libvirt, then it needs to be reproduced, unfortunately.
Also, did you have fencing enabled? It may be skipped when it's returning "recovery". Martine?

...hmm...not good

I do not have the libvirt logs. Similar scale testing is planned for the coming days. I can track whether this problem reproduces.

Please re-run with libvirt debug logs.

Ilanit, any chance to get this info?

I am still waiting to get a scale machine to test it on.

ok, so putting the needinfo back on you to mark that we are waiting for some info.

(In reply to Tomas Jelinek from comment #12)
> ok, so putting the needinfo back on you to mark that we are waiting for some
> info.

I'm closing for the time being, please re-open when reproduced.

Removing needinfo, as the problem has not been reproduced so far. I shall reopen the bug if it reproduces.
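
For anyone tracking whether this reproduces on a later scale run, a rough way to spot a stuck recovery is to scan the vdsm log for the "recovery: waiting for N domains to go up" message quoted above and see how long it keeps appearing. The sketch below is only an illustration: the line layout is inferred from the two log lines pasted in this bug, and the names `RECOVERY_RE`, `find_recovery_waits` and `recovery_wait_span` are hypothetical helpers, not part of vdsm.

```python
import re
from datetime import datetime

# Layout assumed from the log lines quoted in this bug:
# thread::LEVEL::YYYY-MM-DD HH:MM:SS,mmm::module::lineno::logger::(func) message
RECOVERY_RE = re.compile(
    r"^(?P<thread>[^:]+)::(?P<level>[A-Z]+)::"
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})::"
    r".*recovery: waiting for (?P<count>\d+) domains to go up"
)


def find_recovery_waits(lines):
    """Return (timestamp, pending_domain_count) for each recovery-wait line."""
    hits = []
    for line in lines:
        match = RECOVERY_RE.match(line)
        if match:
            ts = datetime.strptime(match.group("ts"), "%Y-%m-%d %H:%M:%S,%f")
            hits.append((ts, int(match.group("count"))))
    return hits


def recovery_wait_span(hits):
    """Return the time between the first and last recovery-wait line, or None."""
    if not hits:
        return None
    timestamps = [ts for ts, _ in hits]
    return max(timestamps) - min(timestamps)


if __name__ == "__main__":
    import sys

    # Usage (path is whatever vdsm log you want to inspect):
    #   python recovery_scan.py /path/to/vdsm.log
    with open(sys.argv[1], errors="replace") as handle:
        waits = find_recovery_waits(handle)
    print("recovery-wait lines:", len(waits))
    print("span between first and last:", recovery_wait_span(waits))
```

A long span with a count that never drops would match what the final vdsm log shows here. It does not say why the domains are not coming up; that still needs the libvirt debug logs requested above.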