Bug 1099874 - [ ERROR ] Failed to execute stage 'Misc configuration': The read operation timed out
Summary: [ ERROR ] Failed to execute stage 'Misc configuration': The read operation ti...
Keywords:
Status: CLOSED INSUFFICIENT_DATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-hosted-engine-setup
Version: 3.4.0
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
: 3.4.4
Assignee: Yedidyah Bar David
QA Contact: Leonid Natapov
URL:
Whiteboard: integration
: 1152564 (view as bug list)
Depends On:
Blocks: 1147410
TreeView+ depends on / blocked
 
Reported: 2014-05-21 11:51 UTC by Nikolai Sednev
Modified: 2019-04-28 10:06 UTC (History)
12 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-10-28 08:51:39 UTC
oVirt Team: ---
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
sosreport from host deploying the self hosted engine (5.92 MB, application/x-xz)
2014-09-29 09:47 UTC, Syl DES
no flags Details

Comment 1 Nikolai Sednev 2014-05-21 12:00:13 UTC
Only when I cleaned up NFS share and wiped out all inside files and folders, setup succeeded to continue.

[root@nsednev mnt]# cd nsednev_HE_3_4/3433e547-ab16-4401-b7e5-db02ba981acc/images/
7280dd92-c713-42fa-a11f-1012de7c6e8d/ 85c439e5-75ad-428a-a386-c41d9dda2466/ ddf356f6-42a5-4162-853a-64bb002432c8/ 

[root@nsednev mnt]# cd nsednev_HE_3_4/
[root@nsednev nsednev_HE_3_4]# ls     
3433e547-ab16-4401-b7e5-db02ba981acc  __DIRECT_IO_TEST__
[root@nsednev nsednev_HE_3_4]# rm -rf *

Comment 2 Sandro Bonazzola 2014-05-22 13:36:01 UTC
Federico, is something changed in VDSM that can cause activateStorageDomain to timeout?

Comment 3 Sandro Bonazzola 2014-05-22 13:36:45 UTC
Nikolai, can you attach vdsm logs?

Comment 4 Nikolai Sednev 2014-05-25 06:46:17 UTC
(In reply to Sandro Bonazzola from comment #3)
> Nikolai, can you attach vdsm logs?

My bad, can't add them as wiped out my setup last week, will add when this scenario will happen again.

Comment 5 Sandro Bonazzola 2014-05-27 12:27:53 UTC
Please reopen if you manage to reproduce, thanks.

Comment 6 Syl DES 2014-09-29 09:47:04 UTC
Created attachment 942248 [details]
sosreport from host deploying the self hosted engine

Comment 7 Syl DES 2014-09-29 09:48:46 UTC
Hello,

I had exactly the same problem, solved by manually deleting contetn of the storage domain.

I attached the sosreport.

Regards

Comment 8 Sandro Bonazzola 2014-10-20 08:21:32 UTC
didi can you check if this is duplicate of bug #1152564 ?

Comment 13 Yedidyah Bar David 2014-10-28 08:37:52 UTC
*** Bug 1152564 has been marked as a duplicate of this bug. ***

Comment 15 Artyom 2014-10-28 09:53:56 UTC
you know it pass week and it's my work host, so I can just hold in some state with broken hosted-engine --deploy. Week it's not enough to get all logs from host?

Comment 19 Nikolai Sednev 2014-12-16 09:10:51 UTC
Works for me on these components and had not reproduced since then until Today:
mom-0.4.1-4.el6ev.noarch
libvirt-0.10.2-46.el6_6.2.x86_64
vdsm-4.16.8.1-3.el6ev.x86_64
ovirt-hosted-engine-setup-1.2.1-8.el6ev.noarch
sanlock-2.8-1.el6.x86_64
ovirt-host-deploy-1.3.0-2.el6ev.noarch
ovirt-hosted-engine-ha-1.2.4-3.el6ev.noarch
rhevm-3.5.0-0.25.el6ev.noarch

Comment 20 Yedidyah Bar David 2015-01-05 18:10:16 UTC
Looked at this again after a report on users@.

Managed to reproduce and fix. Updating here for reference.

Reproduction was, more-or-less:

Created a VM with nested-kvm to be used as a host

Installed RHEL6

yum install ovirt-hosted-engine-setup from 3.4 repo

hosted-engine --deploy using nfs storage
- On first prompt ("The VM has been started.  Install the OS...") replied 3 (abort)

Ran again, it said the machine is running

hosted-engine --vm-poweroff
- it killed the machine

hosted-engine --set-maintenance --mode=global
- it hung, killed it after a few minutes

sanlock status
- output some info

sanlock shutdown -f 1
- it said it shutdown

sanlock status
- no output

Removed the data in the SD created before

hosted-engine --deploy
- This time it failed as in this bug

At this point, running:

vdsClient -s 0 getVdsStats
- also got stuck, didn't see anything suspicios in vdsm.log

How to solve?

vdsm has code to check if hosted-engine is setup, and if so, connects to hosted-engine-ha (agent/broker). But at this point, it is "set up", but ha is still down. Doing:

# rm /etc/ovirt-hosted-engine/hosted-engine.conf

was enough to make it not try that anymore.

Leaving the bug closed for now. If we decide to reopen, the proper subject should probably be "hosted-engine has no cleanup tool". In 3.5 we added an option '4' at that prompt, which also kills the vm and thus manages to release sanlock. I then verified that running deploy again works, which seems enough.


Note You need to log in before you can comment on or make changes to this bug.