Created attachment 1242102 [details]
engine.log

Description of problem:

If host-deploy reinstall fails and leaves vdsmd down, moving the host to Maintenance gets stuck with status 'Preparing for Maintenance'.

A quick search in the code finds HostPreparingForMaintenanceIdleTime, which is set to 300 seconds (also verified that with engine-config), but after more than 5 minutes nothing changed and nothing was written to engine.log.

Restarting the engine solved this. Host was still non-responsive, and moving it to maintenance worked immediately.

Version-Release number of selected component (if applicable):
Current master snapshot

How reproducible:
Not sure, always on my current system.

Steps to Reproduce:
1. deploy a host successfully
2. reinstall it, and make this fail somehow, including vdsmd. Perhaps it's enough to manually stop vdsmd, didn't try.
3. Move the host to Maintenance.

Actual results:
Host is stuck in 'Preparing for Maintenance'.

Expected results:
Host moves immediately, or at most after some timeout.

Additional info:
Restarting the engine solves this.
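
To make the expected result concrete, here is a minimal sketch of the timeout behavior described above. It is not the engine's code and does not claim to show how HostPreparingForMaintenanceIdleTime is actually used. The `Host` class, the status strings, and `check_idle_timeout` are hypothetical names introduced only for illustration; the 300-second default mirrors the configured value mentioned in the description.

```python
from dataclasses import dataclass

# Hypothetical status names, chosen for this sketch only.
PREPARING = "PreparingForMaintenance"
MAINTENANCE = "Maintenance"


@dataclass
class Host:
    name: str
    status: str
    status_since: float  # time (in seconds) at which the current status was entered


def check_idle_timeout(host: Host, now: float, idle_timeout: float = 300.0) -> str:
    """Return the status the reporter would expect the host to have at `now`.

    Assumption: a host that has sat in 'Preparing for Maintenance' for at
    least `idle_timeout` seconds is moved on to Maintenance instead of
    staying stuck. Any other status is returned unchanged.
    """
    if host.status == PREPARING and now - host.status_since >= idle_timeout:
        return MAINTENANCE
    return host.status
```

With a check like this run periodically, a host that entered 'Preparing for Maintenance' at time 0 would still be preparing at 100 seconds but would be in Maintenance once 300 seconds had passed, which is what the report expected to see without an engine restart.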
Didi, is this error reproducible? If not, I'd close this as WORKSFORME: we haven't found any clue as to why it happened, and all our attempts to reproduce the issue failed (the host was always moved to Maintenance in the end).
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days