Description of problem: vdsm.log has these entries: > Thread-720062::DEBUG::2017-01-13 > 07:29:46,832::__init__::386::IOProcessClient::(_startCommunication) Starting > communication thread for client ioprocess-5874 > Thread-720062::WARNING::2017-01-13 > 07:29:46,847::__init__::401::IOProcessClient::(_startCommunication) Timeout > waiting for communication thread for client ioprocess-5874 Version-Release number of selected component (if applicable): vdsm-4.18.11-1.el7.centos.x86_64 python-ioprocess-0.16.1-1.el7.noarch ioprocess-0.16.1-1.el7.x86_64 How reproducible: > We have an ovirt system with 3 clusters, all running centos7. > ovirt engine is running on separate host, > ovirt-engine-3.6.4.1-1.el7.centos.noarch > 2 of the clusters are running newer version of ovirt, 3 nodes each, > ovirt-engine-4.0.3-1.el7.centos.noarch, glusterfs-3.7.16-1.el7.x86_64, > vdsm-4.18.11-1.el7.centos.x86_64. > 1 cluster is still running the older version, > ovirt-engine-3.6.4.1-1.el7.centos.noarch. Steps to Reproduce: 1. happens just while running even before we had active Vms on cluster. 2. Only "DMZ" cluster is showing these timeouts. Not sure if firewall maybe causing issues? 3. Actual results: Expected results: Additional info:
Can you attach complete logs? both vdsm, supervdsm and /var/log/messages?
Created attachment 1242541 [details] vdsm.log, supervdsm.log, messages
Nir, could you please take a look?
Moving to ioprocess, IOProcessClient is part of ioprocess, not vdsm.
This message is a reminder that Fedora 25 is nearing its end of life. Approximately 4 (four) weeks from now Fedora will stop maintaining and issuing updates for Fedora 25. It is Fedora's policy to close all bug reports from releases that are no longer maintained. At that time this bug will be closed as EOL if it remains open with a Fedora 'version' of '25'. Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, simply change the 'version' to a later Fedora version. Thank you for reporting this issue and we are sorry that we were not able to fix it before Fedora 25 is end of life. If you would still like to see this bug fixed and are able to reproduce it against a later version of Fedora, you are encouraged change the 'version' to a later Fedora version prior this bug is closed as described in the policy above. Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Fedora release includes newer upstream software that fixes bugs or makes them obsolete.
Patch was modified long time ago, but gerrit did not update the bug since the bug should be under ovirt.
Resetting target milestone for 4.2.1 - let's make sure we tag, build and ship this.
(In reply to Allon Mureinik from comment #7) > Resetting target milestone for 4.2.1 - let's make sure we tag, build and > ship this. 4.2.0, actually. Just clicked that we just did a beta for oVirt, not a GA.
Nir, are those errors appear only on DMZ clusters as mentioned in the first paragraph, or is there a different way to reproduce them?
We don't know how to reproduce the errors. The change in ioprocess was only increasing the default timeout because it was too low. Note that the change is include only in ioprocess 1.0 - and this version is not released yet, so we cannot test it yet.
ioprocess-1.0.0-1.fc27 has been submitted as an update to Fedora 27. https://bodhi.fedoraproject.org/updates/FEDORA-2018-fbe8141dd2
ioprocess-1.0.0-1.fc27 has been pushed to the Fedora 27 testing repository. If problems still persist, please make note of it in this bug report. See https://fedoraproject.org/wiki/QA:Updates_Testing for instructions on how to install test updates. You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2018-fbe8141dd2
ioprocess-1.0.2-1.fc27 has been submitted as an update to Fedora 27. https://bodhi.fedoraproject.org/updates/FEDORA-2018-5fc2a37e8a
ioprocess-1.0.2-1.fc27 has been pushed to the Fedora 27 testing repository. If problems still persist, please make note of it in this bug report. See https://fedoraproject.org/wiki/QA:Updates_Testing for instructions on how to install test updates. You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2018-5fc2a37e8a
Full tier1 automation ran using ioprocess-1.0.2-1.el7ev.x86_64, no errors or regressions occurred related to the new ioprocess. moving to verify.
This bugzilla is included in oVirt 4.2.0 release, published on Dec 20th 2017. Since the problem described in this bug report should be resolved in oVirt 4.2.0 release, published on Dec 20th 2017, it has been closed with a resolution of CURRENT RELEASE. If the solution does not work for you, please open a new bug report.
ioprocess-1.0.2-1.fc27 has been pushed to the Fedora 27 stable repository. If problems still persist, please make note of it in this bug report.