Bug 1256446
Summary: | OSError: [Errno 24] Too many open files while running automation tests | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Meni Yakove <myakove> | ||||||
Component: | vdsm | Assignee: | Piotr Kliczewski <pkliczew> | ||||||
Status: | CLOSED ERRATA | QA Contact: | Meni Yakove <myakove> | ||||||
Severity: | high | Docs Contact: | |||||||
Priority: | unspecified | ||||||||
Version: | 3.6.0 | CC: | bazulay, gklein, lsurette, mgoldboi, myakove, oourfali, pkliczew, pstehlik, ycui, yeylon, ykaul | ||||||
Target Milestone: | ovirt-3.6.0-rc3 | Keywords: | Automation, AutomationBlocker, ZStream | ||||||
Target Release: | 3.6.0 | ||||||||
Hardware: | x86_64 | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | v4.17.5 | Doc Type: | Bug Fix | ||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | |||||||||
: | 1265965 (view as bug list) | Environment: | |||||||
Last Closed: | 2016-03-09 19:44:00 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | Infra | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | |||||||||
Bug Blocks: | 1265965 | ||||||||
Attachments: |
|
Description
Meni Yakove
2015-08-24 15:14:30 UTC
Created attachment 1066477 [details]
engine logs
Created attachment 1066479 [details]
vdsm logs - host is host_mixed_1 - 10.35.128.28
I've looked at the VDSM logs. and VDSM runs out of its allowed 1024 file descriptors. Following the open FDs during several runs of the tests, VDSM is constantly leaking FDs at relatively steady pace when the tests are active, furthermore, leak is limited to a single type, VDSM is leaking TCP sockets. I've tried to intercept its syscalls and I came across multiple accept(2) calls that never closed their descriptors during the whole time of the syscall trace (1~2 minutes), I'd suggest continuing the investigation there. It seems that it still randomly happens. We need to determine the steps how to reproduce the issue again. It is related to setupNetworks BZ #1262051. Please provide the steps to reproduce. Marked as a GA blocker for now, since no clear repo steps and frequency seems to be down. not a beta1 blocker. I have access to the env so working on it now. This isn't a regression. Removing regression flag. Cloned also to 3.5.Z. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2016-0362.html |