Bug 1373395
Summary: | overcloud api do not respond after some time | |||
---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Gurenko Alex <agurenko> | |
Component: | openstack-tripleo-heat-templates | Assignee: | Carlos Camacho <ccamacho> | |
Status: | CLOSED ERRATA | QA Contact: | Gurenko Alex <agurenko> | |
Severity: | urgent | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 10.0 (Newton) | CC: | agurenko, ahirshbe, ccamacho, dbecker, jcoufal, jschluet, jslagle, mburns, mcornea, mkrcmari, morazi, oblaut, rduartes, rhel-osp-director-maint, sasha, sclewis | |
Target Milestone: | beta | Keywords: | Triaged | |
Target Release: | 10.0 (Newton) | |||
Hardware: | x86_64 | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | openstack-tripleo-heat-templates-5.0.0-0.20160907212643.90c852e.2.el7ost | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1406417 (view as bug list) | Environment: | ||
Last Closed: | 2016-12-14 15:57:03 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: |
Description
Gurenko Alex
2016-09-06 07:47:08 UTC
carlos, can you take a look at this one? please move to ASSIGNED when you're able to start work on it Reproduced. Was able to interact with overcloud after bouncing httpd on all controllers. Provided upstream fix: https://review.openstack.org/#/c/374136/ (In reply to Carlos Camacho from comment #5) > Provided upstream fix: https://review.openstack.org/#/c/374136/ carlos, thanks for the fix. can you add a link to the patch under External Trackers towards the top of this bug? and please move the bug to the ON_DEV state. thanks. to summarize, this issue is isolated to low memory environments (8GB RAM). This is unlikely to cause issues in production. (In reply to James Slagle from comment #9) > to summarize, this issue is isolated to low memory environments (8GB RAM). > This is unlikely to cause issues in production. Not sure how this was assumed but I can reproduce this problem on controllers with 32 GB of RAM which is our minimal recommendation. It seems that httpd spawned processes are hung. strace shows for most of the spawned httpd processes following: Process 32595 attached connect(24, {sa_family=AF_LOCAL, sun_path="/var/run/wsgi.9929.0.2.sock"}, 110 Setting MaxClients to 32 just makes this happen faster. Hi, Last week I tried to reproduce this bug in my local environment without luck, I had the issue before the submitted upstream fix. But not anymore (following the official docs from tripleo.org), can you provide me with some feedback about any additional configuration that you might be deploying without the default parameters? Issue is gone for last few builds. Last verified for 2016-10-25.2 build Hi The issue is seen on many QE setups This is usually seen after about 12-24 hours Both BM & Virt setups Ofer please add your output in the bug Cannot reproduce the issue on 2016-11-2.2 build. Tried it for 2 days now, it looks stable to me. Had this issue on 2016-10-31.2 though. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHEA-2016-2948.html |