Bug 1381745
Summary: | Controllers shut down under heavy AWS API throttling | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Stefanie Forrester <dakini> |
Component: | Node | Assignee: | Paul Morie <pmorie> |
Status: | CLOSED NOTABUG | QA Contact: | DeShuai Ma <dma> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 3.3.0 | CC: | aos-bugs, bingli, decarr, jokerman, mifiedle, mmccomas, wmeng |
Target Milestone: | --- | Keywords: | Reopened |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2016-10-26 18:10:22 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Stefanie Forrester
2016-10-04 21:36:54 UTC
This is a duplicate of 1377483 *** This bug has been marked as a duplicate of bug 1377483 *** tracking the online issue separate from the non-online issue separately. I thought I'd mention a difference between the two bugs that are now marked as duplicates: bz 1377483 caused a harder crash (the controllers did not start back up afterwards). And it left core files behind on the file system after the crashes. Whereas this bug (bz 1381745) does a graceful shutdown and recovers, so the controllers are able to keep running. Also, I wanted to give an update on the impact of this bug for Ops. This issue is less severe than it was a few days ago. I was able to get the controller crashes down to about 11-15 times per day by decreasing the amount of AWS API calls made for DeleteVolume. So with fewer API calls, there's less throttling, which means Ops isn't being impacted so badly by this anymore. This is the bug related to the DeleteVolume requests https://bugzilla.redhat.com/show_bug.cgi?id=1377486#c22 Stefanie- Is this still something you're running into? No, we're not hitting the issue anymore because the API DoS problem has been fixed (bz #1367229). Since we're not being throttled, there are no more controller crashes. Stefanie - does this bug need to remain open if the issue no longer exists? Stefanie - and if not, can an RFE process be used to handle throttling specific concerns? Derek, we can close this one. It probably doesn't make sense to add support for DoSing your cloud service provider :) |