Bug 1331816 - [dev-preview-int] Pods won't deploy when memory settings are too low
Summary: [dev-preview-int] Pods won't deploy when memory settings are too low
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Management Console
Version: 3.2.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 3.2.1
Assignee: Samuel Padgett
QA Contact: Yadan Pei
URL:
Whiteboard:
Duplicates: 1333029
Depends On:
Blocks:
 
Reported: 2016-04-29 15:55 UTC by Steve Speicher
Modified: 2016-06-27 15:06 UTC
CC List: 10 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
The web console has been updated to more accurately reflect memory limit values.
Clone Of:
Environment:
Last Closed: 2016-06-27 15:06:10 UTC
Target Upstream Version:
Embargoed:




Links
System ID: Red Hat Product Errata RHBA-2016:1343
Status: SHIPPED_LIVE (priority: normal)
Summary: Red Hat OpenShift Enterprise 3.2.1.1 bug fix and enhancement update
Last Updated: 2016-06-27 19:04:05 UTC

Description Steve Speicher 2016-04-29 15:55:53 UTC
Description of problem:
If the requested CPU is below a threshold, the deployment fails. I'm only supplying the memory request, which is at or near the lower limit.

I would not expect it to fail when memory is at or above the minimum. I would also expect it to simply inform me that it will use more than what was requested.

Version-Release number of selected component (if applicable):

OpenShift Master:
v3.2.0.40
Kubernetes Master:
v1.2.0-36-g4a3f9c5

How reproducible:

I reproduced this in 2 different ways (see the sketch after this list):
1. nodejs-mongo-example, with the memory request for the Node.js container modified to 150 MiB, which gave the same error below.
2. The third-party template at https://raw.githubusercontent.com/GrahamDumpleton/openshift3-kallithea/master/template.json
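
A minimal sketch of the edit in the first reproduction, assuming the usual container resources stanza (the container name and surrounding fields are illustrative, not the exact template contents):

    spec:
      containers:
      - name: nodejs-mongo-example    # illustrative name
        resources:
          requests:
            memory: 150Mi             # at the LimitRange minimum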


Steps to Reproduce:
1. Deploy one of the 2 apps above.
2. Ensure the memory request is low enough (at or near the 150 MiB minimum).
3. Observe the events.

Expected results:
Either my request would be increased to allow the app to deploy, or my minimum memory request would be accepted.


Additional info:

11:45:34 AM  Replication Controller  kallithea-db-1  Failed create
Error creating: pods "kallithea-db-1-" is forbidden: [Minimum cpu usage per Pod is 30m, but request is 22m., Minimum memory usage per Pod is 150Mi, but request is 120586240., Minimum cpu usage per Container is 30m, but request is 22m., Minimum memory usage per Container is 150Mi, but request is 115Mi.]
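
A note on the event text: the unformatted request figure in the per-Pod memory message is the same quantity as the per-Container 115Mi, just rendered in bytes:

    115Mi = 115 * 2^20 bytes = 120586240 bytes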

Comment 1 Derek Carr 2016-04-29 17:13:10 UTC
Luke, is this expected behavior? Were we intending to change user input as requested above?

Comment 2 Luke Meyer 2016-04-29 17:49:00 UTC
It's expected behavior.

In Online, the limits specified by the user are overridden by the ClusterResourceOverrides admission plugin. This happens when the pod is instantiated (not when a pod template is given in, e.g., an RC). It works exactly like the LimitRanger plugin normally would: if you specify a limit/request out of bounds, it fails at the point where the pod is instantiated. The difference is that it's obvious what's going on when the user specifies the wrong numbers, but with the CRO plugin in place the numbers are actually rewritten, so the user won't even recognize where they come from. Bad UX.
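
For context, the plugin is enabled in the master configuration. A minimal sketch, with illustrative percentages rather than Online's actual settings:

    admissionConfig:
      pluginConfig:
        ClusterResourceOverride:
          configuration:
            apiVersion: v1
            kind: ClusterResourceOverrideConfig
            memoryRequestToLimitPercent: 60   # request rewritten to 60% of the limit
            cpuRequestToLimitPercent: 60
            limitCPUToMemoryPercent: 200

With a 60% memory ratio, any user-entered limit below 250Mi yields a rewritten request below the 150Mi LimitRange minimum, which is exactly the failure in the events above.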

There are a number of ways to address this.

It would seem obvious to just set a floor, either an absolute one or the per-project LimitRange minimum, and reset too-low values to that floor. Abhishek did not like this, as it violates the purpose of the CRO plugin, which is to manage overcommit; he argued that in order to maintain the desired limit-to-request ratios we should just reject pods rather than apply a floor.

Other options would be for ops to adjust the LimitRange to allow lower request minimums, or to adjust the CRO plugin parameters so the rewritten values no longer violate the limits. Finally, the templates could be adjusted. A sketch of the first option follows.
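
A minimal sketch of a per-project LimitRange with lowered minimums (the object name and the new values are illustrative):

    apiVersion: v1
    kind: LimitRange
    metadata:
      name: resource-limits      # illustrative name
    spec:
      limits:
      - type: Pod
        min:
          cpu: 20m               # lowered from the 30m in the events above
          memory: 64Mi           # lowered from 150Mi so rewritten requests still pass
      - type: Container
        min:
          cpu: 20m
          memory: 64Mi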

If we're shipping the nodejs-mongo-example in Online, I would suggest we fix its request settings and lower the severity of this bug, at least until we have agreement on how best to address it.

Comment 3 Steve Speicher 2016-05-05 11:35:57 UTC
When will this get addressed (either in the template/example or by some other mechanism)? I fear it will be a source of confusion for a number of our end users when they use Actions -> Set Resource Limits and their deployments fail even though they set a value within range.

Comment 4 Luke Meyer 2016-05-05 17:04:16 UTC
Dan/Abhishek/David any opinions?

Comment 5 Dan McPherson 2016-05-05 19:36:24 UTC
I had opened a dup of this bug here:

https://bugzilla.redhat.com/show_bug.cgi?id=1333029

My suggestion is that we fix this in the UI by changing the lower limit to 10/6 * the request minimum (worked example below). It actually doesn't matter whether you are using the extra admission controller: if the field is a limit field, both bound values should be limits; if the field is a request field, the min and max should be request values.
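
A worked sketch of that arithmetic, assuming the request is rewritten to 60% of the limit (the 6/10 inverse of Dan's ratio; the percentage is illustrative):

    # LimitRange minimum request: 150Mi
    # Smallest limit whose rewritten request still passes:
    #   150Mi * 10/6 = 250Mi
    resources:
      limits:
        memory: 250Mi    # smallest value the console should offer
      requests:
        memory: 150Mi    # 250Mi * 60% = 150Mi, computed by the CRO plugin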

Comment 6 Dan McPherson 2016-05-05 19:37:28 UTC
*** Bug 1333029 has been marked as a duplicate of this bug. ***

Comment 7 Xingxing Xia 2016-05-06 01:48:12 UTC
Discussed in https://bugzilla.redhat.com/show_bug.cgi?id=1324825

Comment 8 Samuel Padgett 2016-05-06 16:01:34 UTC
https://github.com/openshift/origin/pull/8775

Comment 11 Xingxing Xia 2016-05-10 11:18:32 UTC
It seems https://github.com/openshift/origin/pull/8775 fixes the web console Set Resource Limits page but does not touch the CLI side of this bug. Will the CLI user experience be left as it currently behaves?

Comment 12 Samuel Padgett 2016-05-10 11:42:45 UTC
The CLI doesn't do any validation against limit ranges, even if you don't use ClusterResourceOverrides.

Comment 13 Xingxing Xia 2016-05-19 06:44:25 UTC
(In reply to Samuel Padgett from comment #12)
> The CLI doesn't do any validation against limit ranges, even if you don't
> use ClusterResourceOverrides.

Yes, the CLI doesn't do any validation against limit ranges. But the problem that values between 150Mi and 256Mi don't work arises specifically when using ClusterResourceOverrides. (Though this is ClusterResourceOverrides' expected behavior, as comment 2 said.)

From the user's view, `oc get limitrange -o yaml` shows that the acceptable Min is 150Mi (see the sketch below). I think this bug doesn't intend to say that pods should deploy when memory settings are < 150Mi. Instead, it intends to say it is surprising that pods won't deploy when memory settings are between 150Mi and 256Mi.
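
A sketch of what the user sees from that command (the object name and omitted fields are illustrative):

    apiVersion: v1
    items:
    - apiVersion: v1
      kind: LimitRange
      metadata:
        name: resource-limits    # illustrative name
      spec:
        limits:
        - type: Container
          min:
            cpu: 30m
            memory: 150Mi        # the only floor visible to the user
    kind: List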

Comment 14 Samuel Padgett 2016-05-25 11:36:55 UTC
Marking ON_QA to verify the web console changes.

Comment 15 Yadan Pei 2016-05-26 08:46:33 UTC
Found that the PR is merged to openshift:master; currently we only have puddles built from openshift:enterprise-3.2 (the latest is 2016-05-25.3, with oc v3.2.0.45). Only when the 3.2.1 puddle is ready can we verify the bug, right?


@samuel, could you please help confirm? Thanks.

Comment 17 Xingxing Xia 2016-05-26 09:59:09 UTC
A minor problem is noted at https://bugzilla.redhat.com/show_bug.cgi?id=1324825#c20.

Comment 18 Xingxing Xia 2016-06-08 08:31:29 UTC
The PR is merged in the latest v3.2.1.1-1-g33fa4ea. The bug (the web console range issue) is fixed and can be VERIFIED. Will you change it to ON_QA?

Comment 20 errata-xmlrpc 2016-06-27 15:06:10 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2016:1343

