Bug 1938467
Field | Value
---|---
Summary | The default cluster-autoscaler should get default cpu and memory requests if user omits them
Product | OpenShift Container Platform
Component | Cloud Compute
Cloud Compute sub component | Cluster Autoscaler
Reporter | Clayton Coleman <ccoleman>
Assignee | Danil Grigorev <dgrigore>
QA Contact | sunzhaohua <zhsun>
Status | CLOSED ERRATA
Severity | high
Priority | unspecified
CC | aos-bugs, dgrigore, nelluri, wking
Version | 4.8
Target Release | 4.8.0
Hardware | Unspecified
OS | Unspecified
Last Closed | 2021-07-27 22:53:17 UTC
Type | Bug
Description
Clayton Coleman
2021-03-13 17:39:43 UTC
Failed to verify: the autoscaler pod is stuck in Pending status.

clusterversion: 4.8.0-0.nightly-2021-04-15-202330

Steps:

1. Create a ClusterAutoscaler:

    apiVersion: "autoscaling.openshift.io/v1"
    kind: "ClusterAutoscaler"
    metadata:
      name: "default"
    spec:
      resourceLimits:
        maxNodesTotal: 10
      scaleDown:
        enabled: true
        delayAfterAdd: 10s
        delayAfterDelete: 10s
        delayAfterFailure: 10s
        unneededTime: 10s

2. Check the autoscaler pod; it is stuck in Pending:

    $ oc get po
    NAME                                           READY   STATUS    RESTARTS   AGE
    cluster-autoscaler-default-56659849fc-b49rk    0/1     Pending   0          5m46s
    cluster-autoscaler-default-74f84d9957-dp8xk    1/1     Running   0          5m37s
    cluster-autoscaler-operator-844d8f7b96-srq2v   2/2     Running   0          8m28s
    cluster-baremetal-operator-84f7c56bbc-j9v52    2/2     Running   0          79m
    machine-api-controllers-685988fb5d-2k5tf       7/7     Running   0          85m
    machine-api-operator-6cbcdcd4cd-l27jd          2/2     Running   0          85m

    $ oc describe po cluster-autoscaler-default-56659849fc-b49rk
    Requests:
      cpu:     20Mi
      memory:  10m

The units look swapped; maybe this should be:

      cpu:     10m
      memory:  20Mi

    $ oc describe po cluster-autoscaler-default-56659849fc-b49rk
    Events:
      Type     Reason            Age    From               Message
      ----     ------            ----   ----               -------
      Warning  FailedScheduling  6m12s  default-scheduler  0/6 nodes are available: 3 Insufficient cpu, 3 node(s) didn't match Pod's node affinity/selector.
      Warning  FailedScheduling  6m11s  default-scheduler  0/6 nodes are available: 3 Insufficient cpu, 3 node(s) didn't match Pod's node affinity/selector.

Moving back to POST so I can attach PR removing the test-suite exception too.

Verified.

clusterversion: 4.8.0-0.nightly-2021-04-20-195442

    # oc get po
    NAME                                          READY   STATUS    RESTARTS   AGE
    cluster-autoscaler-default-56fc5bc88c-zqqtf   1/1     Running   0          2m41s

    resources:
      requests:
        cpu: 10m
        memory: 20Mi

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438
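For readers wondering why the swapped requests in the failed verification produced "Insufficient cpu", the arithmetic can be sketched as follows. This is only an illustration, not the operator's or the scheduler's code: `parse_quantity` is a hypothetical helper that handles a small subset of the Kubernetes quantity suffixes (`m`, `k`, `M`, `G`, `Ki`, `Mi`, `Gi`), and the request values are the ones quoted in the comments above.

```python
from fractions import Fraction

# A subset of the suffix multipliers used by Kubernetes resource quantities.
# "m" is milli (1/1000); "Ki"/"Mi"/"Gi" are powers of two.
SUFFIXES = {
    "Ki": 2**10, "Mi": 2**20, "Gi": 2**30,
    "m": Fraction(1, 1000), "k": 10**3, "M": 10**6, "G": 10**9,
}

def parse_quantity(text):
    """Hypothetical helper: convert a quantity string to an exact number."""
    # Check two-letter suffixes first so "Mi" is matched before any one-letter suffix.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if text.endswith(suffix):
            return Fraction(text[: -len(suffix)]) * SUFFIXES[suffix]
    return Fraction(text)

# Requests seen on the Pending pod (units swapped).
swapped_cpu_cores = parse_quantity("20Mi")
swapped_memory_bytes = parse_quantity("10m")

# Requests seen on the Running pod after the fix.
fixed_cpu_cores = parse_quantity("10m")
fixed_memory_bytes = parse_quantity("20Mi")

print("swapped cpu request (cores):", swapped_cpu_cores)
print("fixed cpu request (cores):", float(fixed_cpu_cores))
print("fixed memory request (bytes):", fixed_memory_bytes)
```

Read as a CPU quantity, `20Mi` asks for 20 × 2^20 cores, which no node can offer, whereas `10m` asks for one hundredth of a core.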