Bug 1747871
| Summary: | [ci] openshift-kube-scheduler operator fails | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Yadan Pei <yapei> |
| Component: | Networking | Assignee: | Casey Callendrello <cdc> |
| Networking sub component: | openshift-sdn | QA Contact: | zhaozhanqi <zzhao> |
| Status: | CLOSED DUPLICATE | Docs Contact: | |
| Severity: | low | ||
| Priority: | low | CC: | agarcial, aos-bugs, calfonso, hongkliu, kgarriso, mfojtik, sttts, yapei |
| Version: | 4.2.0 | Keywords: | Reopened |
| Target Milestone: | --- | ||
| Target Release: | 4.3.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | buildcop | ||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2019-12-03 10:50:02 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Yadan Pei
2019-09-02 06:55:24 UTC
I went through the logs and I don't see any problem with the scheduler, the operator is working as expected and scheduler is working properly. If there's a problem it looks like a problem with either the nodes being available which might be in turn a problem with openstack infrastructure. I'm closing this, if you thing the problem still exists please direct the bug at a specific component that is failing and not at a component that you happen to find a log matching it. The root cause is MCO has not finished the upgrade, so kube-apiserver is not ready (degraded) which in turn casues kube-scheduler to fail as well. I'll pass this over to the MCO team for an investigation. The SDN container seems to be crash looping. I'm moving this over to the networking team, although sine this bug is somewhat old it would be good to see if this is still an issue. I see the issue; it seems to be slow SDN startup time in concert with a poorly written liveness check on one of the nodes. Maybe that node is just slow or had other connectivity issues. We fixed that in 1761609. I see that CI has been reasonably green (though the release jobs are a trainwreck.. not this problem), so I think this is fixed. *** This bug has been marked as a duplicate of bug 1761609 *** |