Bug 1776402
| Summary: | Test Failure: release-openshift-origin-installer-e2e-aws-ovn-kubernetes-4.3 #49 : Run template e2e-aws - e2e-aws-ovn-kubernetes container setup | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Lokesh Mandvekar <lsm5> |
| Component: | Etcd | Assignee: | Sam Batschelet <sbatsche> |
| Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | ge liu <geliu> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 4.3.0 | CC: | aos-bugs, gblomqui, jokerman, lszaszki, mfojtik, nagrawal |
| Target Milestone: | --- | ||
| Target Release: | 4.4.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2020-02-25 10:30:54 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1775878 | ||
| Bug Blocks: | |||
Description
Lokesh Mandvekar
2019-11-25 15:54:13 UTC
> level=error msg="Cluster operator kube-apiserver Degraded is True with NodeInstallerDegradedInstallerPodFailed: NodeInstallerDegraded: 1 nodes are failing on revision 3:\nNodeInstallerDegraded: "
> level=info msg="Cluster operator kube-apiserver Progressing is True with Progressing: Progressing: 1 nodes are at revision 0; 1 nodes are at revision 2; 1 nodes are at revision 3; 0 nodes have achieved new revision 5"
> level=error msg="Cluster operator kube-controller-manager Degraded is True with NodeInstallerDegradedInstallerPodFailed: NodeInstallerDegraded: 1 nodes are failing on revision 5:\nNodeInstallerDegraded: "
> level=info msg="Cluster operator kube-controller-manager Progressing is True with Progressing: Progressing: 2 nodes are at revision 4; 1 nodes are at revision 6"
> level=info msg="Cluster operator kube-scheduler Progressing is True with Progressing: Progressing: 2 nodes are at revision 4; 1 nodes are at revision 5"
Seems like a problem with one of the nodes.
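The installer log lines above follow a regular shape, so a quick triage can be scripted. The following is a minimal sketch, assuming the `level=... msg="Cluster operator <name> <condition> is <status> with <reason>: ..."` layout seen in this report holds for the rest of the log; the function names and the `SAMPLE` list are hypothetical and not part of any OpenShift tooling.

```python
import re

# Matches installer log lines of the shape seen in this report, e.g.
#   level=error msg="Cluster operator kube-apiserver Degraded is True with <Reason>: ..."
# Assumption: every operator status line follows this layout.
LINE_RE = re.compile(
    r'level=(?P<level>\w+) msg="Cluster operator (?P<operator>[\w-]+) '
    r'(?P<condition>\w+) is (?P<status>\w+) with (?P<reason>[^:]+):'
)


def parse_operator_conditions(lines):
    """Return one dict per log line that reports a cluster operator condition."""
    results = []
    for line in lines:
        match = LINE_RE.search(line)
        if match:
            results.append(match.groupdict())
    return results


def degraded_operators(lines):
    """Return the sorted names of operators whose Degraded condition is True."""
    return sorted({
        cond["operator"]
        for cond in parse_operator_conditions(lines)
        if cond["condition"] == "Degraded" and cond["status"] == "True"
    })


# Three of the lines quoted in the description, copied verbatim as raw strings.
SAMPLE = [
    r'level=error msg="Cluster operator kube-apiserver Degraded is True with NodeInstallerDegradedInstallerPodFailed: NodeInstallerDegraded: 1 nodes are failing on revision 3:\nNodeInstallerDegraded: "',
    r'level=error msg="Cluster operator kube-controller-manager Degraded is True with NodeInstallerDegradedInstallerPodFailed: NodeInstallerDegraded: 1 nodes are failing on revision 5:\nNodeInstallerDegraded: "',
    r'level=info msg="Cluster operator kube-scheduler Progressing is True with Progressing: Progressing: 2 nodes are at revision 4; 1 nodes are at revision 5"',
]

if __name__ == "__main__":
    print(degraded_operators(SAMPLE))
```

Run against the full installer log, this would list which operators ended up Degraded, which helps confirm whether the failure is confined to the static-pod operators on a single node.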
https://storage.googleapis.com/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-ovn-kubernetes-4.3/49/artifacts/e2e-aws/pods/openshift-etcd_etcd-member-ip-10-0-128-138.ec2.internal_etcd-member.log

etcd started logging errors about requests taking too long at around 11:48:00. The install likely failed to converge for this reason. Moving this ticket to the etcd team.

Adding a dependency on bug #1775878. The timeout issues noted in the logs appear to overlap. Not positive, but I wanted to draw the link.

I wanted to see the logs from the other components to check whether they were reporting network errors, but the link seems to be broken: https://gcsweb-ci.svc.ci.openshift.org/gcs/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-ovn-kubernetes-4.3/49/