Bug 1805487
| Summary: | oc command not work any more after rebooting all nodes | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Weibin Liang <weliang> |
| Component: | Etcd | Assignee: | Suresh Kolichala <skolicha> |
| Status: | CLOSED ERRATA | QA Contact: | ge liu <geliu> |
| Severity: | urgent | Docs Contact: | |
| Priority: | high | ||
| Version: | 4.4 | CC: | aconstan, jdesousa, mfojtik |
| Target Milestone: | --- | Flags: | weliang: needinfo- |
| Target Release: | 4.4.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2020-05-04 11:38:36 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
Description
Weibin Liang
2020-02-20 21:25:21 UTC
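This happens regardless of the SDN plugin in use, and the issue appears to be in etcd. On a master node, the etcd container keeps restarting (attempt 16, created 1 second ago) while the other containers have restarted only once: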
# crictl ps
CONTAINER IMAGE CREATED STATE NAME ATTEMPT POD ID
e9efb5c2ed817 5bcff854afb83e019bbe7a4ccf66ddc9e7f3a56cfd5ca98dad24f807a8d9cc5d 1 second ago Running etcd 16 54341e8fdae1a
ae31dc1f52f60 dee5b59b53245bbb743f4b61ff4e4cf662e919ae15a7705ff4c54ff0d60c5282 15 minutes ago Running kube-apiserver-insecure-readyz 1 a81cdd7fc52ec
bd21ec14cc8e5 b28324f4fa8d6103e8dd542ae7c61f2930be68140244f902ec15c9151463f9a7 15 minutes ago Running kube-controller-manager-cert-syncer 1 861445b239a11
d12bc3a66a32e dee5b59b53245bbb743f4b61ff4e4cf662e919ae15a7705ff4c54ff0d60c5282 15 minutes ago Running kube-apiserver-cert-syncer 1 a81cdd7fc52ec
938d6d7e16508 09d121f059abf8e5e217b666fedf0aa1607966bc5878be08e87d5178202f4c71 15 minutes ago Running cluster-policy-controller 1 861445b239a11
2789eed0edaef c7ee309a23bf38345d94aa2cee0c1bb8ed91184309e26db18affdc2cf74ffcdb 15 minutes ago Running scheduler 1 e3e8a82d232f4
5705b73f70333 c7ee309a23bf38345d94aa2cee0c1bb8ed91184309e26db18affdc2cf74ffcdb 15 minutes ago Running kube-controller-manager 1 861445b239a11
18c17ea0b8141 5bcff854afb83e019bbe7a4ccf66ddc9e7f3a56cfd5ca98dad24f807a8d9cc5d 15 minutes ago Running etcd-metrics 1 54341e8fdae1a
# crictl logs -f e9efb5c2ed817
{"level":"warn","ts":"2020-02-24T17:05:49.183Z","caller":"clientv3/retry_interceptor.go:61","msg":"retrying of unary invoker failed","target":"endpoint://client-c4107ef1-452f-4ae6-8032-6953f06b1696/10.0.61.129:2379","attempt":0,"error":"rpc error: code = DeadlineExceeded desc = latest connection error: connection error: desc = \"transport: Error while dialing dial tcp 10.0.64.254:2379: connect: connection refused\""}
Error: context deadline exceeded
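The `crictl ps` listing above can be screened for restart loops mechanically. The following is a minimal sketch, not part of the original report: the function name `restart_suspects` and the threshold of 5 attempts are illustrative assumptions, and the sample rows are copied from the output above. It reads the NAME and ATTEMPT columns by counting from the right, because the CREATED column spans several words.

```python
# Three rows copied verbatim from the `crictl ps` output in this report.
SAMPLE = """\
CONTAINER IMAGE CREATED STATE NAME ATTEMPT POD ID
e9efb5c2ed817 5bcff854afb83e019bbe7a4ccf66ddc9e7f3a56cfd5ca98dad24f807a8d9cc5d 1 second ago Running etcd 16 54341e8fdae1a
ae31dc1f52f60 dee5b59b53245bbb743f4b61ff4e4cf662e919ae15a7705ff4c54ff0d60c5282 15 minutes ago Running kube-apiserver-insecure-readyz 1 a81cdd7fc52ec
18c17ea0b8141 5bcff854afb83e019bbe7a4ccf66ddc9e7f3a56cfd5ca98dad24f807a8d9cc5d 15 minutes ago Running etcd-metrics 1 54341e8fdae1a
"""


def restart_suspects(ps_output, threshold=5):
    """Return (name, attempt) pairs for containers restarted many times.

    `threshold` is an arbitrary, hypothetical cut-off chosen for this sketch.
    """
    suspects = []
    for line in ps_output.splitlines():
        fields = line.split()
        # CREATED is multi-word ("15 minutes ago"), so index from the right:
        # ... NAME ATTEMPT POD_ID
        if len(fields) < 3 or not fields[-2].isdigit():
            continue  # header line or malformed row
        name, attempt = fields[-3], int(fields[-2])
        if attempt >= threshold:
            suspects.append((name, attempt))
    return suspects


if __name__ == "__main__":
    print(restart_suspects(SAMPLE))
```

On the sample rows only the etcd container exceeds the threshold, which matches the observation that etcd is the component failing after the reboot.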
Tested and verified in 4.4.0-0.nightly-2020-03-11-095741.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2020:0581