Bug 1851549
| Summary: | oc commands fail intermittently with TLS Handshake timeout Errors on Azure IPI installed cluster | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Walid A. <wabouham> |
| Component: | Networking | Assignee: | mcambria <mcambria> |
| Networking sub component: | openshift-sdn | QA Contact: | Arti Sood <asood> |
| Status: | CLOSED ERRATA | Docs Contact: | |
| Severity: | high | ||
| Priority: | high | CC: | aconstan, anusaxen, aos-bugs, bbennett, kuiwang, mcambria, mfojtik, mifiedle, oarribas, palonsor, rsandu, talessio, vuberti, xxia, zzhao |
| Version: | 4.5 | Keywords: | FastFix |
| Target Milestone: | --- | ||
| Target Release: | 4.6.z | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | No Doc Update | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-07-14 07:16:34 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1967994 | ||
| Bug Blocks: | 1836052 | ||
|
Description
Walid A.
2020-06-26 22:20:40 UTC
Checked the must-gather in comment 1, found: $ grep "https://10..*:8443" namespaces/openshift-apiserver-operator/pods/openshift-apiserver-operator-688cdf6c5-qmnfc/openshift-apiserver-operator/openshift-apiserver-operator/logs/current.log ... 2020-06-26T13:09:24.644499509Z I0626 13:09:24.644452 1 event.go:278] Event(v1.ObjectReference{Kind:"Deployment", Namespace:"openshift-apiserver-operator", Name:"openshift-apiserver-operator", UID:"fce402b7-ae29-476d-a3cd-b201378f5181", APIVersion:"apps/v1", ResourceVersion:"", FieldPath:""}): type: 'Normal' reason: 'OperatorStatusChanged' Status for clusteroperator/openshift-apiserver changed: Available changed from True to False ("APIServicesAvailable: apiservices.apiregistration.k8s.io/v1.apps.openshift.io: not available: failing or missing response from https://10.129.0.64:8443/apis/apps.openshift.io/v1: Get https://10.129.0.64:8443/apis/apps.openshift.io/v1: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)") ... ^ all grep results are from the pod with IP 10.129.0.64, which is on one master $ grep "podIP.*10.129.0.64" namespaces/openshift-apiserver/pods/apiserver-65b875f65c-*/*.yaml openshift-apiserver/pods/apiserver-65b875f65c-87cv4/apiserver-65b875f65c-87cv4.yaml: podIP: 10.129.0.64 $ vi namespaces/openshift-apiserver/pods/apiserver-65b875f65c-87cv4/openshift-apiserver/openshift-apiserver/logs/current.log ... 2020-06-26T16:53:29.52567754Z I0626 16:53:29.525656 1 log.go:172] http: TLS handshake error from 10.128.0.1:52796: EOF 2020-06-26T16:53:30.611304762Z I0626 16:53:30.611244 1 log.go:172] http: TLS handshake error from 10.128.0.1:52804: EOF ^ this "TLS handshake error from 10.128.0.1:52804: EOF" is summarized in bug 1825219#c19 . Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6.38 bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2021:2641 The needinfo request[s] on this closed bug have been removed as they have been unresolved for 500 days |