Bug 1689690
Summary: | starter-us-east-2 & 2a experienced outage because kube ip not present in master iptables: tcp 172.30.0.1:443: getsockopt: no route to host | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Justin Pierce <jupierce> |
Component: | Networking | Assignee: | Dan Williams <dcbw> |
Networking sub component: | openshift-sdn | QA Contact: | zhaozhanqi <zzhao> |
Status: | CLOSED WONTFIX | Docs Contact: | |
Severity: | high | ||
Priority: | high | CC: | aos-bugs, bbennett, danw, dcbw, dedgar, dmace, emahoney, eparis, jeder, jgoulding, mbenitez, pbergene, scuppett |
Version: | 3.11.0 | Keywords: | OpsBlocker |
Target Milestone: | --- | ||
Target Release: | 3.11.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-11-20 14:42:47 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Justin Pierce
2019-03-17 18:57:02 UTC
- In at least one situation, the listed procedure was not sufficient. I had to reboot the master before the 172.30.0.1 iptables entry was established. - In clearing cluster 2a, I also had to delete the router pods to get the web console loading again. Just for some background, it would be very unusual for issues with the PSAD iptables rule to be popping up now. It's been in place across all starter and OSIO clusters for upwards of 6 months across multiple reboots with no reported issues. My guess is that something else changed within iptables recently, which either: 1. Clashes with the existing PSAD rule 2. Changes something that the PSAD rule expected to stay the same, suddenly making it invalid PSAD configuration management has been disabled and, and its projects have been removed from OSIO and starter clusters to aid the troubleshooting efforts. I'll try to reproduce the issues in int or stg in the meantime. *** Bug 1651784 has been marked as a duplicate of this bug. *** *** Bug 1665763 has been marked as a duplicate of this bug. *** *** Bug 1668414 has been marked as a duplicate of this bug. *** |