Bug 1414907

Summary: POD Readiness probe fails due to POD not answering TCP SYN packages
Product: OpenShift Container Platform Reporter: Miheer Salunke <misalunk>
Component: NetworkingAssignee: Dan Winship <danw>
Status: CLOSED INSUFFICIENT_DATA QA Contact: Meng Bo <bmeng>
Severity: medium Docs Contact:
Priority: high    
Version: 3.2.1CC: aos-bugs, atragler, bbennett, danw, dcbw, eparis, erich, misalunk, mleitner, pdwyer, sukulkar, zzhao
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-06-02 14:55:41 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 7 Dan Winship 2017-01-31 19:50:22 UTC
The dump-flows output show that there are pods with duplicate IP addresses. And so the health check fails for this pod because the packets are getting sent to the *other* pod that has that IP address.

Probably this was caused by some sort of bug in the 3.2->3.3 upgrade; the journal output attached above doesn't go back far enough to show how we originally got into this situation.

I think the simplest fix is to make the node unschedulable, delete all of the pods on it, reboot the node, and then make it schedulable again. Assuming the problem actually was upgrade-related, it shouldn't recur.

Comment 15 Ben Bennett 2017-04-19 14:02:00 UTC
Closing due to inactivity.  Please re-open if it is still happening, and provide the requested logs.

Comment 22 Red Hat Bugzilla 2023-09-14 03:52:27 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days