Bug 1414907

Summary:	POD Readiness probe fails due to POD not answering TCP SYN packages
Product:	OpenShift Container Platform	Reporter:	Miheer Salunke <misalunk>
Component:	Networking	Assignee:	Dan Winship <danw>
Status:	CLOSED INSUFFICIENT_DATA	QA Contact:	Meng Bo <bmeng>
Severity:	medium	Docs Contact:
Priority:	high
Version:	3.2.1	CC:	aos-bugs, atragler, bbennett, danw, dcbw, eparis, erich, misalunk, mleitner, pdwyer, sukulkar, zzhao
Target Milestone:	---
Target Release:	---
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2017-06-02 14:55:41 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Comment 7 Dan Winship 2017-01-31 19:50:22 UTC

The dump-flows output show that there are pods with duplicate IP addresses. And so the health check fails for this pod because the packets are getting sent to the *other* pod that has that IP address.

Probably this was caused by some sort of bug in the 3.2->3.3 upgrade; the journal output attached above doesn't go back far enough to show how we originally got into this situation.

I think the simplest fix is to make the node unschedulable, delete all of the pods on it, reboot the node, and then make it schedulable again. Assuming the problem actually was upgrade-related, it shouldn't recur.

Comment 15 Ben Bennett 2017-04-19 14:02:00 UTC

Closing due to inactivity.  Please re-open if it is still happening, and provide the requested logs.

Comment 22 Red Hat Bugzilla 2023-09-14 03:52:27 UTC

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days