Bug 2062558 - Egress IP with openshift sdn in not functional on worker node.
Summary: Egress IP with openshift sdn in not functional on worker node.
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 4.8
Hardware: x86_64
OS: Linux
high
high
Target Milestone: ---
: 4.11.0
Assignee: Surya Seetharaman
QA Contact: huirwang
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-03-10 06:25 UTC by Ashish Sharma
Modified: 2022-11-22 13:38 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-08-10 10:53:13 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift sdn pull 418 0 None open Bug 2062558: egressip: Continue to process other nodes if a node is not ready 2022-03-29 11:03:50 UTC
Red Hat Product Errata RHSA-2022:5069 0 None None None 2022-08-10 10:53:49 UTC

Description Ashish Sharma 2022-03-10 06:25:31 UTC
Description of problem:
Egress IP with openshift sdn in not functional on worker node, We are able to map the egress IP on the impacted node but after some time, We can check the egress ip details in "oc get hostsubnet" output, but the IP is not present with node primary interface.

We get following errors in the node sdn pod logs.
Node 10.78.46.24 is offline
2022-03-08T10:20:25.055872094Z W0308 10:20:25.055771    4118 egressip.go:242] Node 10.78.46.24 is offline

Also when this error messages are appeared in the sdn logs, egress ip is not moved to another node.

We are getting below error with insight rules.

"Node %s may be offline... retrying" appears in the sdn-controller log more than 5 times a minute for all nodes combined:




Version-Release number of selected component (if applicable):
ocp 4.8.28


How reproducible:
Its reproducible in CU environment on the impacted node.


Steps to Reproduce:
1.
2.
3.

Actual results:

Egress IP in not functional.


Expected results:
It should work as expected with automatic CIDR.


Additional info:

Must gather and sosreport is captured at the time of issue is available in support shell with following names.

drwxrwxrwx. 3 yank     yank           76 Mar  9 07:57 0280-sosreport-jprocpuatapp01-0296610022-2022-03-09-gfcftrd.tar.xz
drwxrwxrwx. 3 yank     yank           59 Mar  9 08:00 0290-must-gather.local.1936115711644553329.tar.xz

Comment 26 errata-xmlrpc 2022-08-10 10:53:13 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069


Note You need to log in before you can comment on or make changes to this bug.