Bug 1942506 - [4.6] dual stack nodes with OVN single ipv6 fails on bootstrap phase
Summary: [4.6] dual stack nodes with OVN single ipv6 fails on bootstrap phase
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 4.6
Hardware: x86_64
OS: Linux
high
high
Target Milestone: ---
: 4.6.z
Assignee: Dan Winship
QA Contact: Daniel Del Ciancio
URL:
Whiteboard:
Depends On: 1942488
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-03-24 13:31 UTC by Dan Winship
Modified: 2021-08-12 01:36 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Cause: When bringing up a single-stack IPv6 cluster on nodes with IPv4 addresses, kubelet might use the IPv4 IP as the node IP rather than the IPv6 IP Consequence: host-network pods have IPv4 IPs rather than IPv6, making them unreachable from IPv6-only pods Fix: the node-IP-picking code was fixed to handle this case Result: nodes will have IPv6 IPs, not IPv4
Clone Of: 1942488
Environment:
Last Closed: 2021-08-12 01:36:41 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
openshift-install-log (69.95 KB, text/plain)
2021-06-28 17:17 UTC, raj.sarvaiya@bell.ca
no flags Details
must-gather part 1 (12.00 MB, application/gzip)
2021-06-28 17:32 UTC, raj.sarvaiya@bell.ca
no flags Details
must-gather part 2 (9.86 MB, application/octet-stream)
2021-06-28 17:33 UTC, raj.sarvaiya@bell.ca
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Github openshift baremetal-runtimecfg pull 133 0 None open Bug 1942506: 4.6 EUS node-ip handling extravaganza 2021-05-18 20:08:40 UTC
Github openshift machine-config-operator pull 2589 0 None open Bug 1942506: [4.6] Use new --prefer-ipv6 flag to "runtimecfg node-ip" as appropriate 2021-05-24 15:42:50 UTC
Red Hat Product Errata RHBA-2021:3008 0 None None None 2021-08-12 01:36:51 UTC

Comment 2 Dan Winship 2021-05-21 20:03:29 UTC
needs an MCO fix too

Comment 8 Daniel Del Ciancio 2021-06-15 19:44:14 UTC
@raj.sarvaiya I know you have been trying to install a cluster on a newer release (4.7+), but can you verify this bug on 4.6 ?

Comment 11 Daniel Del Ciancio 2021-06-18 18:45:05 UTC
Customer responded that they will test this early next week since there’s no free dev envs at the moment.
Once they have results from their testing, they will reply to the BZ directly.

Comment 16 raj.sarvaiya@bell.ca 2021-06-28 17:15:51 UTC
Installation completed successfully. Attaching .openshift_install.log and generating must-gather

Comment 17 raj.sarvaiya@bell.ca 2021-06-28 17:17:37 UTC
Created attachment 1795508 [details]
openshift-install-log

openshift-install.log

Comment 18 raj.sarvaiya@bell.ca 2021-06-28 17:26:29 UTC
By the way multus issue that was seen previously is still there

> oc describe po envoyv4v6-67f58fc786-sfn9j | grep -i macvlan
> ...
>   Warning  FailedCreatePodSandBox  32s (x108 over 28m)  kubelet            (combined from similar events): Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_envoyv4v6-67f58fc786-sfn9j_bell-services_16d7fd8c-d88d-43c7-ae82-5cda8e1cc940_0(fd82738b09362927f2fbbb5dc4e25c3b38bf9fd5a6a7d651c6ca57641aa52e6e): [bell-services/envoyv4v6-67f58fc786-sfn9j:envoyv4]: error adding container to network "envoyv4": failed to create macvlan: device or resource busy

Comment 19 raj.sarvaiya@bell.ca 2021-06-28 17:32:53 UTC
Created attachment 1795509 [details]
must-gather part 1

must-gather part 1 (aa)

Comment 20 raj.sarvaiya@bell.ca 2021-06-28 17:33:29 UTC
Created attachment 1795510 [details]
must-gather part 2

must-gather part 2 (ab)

Comment 26 Daniel Del Ciancio 2021-07-22 17:33:38 UTC
Hi @raj.sarvaiya  - Can you verify that the cluster still installs fine minus the multus issue ?  Our QE team needs to verify the original install bug.  Let us know if this can be considered "verified" so we can issue the errata.

Comment 27 Daniel Del Ciancio 2021-07-22 17:33:38 UTC
Hi @raj.sarvaiya  - Can you verify that the cluster still installs fine minus the multus issue ?  Our QE team needs to verify the original install bug.  Let us know if this can be considered "verified" so we can issue the errata.

Comment 28 raj.sarvaiya@bell.ca 2021-07-22 17:37:00 UTC
Excluding the multus issue, the cluster does install fine

Comment 36 errata-xmlrpc 2021-08-12 01:36:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.6.42 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3008


Note You need to log in before you can comment on or make changes to this bug.