Created attachment 1655844 [details] install log bundle file Description of problem: Trying to bring up ovnkube cluster on geneve port 9081 but its failing in early stages. Also 9000-9999 are allowed in inbound rules on AWS. However cluster configures fine on default udp geneve port 6081. Version-Release number of selected component (if applicable): 4.4.0-0.nightly-2020-01-24-141203 How reproducible: Always Steps to Reproduce: 1. Create "cluster-network-03-config.yml" manifest and bring up cluster apiVersion: operator.openshift.io/v1 kind: Network metadata: name: cluster spec: clusterNetwork: - cidr: 10.128.0.0/14 hostPrefix: 23 serviceNetwork: - 172.30.0.0/16 defaultNetwork: type: OVNKubernetes ovnKubernetesConfig: mtu: 1600 genevePort: 9081 <<<<<<<<<<<<<<<<< 2. ./openshift-install create cluster Actual results: Cluster fails to come up when geneveport is set to 9081 Expected results: Cluster should come up fine when geneve port is set to 9081 Additional info:logs bundle file is attached with this bug $ oc get nodes NAME STATUS ROLES AGE VERSION ip-10-0-132-75.ap-south-1.compute.internal NotReady master 89m v1.17.1 ip-10-0-157-36.ap-south-1.compute.internal NotReady master 89m v1.17.1 ip-10-0-173-125.ap-south-1.compute.internal NotReady master 89m v1.17.1 $ ip a 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000 link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: ens3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9001 qdisc mq state UP group default qlen 1000 link/ether 02:2e:5f:bd:e9:62 brd ff:ff:ff:ff:ff:ff inet 10.0.10.43/20 brd 10.0.15.255 scope global dynamic noprefixroute ens3 valid_lft 2673sec preferred_lft 2673sec inet6 fe80::5248:4e16:a88f:7a6b/64 scope link noprefixroute valid_lft forever preferred_lft forever
@anurag Did you also do the "create install-config" and change type to OVN? It also didn't pick up the mtu change.
@Phil Yes, and I exactly followed these steps https://bugzilla.redhat.com/show_bug.cgi?id=1793720#c5 except I added "genevePort: 9081" as well along with mtu as mentioned in comment1
@anurag See: https://bugzilla.redhat.com/show_bug.cgi?id=1793720 comment #5 for testing this. Also, # oc get cm ovnkube-config -n openshift-ovn-kubernetes -oyaml | grep "^data:" -A17 data: should show the "encap-port=" to have your selected port. "mtu=" will show the mtu. Both 1793720 and 1795454 can use the same test. This also shows that SDN-456 is working.
This appears to me to be a real bug. The geneve tunnel is being configured wrong by ovnkube. Testing a fix.
Have a fix that works for me on an upstream setup here: https://github.com/ovn-org/ovn-kubernetes/pull/1030
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:0581