Description of problem:
Since https://bugzilla.redhat.com/show_bug.cgi?id=2083120 has been fixed, we recently added D/S CI periodic jobs installing OCP with Baremetal Workers on top of OSP 16.2.
In the CI run, we observed that our setup suffers some MTU mismatch in 16.2 but not in 16.1 -
OpenShift requests an OpenStack API request to create a baremetal worker instance, which works fine in OSP16.1.6, but seems to be too long in RHOS-16.2-RHEL-8-20221124.n.1. See the log:
```
W1201 09:46:59.887172 1 controller.go:374] ostest-hsxw9-worker-0-rvh92: failed to create machine: error creating Openstack instance: error creating Openstack instance: Post "https://10.46.44.75:13774/v2.1/servers": EOF
E1201 09:46:59.887307 1 controller.go:326] "msg"="Reconciler error" "error"="error creating Openstack instance: error creating Openstack instance: Post \"https://10.46.44.75:13774/v2.1/servers\": EOF" "controller"="machine-controller" "name"="ostest-hsxw9-worker-0-rvh92" "namespace"="openshift-machine-api" "object"={"name":"ostest-hsxw9-worker-0-rvh92","namespace":"openshift-machine-api"} "reconcileID"="f5db6738-b949-4909-b198-990ba309f3fe"
```
Reducing the MTU on the VM interface from 1500 to 1400 was helped, and requests started to work fine again.
We guess that there are differences in terms of tunneling are there between 16.1 and 16.2.
Version-Release number of selected component (if applicable):
OpenShift with Baremetal Workers on top of OSP 16.2 (RHOS-16.2-RHEL-8-20221124.n.1).
How reproducible:
Always
Steps to Reproduce:
Run the openshift installer with OpenShiftSDN network type and Baremetal Workers on top of OSP 16.2.
Actual results:
OpenStack API Post gets EOF, and the baremetal worker instance is not created.
Expected results:
OpenStack API request gets correctly, and the baremetal worker instance is created.
Comment 25Luis Tomas Bolivar
2023-03-14 18:01:29 UTC