
Bug 2153801

Summary: [OSP 16.2] [OVN-DVR] [Overcloud BM] Setup suffers some MTU mismatch in 16.2 but not in 16.1
Product: Red Hat OpenStack Reporter: Itay Matza <imatza>
Component: python-networking-ovn Assignee: Luis Tomas Bolivar <ltomasbo>
Status: CLOSED DUPLICATE QA Contact: Eran Kuris <ekuris>
Severity: high Docs Contact:
Priority: high    
Version: 16.2 (Train) CC: apevec, chrisw, jlibosva, lhh, lmartins, ltomasbo, majopela, mdulko, mlavalle, ralonsoh, scohen, ykarel
Target Milestone: z5 Keywords: Triaged
Target Release: 16.2 (Train on RHEL 8.4)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 2174346 (view as bug list) Environment:
Last Closed: 2023-03-14 18:01:29 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2174346    
Bug Blocks:    

Description Itay Matza 2022-12-15 14:23:16 UTC
Description of problem:
Since https://bugzilla.redhat.com/show_bug.cgi?id=2083120 was fixed, we recently added downstream (D/S) CI periodic jobs that install OCP with baremetal workers on top of OSP 16.2.

In the CI runs, we observed that our setup suffers from an MTU mismatch in 16.2 but not in 16.1.
OpenShift sends an OpenStack API request to create a baremetal worker instance. The request works fine in OSP 16.1.6, but it appears to be too large in RHOS-16.2-RHEL-8-20221124.n.1, where the POST ends with EOF. See the log:
```
W1201 09:46:59.887172       1 controller.go:374] ostest-hsxw9-worker-0-rvh92: failed to create machine: error creating Openstack instance: error creating Openstack instance: Post "https://10.46.44.75:13774/v2.1/servers": EOF

E1201 09:46:59.887307       1 controller.go:326]  "msg"="Reconciler error" "error"="error creating Openstack instance: error creating Openstack instance: Post \"https://10.46.44.75:13774/v2.1/servers\": EOF" "controller"="machine-controller" "name"="ostest-hsxw9-worker-0-rvh92" "namespace"="openshift-machine-api" "object"={"name":"ostest-hsxw9-worker-0-rvh92","namespace":"openshift-machine-api"} "reconcileID"="f5db6738-b949-4909-b198-990ba309f3fe" 
```

Reducing the MTU on the VM interface from 1500 to 1400 helped, and requests started to work fine again.
We suspect that tunneling is handled differently between 16.1 and 16.2.
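
For illustration only, here is a rough sketch of the arithmetic behind that guess. It assumes the traffic is carried in a Geneve tunnel over IPv4 on a 1500-byte underlay, with the header sizes listed in the code. None of these figures were measured on this setup, and the function names are made up for the example.

```python
# Assumed per-packet overhead, in bytes, for Geneve over IPv4 with OVN options.
# These values are illustrative assumptions, not taken from this environment.
OUTER_IPV4_HEADER = 20
OUTER_UDP_HEADER = 8
GENEVE_BASE_HEADER = 8
GENEVE_OVN_OPTIONS = 8
INNER_ETHERNET_HEADER = 14


def tunnel_overhead() -> int:
    """Total bytes added to each inner IP packet by the assumed encapsulation."""
    return (
        OUTER_IPV4_HEADER
        + OUTER_UDP_HEADER
        + GENEVE_BASE_HEADER
        + GENEVE_OVN_OPTIONS
        + INNER_ETHERNET_HEADER
    )


def max_inner_mtu(underlay_mtu: int) -> int:
    """Largest inner MTU that still fits in one underlay packet."""
    return underlay_mtu - tunnel_overhead()


def fits(inner_mtu: int, underlay_mtu: int) -> bool:
    """True if a full-size inner packet can be tunneled without fragmentation."""
    return inner_mtu <= max_inner_mtu(underlay_mtu)


if __name__ == "__main__":
    underlay = 1500  # assumed physical network MTU
    for inner in (1500, 1400):
        verdict = "fits" if fits(inner, underlay) else "does not fit"
        print(f"inner MTU {inner}: {verdict} (limit {max_inner_mtu(underlay)})")
```

Under these assumptions a 1500-byte packet from the VM would not fit once encapsulated, while a 1400-byte packet would, which would be consistent with the workaround above. Whether this is the actual cause here is not confirmed.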
 

Version-Release number of selected component (if applicable):
OpenShift with Baremetal Workers on top of OSP 16.2 (RHOS-16.2-RHEL-8-20221124.n.1).

How reproducible:
Always

Steps to Reproduce:
Run the OpenShift installer with the OpenShiftSDN network type and baremetal workers on top of OSP 16.2.

Actual results:
The OpenStack API POST request fails with EOF, and the baremetal worker instance is not created.


Expected results:
The OpenStack API request succeeds, and the baremetal worker instance is created.

Comment 25 Luis Tomas Bolivar 2023-03-14 18:01:29 UTC
There is a better solution (distributed instead of centralized) being handled as part of https://bugzilla.redhat.com/show_bug.cgi?id=2177155, so I am closing this one in favor of the other.

*** This bug has been marked as a duplicate of bug 2177155 ***