Bug 2172097

Summary: ovn-controller core dumped
Product: Red Hat Enterprise Linux Fast Datapath Reporter: David Hill <dhill>
Component: ovn2.13Assignee: Mark Michelson <mmichels>
Status: MODIFIED --- QA Contact: Ehsan Elahi <eelahi>
Severity: medium Docs Contact:
Priority: high    
Version: FDP 21.ICC: ctrautma, jiji, jishi, ljozsa, mmichels, ralongi, xsimonar
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: ovn2.13-20.12.0-201.el8fdp Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description David Hill 2023-02-21 13:42:10 UTC
Description of problem:
ovn-controller core dumped and the corefile returns this:

Core was generated by `/usr/bin/ovn-controller --pidfile --log-file unix:/run/openvswitch/db.sock'.
Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x000055fe51666847 in handle_deleted_lport ()
[Current thread is 1 (LWP 8)]
Missing separate debuginfos, use: yum debuginfo-install ovn2.13-host-20.12.0-196.el8fdp.x86_64
(gdb) bt
#0  0x000055fe51666847 in handle_deleted_lport ()
#1  0x000055fe5166a1c7 in binding_handle_port_binding_changes ()
#2  0x000055fe51690978 in runtime_data_sb_port_binding_handler ()
#3  0x000055fe516ab9db in engine_run ()
#4  0x000055fe51663b75 in main ()

Do we have debuginfo for ovn ?  I couldn't find it but it crashed there on RHOSP16.1.9 -5


Version-Release number of selected component (if applicable):
Latest

How reproducible:
This once

Steps to Reproduce:
1. Unknown
2.
3.

Actual results:
core dumped and we're sad about it

Expected results:
No core dumps so we're happy about it

Additional info:

Comment 6 Mark Michelson 2023-03-28 21:04:57 UTC
I have bumped the OVS submodule for ovn2.13. I have pushed the change and created a build for it as well. The build is ovn2.13-20.12.0-198.el8fdp .

I'm marking this issue as MODIFIED since this should be fixed now.

Comment 7 Mark Michelson 2023-03-30 14:48:27 UTC
I'm moving this back to ASSIGNED. The builds that I created yesterday all succeeded except for x86_64. It failed due to a crash in a test program. I have reproduced it locally and am debugging it. I'll update the issue back to MODIFIED and set the proper fixed-in version once I have the crash fixed. I suspect the crash is just in the test program and not due to anything wrong in OVN itself.

Comment 8 Mark Michelson 2023-03-30 19:48:31 UTC
I fixed the issues with the build. The fixed-in version is ovn2.13-20.12.0-201.el8fdp