Bug 1992281
Summary: | ovn-northd and ovsdb-server core dumps in upgrade job (4.8 -> 4.9) | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux Fast Datapath | Reporter: | jamo luhrsen <jluhrsen> |
Component: | ovn2.12 | Assignee: | OVN Team <ovnteam> |
Status: | CLOSED DUPLICATE | QA Contact: | Jianlin Shi <jishi> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | FDP 20.I | CC: | anbhat, ctrautma, dcbw, i.maximets, jiji, ralongi |
Target Milestone: | --- | ||
Target Release: | FDP 20.I | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-08-17 16:46:44 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
jamo luhrsen
2021-08-10 20:49:01 UTC
Hi. This looks very much as duplicate of BZ1957030. The sequence of events is following: 1. Container is terminating sending SIGTERM to the process inside. 2. Process intercepts the SIGTERM, finalizes few things and tries to re-raise the signal to terminate itself with a correct exit code. 3. Process calls signal(SIGTERM, SIG_DFL); raise(SIGTERM); 4. raise(SIGTERM) fails inside glibc! 5. process tries to terminate itself with abort() 6. abort() inside glibc calls raise(SIGABRT) and fails! 7. After few attempts to raise a signal, glibc gives up and executes ABORT_INSTRUCTION that basically generates SIGSEGV. 8. SIGSEGV terminates the process with a coredump. There is nothing really can be done from the application. The only way to fix that is to figure out why glibc fails raising of the signal. So, it's not an issue of OVN or OVS. If anything, this BZ should be re-assigned to glibc for the investigation. yes, agreed it is a dup of bz1957030. Not sure how I missed that in my original search for this. *** This bug has been marked as a duplicate of bug 1957030 *** The needinfo request[s] on this closed bug have been removed as they have been unresolved for 500 days |