Bug 2040443

Summary: pacemaker coredump with ocf:heartbeat:mysql resource
Product: Red Hat Enterprise Linux 9 Reporter: Ken Gaillot <kgaillot>
Component: pacemakerAssignee: Ken Gaillot <kgaillot>
Status: CLOSED ERRATA QA Contact: cluster-qe <cluster-qe>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 9.0CC: cluster-maint, cluster-qe, mjuricek, msmazova, nwahl, sbradley
Target Milestone: rcKeywords: Regression, Triaged
Target Release: 9.0   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: pacemaker-2.1.2-3.el9 Doc Type: No Doc Update
Doc Text:
This bug is in an intermediate build not yet released in RHEL
Story Points: ---
Clone Of: 2039675 Environment:
Last Closed: 2022-05-17 12:20:40 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2039675    
Bug Blocks:    

Description Ken Gaillot 2022-01-13 17:25:21 UTC
+++ This bug was initially created as a clone of Bug #2039675 +++

Description of problem:
pacemaker-controld repeatedly dumps a core after the ocf:heartbeat:mysql resource is configured in replication mode.


Version-Release number of selected component (if applicable):
pacemaker-2.1.2-2.el8.x86_64
resource-agents-4.9.0-12.el8.x86_64
mysql-server-8.0.26-1.module+el8.4.0+12359+b8928c02.x86_64


How reproducible:
100%

Steps to Reproduce:
1. Set up mysql database for replication
2. Configure ocf:heartbeat:mysql resource for replication


Actual results:
ocf:heartbeat:mysql resource doesnt' work as expected and pacemaker-controld dumps cores

Expected results:
No pacemaker coredumps


Additional info:

pcs cluster cib new
pcs -f new resource create mysql ocf:heartbeat:mysql \
   binary="/usr/sbin/mysqld" \
   replication_user="repl" \
   replication_passwd="repl" \
   op start timeout=120 interval=0 \
   op stop timeout=120 interval=0 \
   op promote timeout=120 interval=0 \
   op demote timeout=120 interval=0 \
   op monitor role=Master timeout=30 interval=10 \
   op monitor role=Slave timeout=30 interval=20 \
   op notify  timeout="60s" interval="0s"
pcs -f new resource promotable mysql promoted-max=1 notify=true


# coredumpctl dump
           PID: 64560 (pacemaker-contr)
           UID: 189 (hacluster)
           GID: 189 (haclient)
        Signal: 6 (ABRT)
     Timestamp: Wed 2022-01-12 09:11:06 CET (17min ago)
  Command Line: /usr/libexec/pacemaker/pacemaker-controld
    Executable: /usr/libexec/pacemaker/pacemaker-controld
 Control Group: /system.slice/pacemaker.service
          Unit: pacemaker.service
         Slice: system.slice
       Boot ID: 04164f5e50e74def99393d0d853e3960
    Machine ID: 6f854d05c83a434db011df59505082ce
      Hostname: virt-002
       Storage: /var/lib/systemd/coredump/core.pacemaker-contr.189.04164f5e50e74def99393d0d853e3960.64560.1641975066000000.lz4
       Message: Process 64560 (pacemaker-contr) of user 189 dumped core.
                
                Stack trace of thread 64560:
                #0  0x00007f54759fda4f raise (libc.so.6)
                #1  0x00007f54759d0db5 abort (libc.so.6)
                #2  0x00007f5475a40057 __libc_message (libc.so.6)
                #3  0x00007f5475a471bc malloc_printerr (libc.so.6)
                #4  0x00007f5475a47b88 malloc_consolidate (libc.so.6)
                #5  0x00007f5475a49f38 _int_malloc (libc.so.6)
                #6  0x00007f5475a4b6ee malloc (libc.so.6)
                #7  0x00007f5475e711a8 xmlDictCreate (libxml2.so.2)
                #8  0x00007f5475dacc25 xmlInitParserCtxt (libxml2.so.2)
                #9  0x00007f5475daccd1 xmlNewParserCtxt (libxml2.so.2)
                #10 0x00007f547858fbb3 string2xml (libcrmcommon.so.34)
                #11 0x000055711a52813b crmd_cs_dispatch (pacemaker-controld)
                #12 0x00007f5477ed3684 cpg_dispatch (libcpg.so.4)
                #13 0x00007f54787bf2ae pcmk_cpg_dispatch (libcrmcluster.so.29)
                #14 0x00007f547856c9c7 mainloop_gio_callback (libcrmcommon.so.34)
                #15 0x00007f547612995d g_main_context_dispatch (libglib-2.0.so.0)
                #16 0x00007f5476129d18 g_main_context_iterate.isra.21 (libglib-2.0.so.0)
                #17 0x00007f547612a042 g_main_loop_run (libglib-2.0.so.0)
                #18 0x000055711a521de3 crmd_init (pacemaker-controld)
                #19 0x000055711a521bfe main (pacemaker-controld)
                #20 0x00007f54759e9ca3 __libc_start_main (libc.so.6)
                #21 0x000055711a521c8e _start (pacemaker-controld)

<snip>

--- Additional comment from Ken Gaillot on 2022-01-13 17:19:48 UTC ---

Fixed upstream as of commit ed8b2c86a thanks to nwahl

Comment 7 errata-xmlrpc 2022-05-17 12:20:40 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (new packages: pacemaker), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:2293