Bug 746936

Summary: openais generates core files during cman_tool join/leave loop
Product: Red Hat Enterprise Linux 5
Component: cman
Version: 5.3.z
Status: CLOSED DUPLICATE
Severity: high
Priority: unspecified
Reporter: Martin Juricek <mjuricek>
Assignee: Lon Hohberger <lhh>
QA Contact: Cluster QE <mspqa-list>
CC: cluster-maint, edamato
Target Milestone: rc
Target Release: ---
Hardware: Unspecified
OS: Linux
Doc Type: Bug Fix
Last Closed: 2012-02-10 17:04:39 UTC

Description Martin Juricek 2011-10-18 10:24:56 UTC
Description of problem:

I hit this problem while testing bz739084. A simple bash loop running cman_tool join and cman_tool leave was started on all cluster nodes (3-node cluster, ia64 arch). As a result, core files were generated in /var/lib/openais on all nodes.


Version-Release number of selected component (if applicable):
openais-0.80.3-22.el5_3.17
openais-0.80.3-22.el5_3.16

How reproducible:
100%

Steps to Reproduce:
1. Enable core file creation.
2. Run a simple bash loop with cman_tool join/leave on all cluster nodes:
   ( while true; do cman_tool join; cman_tool leave; done )
3. Core files are generated in /var/lib/openais.
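
For anyone re-running this, the steps above can be sketched as a bounded loop that stops at the first new core file instead of running forever. This is only a sketch: CORE_DIR, count_cores and join_leave_loop are made-up helper names, not part of cman or openais, and it assumes core files are named core.<pid> as in the gdb session in this report.

```shell
# Illustrative helpers; CORE_DIR, count_cores and join_leave_loop are
# hypothetical names, not part of cman or openais.
CORE_DIR="${CORE_DIR:-/var/lib/openais}"   # directory named in step 3

# Print the number of core.<pid> files directly inside a directory.
count_cores() {
    find "${1:-$CORE_DIR}" -maxdepth 1 -type f -name 'core.*' 2>/dev/null | wc -l | tr -d ' '
}

# Run cman_tool join/leave up to N times (default 100).
# Returns 1 as soon as a new core file shows up, 0 otherwise.
join_leave_loop() {
    iterations="${1:-100}"
    before=$(count_cores)
    i=1
    while [ "$i" -le "$iterations" ]; do
        cman_tool join
        cman_tool leave
        if [ "$(count_cores)" -gt "$before" ]; then
            echo "new core file after iteration $i"
            return 1
        fi
        i=$((i + 1))
    done
    echo "no new core files after $iterations iterations"
}
```

One possible way to use it, as root on each node, is to run "ulimit -c unlimited" in the same shell and then "join_leave_loop 100". Whether ulimit matches how core files were enabled for step 1 is an assumption; the report does not say.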
  
Actual results:
-core files generated

Expected results:
-no core files

Additional info:

gdb output for a core file generated while the cman_tool join/leave loop was running:

[root@a2 cores_2]# gdb `which aisexec` core.9109
GNU gdb Fedora (6.8-27.el5)
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "ia64-redhat-linux-gnu"...

warning: Can't read pathname for load map: Input/output error.
Reading symbols from /lib/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /lib/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib/libc.so.6.1...done.
Loaded symbols for /lib/libc.so.6.1
Reading symbols from /lib/ld-linux-ia64.so.2...done.
Loaded symbols for /lib/ld-linux-ia64.so.2
Reading symbols from /usr/libexec/lcrso/objdb.lcrso...Reading symbols from
/usr/lib/debug/usr/libexec/lcrso/objdb.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
Reading symbols from /usr/libexec/lcrso/service_cman.lcrso...done.
Loaded symbols for /usr/libexec/lcrso/service_cman.lcrso
Reading symbols from /lib/libnss_files.so.2...done.
Loaded symbols for /lib/libnss_files.so.2
Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_cpg.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_cfg.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
Reading symbols from /usr/libexec/lcrso/service_msg.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_msg.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_msg.lcrso
Reading symbols from /usr/libexec/lcrso/service_lck.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_lck.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_lck.lcrso
Reading symbols from /usr/libexec/lcrso/service_evt.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_evt.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_evt.lcrso
Reading symbols from /usr/libexec/lcrso/service_ckpt.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_ckpt.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_ckpt.lcrso
Reading symbols from /usr/libexec/lcrso/service_amf.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_amf.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_amf.lcrso
Reading symbols from /usr/libexec/lcrso/service_clm.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_clm.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_clm.lcrso
Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...Reading symbols
from /usr/lib/debug/usr/libexec/lcrso/service_evs.lcrso.debug...done.
done.
Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
Reading symbols from /lib/libgcc_s.so.1...done.
Loaded symbols for /lib/libgcc_s.so.1
Core was generated by `aisexec'.
Program terminated with signal 11, Segmentation fault.
[New process 9109]
[New process 9110]
#0  0x2000000000d6f7c0 in unbind_con () from /usr/libexec/lcrso/service_cman.lcrso
(gdb) bt
#0  0x2000000000d6f7c0 in unbind_con () from /usr/libexec/lcrso/service_cman.lcrso
#1  0x2000000000d68f40 in ?? () from /usr/libexec/lcrso/service_cman.lcrso
#2  0x2000000000d69390 in ?? () from /usr/libexec/lcrso/service_cman.lcrso
Backtrace stopped: previous frame inner to this frame (corrupt stack?)
(gdb)
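
Frames #1 and #2 show up as ??, presumably because no separate debug file was loaded for service_cman.lcrso (unlike the other lcrso modules in the session above). Since a core file is produced on every node, a non-interactive run over all of them may be easier to attach to the bug. The following is a rough sketch, not a tested procedure: collect_backtraces is a made-up helper name, and the default directory is the one from step 3.

```shell
# Illustrative helper; collect_backtraces is a hypothetical name.
# Prints a backtrace of every thread for each core.<pid> file in a directory.
collect_backtraces() {
    dir="${1:-/var/lib/openais}"      # core directory from step 3
    exe="${2:-$(command -v aisexec)}" # binary that produced the cores
    for core in "$dir"/core.*; do
        [ -f "$core" ] || continue    # skip the unexpanded glob when no cores exist
        echo "== $core =="
        gdb --batch -ex "thread apply all bt" "$exe" "$core" 2>&1
    done
}
```

Calling collect_backtraces with no arguments uses /var/lib/openais and whatever aisexec is on the PATH; both can be passed explicitly as the first and second argument.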

Comment 1 Steven Dake 2011-10-18 13:17:12 UTC
This is Bug #590101.  That problem has been fixed in the latest version of cman.

Comment 2 Lon Hohberger 2012-02-10 17:04:39 UTC

*** This bug has been marked as a duplicate of bug 590101 ***