Bug 571804 - Segmentation fault in condor_q -better
Summary: Segmentation fault in condor_q -better
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise MRG
Classification: Red Hat
Component: condor
Version: Development
Hardware: All
OS: Linux
high
high
Target Milestone: 1.3
: ---
Assignee: Matthew Farrellee
QA Contact: Martin Kudlej
URL:
Whiteboard:
: 595706 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2010-03-09 15:30 UTC by Martin Kudlej
Modified: 2018-10-27 15:19 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2010-10-20 11:28:56 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
same type of coredumps from RHEL 5.5/4.8 x i386/x86_64 with condor-debuginfo (3.13 KB, application/x-gzip)
2010-04-23 12:15 UTC, Martin Kudlej
no flags Details

Description Martin Kudlej 2010-03-09 15:30:09 UTC
Description of problem:
I've set dynamic provisioning:
NUM_CPUS=8
NUM_SLOTS=4
NUM_SLOTS_TYPE_1=1
SLOT_TYPE_1=s=auto,c=auto,d=25%,25%
SLOT_TYPE_1_PARTITIONABLE=True
NUM_SLOTS_TYPE_2=1
SLOT_TYPE_2=s=auto,c=auto,d=25%,25%
SLOT_TYPE_2_PARTITIONABLE=True
NUM_SLOTS_TYPE_3=1
SLOT_TYPE_3=s=auto,c=auto,d=25%,25%
SLOT_TYPE_3_PARTITIONABLE=True
NUM_SLOTS_TYPE_4=1
SLOT_TYPE_4=s=auto,c=auto,d=25%,25%
SLOT_TYPE_4_PARTITIONABLE=True

and run long term jobs
echo -e "executable=/bin/sleep\narguments=600\nqueue 100" | condor_submit
There should be jobs which are not running and are waiting in queue.
After condor_q -better I've got:
3418.000:  Request is being serviced

---
3418.001:  Request is being serviced

---
3418.002:  Request is being serviced

---
3418.003:  Request is being serviced

Segmentation fault

Core:
$gdb `which condor_q` core.17850
GNU gdb Fedora (6.8-37.el5)
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...
(no debugging symbols found)

warning: Can't read pathname for load map: Input/output error.

warning: .dynamic section for "/usr/lib/libk5crypto.so.3" is not at the expected address

warning: difference appears to be caused by prelink, adjusting expectations
Reading symbols from /usr/lib/libclassad_ns.so.0...(no debugging symbols found)...done.
Loaded symbols for /usr/lib/libclassad_ns.so.0
....
....
Reading symbols from /lib/libsepol.so.1...(no debugging symbols found)...done.
Loaded symbols for /lib/libsepol.so.1
Reading symbols from /lib/libnss_files.so.2...
(no debugging symbols found)...done.
Loaded symbols for /lib/libnss_files.so.2
(no debugging symbols found)
Core was generated by `condor_q -better'.
Program terminated with signal 11, Segmentation fault.
[New process 17850]
#0  0x001db5ef in std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string () from /usr/lib/libstdc++.so.6
(gdb) thread apply all bt

Thread 1 (process 17850):
#0  0x001db5ef in std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string () from /usr/lib/libstdc++.so.6
#1  0x080c4719 in std::_Rb_tree<std::string, std::string, std::_Identity<std::string>, classad::CaseIgnLTStr, std::allocator<std::string> >::_M_insert ()
#2  0x080c4852 in std::_Rb_tree<std::string, std::string, std::_Identity<std::string>, classad::CaseIgnLTStr, std::allocator<std::string> >::insert_unique ()
#3  0x080c352f in AddExplicitTargets ()
#4  0x080a413b in ClassAdAnalyzer::MakeResourceGroup ()
#5  0x080ad3f6 in ClassAdAnalyzer::AnalyzeJobAttrsToBuffer ()
#6  0x0809f100 in ?? ()
#7  0x080a10e7 in ?? ()
#8  0x080a3457 in main ()
(gdb) info threads
* 1 process 17850  0x001db5ef in std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string () from /usr/lib/libstdc++.so.6

Version-Release number of selected component (if applicable):
condor-7.4.3-0.4.el5 and i386 architecture

Actual results:
It dumps core file.

Expected results:
It should get output without segmentation fault.

Comment 2 Martin Kudlej 2010-04-23 12:15:15 UTC
Created attachment 408588 [details]
same type of coredumps from RHEL 5.5/4.8 x i386/x86_64 with condor-debuginfo

Comment 3 Matthew Farrellee 2010-05-25 17:34:28 UTC
*** Bug 595706 has been marked as a duplicate of this bug. ***

Comment 4 Matthew Farrellee 2010-05-25 17:47:30 UTC
condor 7.4.3-0.14 was built against classads 1.0.4-1

there are no ABI guarantees on classads

verify you have classads 1.0.6-1 installed

please downgrade to 1.0.4-1 (immediate workaround) and verify the issue is gone

condor 7.4.3-0.15 should be built against classads 1.0.6-1 (with a hard dep)

Comment 7 Martin Kudlej 2010-06-11 09:09:23 UTC
Teste on RHEL5.5/4.8 x x86_64/i386 with condor-7.4.3-0.17, classads-1.0.6-1 and it works. -->VERIFIED


Note You need to log in before you can comment on or make changes to this bug.