Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 636951

Summary: negotiator crash in HFS code with off-balance group config
Product: Red Hat Enterprise MRG Reporter: Pete MacKinnon <pmackinn>
Component: gridAssignee: grid-maint-list <grid-maint-list>
Status: CLOSED DUPLICATE QA Contact: MRG Quality Engineering <mrgqe-bugs>
Severity: medium Docs Contact:
Priority: low    
Version: betaCC: matt
Target Milestone: 1.3   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2010-09-23 19:23:17 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Pete MacKinnon 2010-09-23 18:57:38 UTC
09/23 14:54:32 negotiationtime: finished maxallowed calc -  slots 2 group msg quota 0.700000 nodequota 0.000000 maxAllowed 1.400000 nodemaxAllowed 0.000000 numsubmits 0  i 1
09/23 14:54:32 negotiationtime: finished maxallowed calc -  slots 2 group grid quota 0.010000 nodequota 0.000000 maxAllowed 0.020000 nodemaxAllowed 0.000000 numsubmits 0  i 2
09/23 14:54:32 negotiationtime: finished maxallowed calc -  slots 2 group rt quota 0.080000 nodequota 0.000000 maxAllowed 0.160000 nodemaxAllowed 0.000000 numsubmits 0  i 3
09/23 14:54:32 negotiationtime: finished maxallowed calc -  slots 2 group mgmt.cumin quota 0.030000 nodequota 0.000000 maxAllowed 553386531798359453103816704.000000 nodemaxAllowed 0.000000 numsubmits 0  i 4

A config like this...

GROUP_NAMES = msg, grid, mgmt.cumin, rt
GROUP_QUOTA_DYNAMIC_GRID = 0.01
#GROUP_QUOTA_DYNAMIC_MGMT = 0.01
GROUP_QUOTA_DYNAMIC_MGMT.CUMIN = 0.03
GROUP_QUOTA_DYNAMIC_MSG = 0.70
GROUP_QUOTA_DYNAMIC_RT = 0.08

leads to a crash on startup of the negotiator in what appears to be fairshare processing.

gdb>>>>

Program received signal SIGSEGV, Segmentation fault.
0x008f2c91 in _IO_vfprintf_internal (s=Cannot access memory at address 0xffffffff
) at vfprintf.c:1593
1593		  process_string_arg (((struct printf_spec *) NULL));
(gdb) where
#0  0x008f2c91 in _IO_vfprintf_internal (s=Cannot access memory at address 0xffffffff
) at vfprintf.c:1593
#1  0x080d5a44 in _condor_dprintf_va (flags=1024, 
    fmt=0x81700a0 "fairshare called on  %s auto %s index %d maxallowed %f numsubmits %d group unused %f called with unused %f \n", args=0xbfffe088 ")")
    at dprintf.c:353
#2  0x080d5ebf in dprintf (flags=1024, 
    fmt=0x81700a0 "fairshare called on  %s auto %s index %d maxallowed %f numsubmits %d group unused %f called with unused %f \n") at dprintf_common.c:76
#3  0x080a6a0c in Matchmaker::fairshare (this=0x81cc480, 
    groupArray=std::vector of length 5, capacity 8 = {...}, index=-1, 
    unused=1.84462199e+28) at matchmaker.cpp:1582
#4  0x080b2b1a in Matchmaker::negotiationTime (this=0x81cc480)
    at matchmaker.cpp:1368
#5  0x080d3657 in TimerManager::Timeout (this=0x81cc808)
    at timer_manager.cpp:419
#6  0x080c16a4 in DaemonCore::Driver (this=0x81e5bd0) at daemon_core.cpp:2934
#7  0x080cf935 in main (argc=1, argv=0xbfffec4c) at daemon_core_main.cpp:2295

Comment 1 Matthew Farrellee 2010-09-23 19:23:17 UTC

*** This bug has been marked as a duplicate of bug 636271 ***