Bug 883882

Summary: sssd-1.9.2-24 refuses to start after update, segmentation fault
Product: Red Hat Enterprise Linux 6 Reporter: Taunus <codezilla>
Component: sssdAssignee: Jakub Hrozek <jhrozek>
Status: CLOSED DUPLICATE QA Contact: Kaushik Banerjee <kbanerje>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 6.4CC: grajaiya, jgalipea, pbrezina
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-12-06 09:43:46 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Taunus 2012-12-05 13:51:01 UTC
Description of problem:
sssd refuses to start after update, segmentation fault

Version-Release number of selected component (if applicable):
sssd-1.9.2-24.el6.x86_64

How reproducible:
update RHEL 6.3 to RHEL 6.4 beta

Steps to Reproduce:
1. download iso
2. mount -o loop /home/user/Downloads/rhel-workstation-6.4-beta-x86_64-dvd.iso /mnt/iso
3. create repo file and yum -y update
  
Actual results:
update fails, sssd-1.9.2-24.el6.x86_64 fails to start. System becomes unstable.

Expected results:
sssd updates and starts up, update succeeds

Additional info:
Looks like segmentation fault "Child [a.domain.com] terminated with signal [11]"
See below


[root@host ~]# /usr/sbin/sssd -i -d 5
(Wed Dec  5 15:26:46 2012) [sssd] [sysdb_domain_init_internal] (0x0200): DB File for a.domain.com: /var/lib/sss/db/cache_a.domain.com.ldb
(Wed Dec  5 15:26:46 2012) [sssd] [get_ping_config] (0x0100): Time between service pings for [a.domain.com]: [10]
(Wed Dec  5 15:26:46 2012) [sssd] [get_ping_config] (0x0100): Time between SIGTERM and SIGKILL for [a.domain.com]: [60]
(Wed Dec  5 15:26:46 2012) [sssd] [start_service] (0x0100): Queueing service a.domain.com for startup
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [recreate_ares_channel] (0x0100): Initializing new c-ares channel
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sysdb_domain_init_internal] (0x0200): DB File for a.domain.com: /var/lib/sss/db/cache_a.domain.com.ldb
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sbus_init_connection] (0x0200): Adding connection 206EAB0
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Entering.
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Adding connection 0x796570.
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_init_connection] (0x0200): Adding connection 796570
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [monitor_common_send_id] (0x0100): Sending ID: (%BE_a.domain.com,1)
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Got a connection
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sss_names_init] (0x0100): Using re [(((?P<domain>[^\\]+)\\(?P<name>.+$))|((?P<name>[^@]+)@(?P<domain>.+$))|(^(?P<name>[^@\\]+)$))].
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [fo_add_server] (0x0080): Adding new server 'ipa.a.domain.com', to service 'IPA'
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [common_parse_search_base] (0x0100): Search base added: [DEFAULT][cn=accounts,dc=a,dc=domain,dc=com][SUBTREE][]
(Wed Dec  5 15:26:46 2012) [sssd] [mt_svc_exit_handler] (0x0040): Child [a.domain.com] terminated with signal [11]
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_dispatch] (0x0080): Connection is not open for dispatching.
(Wed Dec  5 15:26:46 2012) [sssd] [get_ping_config] (0x0100): Time between service pings for [a.domain.com]: [10]
(Wed Dec  5 15:26:46 2012) [sssd] [get_ping_config] (0x0100): Time between SIGTERM and SIGKILL for [a.domain.com]: [60]
(Wed Dec  5 15:26:46 2012) [sssd] [start_service] (0x0100): Queueing service a.domain.com for startup
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [recreate_ares_channel] (0x0100): Initializing new c-ares channel
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sysdb_domain_init_internal] (0x0200): DB File for a.domain.com: /var/lib/sss/db/cache_a.domain.com.ldb
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sbus_init_connection] (0x0200): (Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Adding connection A8DAB0
Entering.
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Adding connection 0x7a3f30.
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_init_connection] (0x0200): Adding connection 7A3F30
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_server_init_new_connection] (0x0200): Got a connection
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [monitor_common_send_id] (0x0100): Sending ID: (%BE_a.domain.com,1)
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [sss_names_init] (0x0100): Using re [(((?P<domain>[^\\]+)\\(?P<name>.+$))|((?P<name>[^@]+)@(?P<domain>.+$))|(^(?P<name>[^@\\]+)$))].
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [fo_add_server] (0x0080): Adding new server 'ipa.a.domain.com', to service 'IPA'
(Wed Dec  5 15:26:46 2012) [sssd[be[a.domain.com]]] [common_parse_search_base] (0x0100): Search base added: [DEFAULT][cn=accounts,dc=a,dc=domain,dc=com][SUBTREE][]
(Wed Dec  5 15:26:46 2012) [sssd] [mt_svc_exit_handler] (0x0040): Child [a.domain.com] terminated with signal [11]
(Wed Dec  5 15:26:46 2012) [sssd] [sbus_dispatch] (0x0080): Connection is not open for dispatching.





[root@host ~]# cat /etc/sssd/sssd.conf 
[domain/a.domain.com]

cache_credentials = True
krb5_store_password_if_offline = True
ipa_domain = a.domain.com
id_provider = ipa
auth_provider = ipa
access_provider = ipa
ipa_hostname = host.a.domain.com
chpass_provider = ipa
ipa_dyndns_update = True
ipa_server = _srv_, ipa.a.domain.com
ldap_tls_cacert = /etc/ipa/ca.crt
[domain/default]

cache_credentials = True
[sssd]
services = nss, pam, ssh
config_file_version = 2

domains = a.domain.com
[nss]

[pam]

[sudo]

[autofs]

[ssh]

Comment 2 Jakub Hrozek 2012-12-05 14:15:57 UTC
Is it possible to get a corefile or a backtrace with abrt, for example?

Comment 3 Taunus 2012-12-05 14:26:04 UTC
It is some child process crashing.. How do I get a corefile or backtrace? sssd prints out these over and over:

(Wed Dec  5 16:24:12 2012) [sssd] [monitor_quit] (0x0010): [10][No child processes] while waiting for [pac]

Comment 4 Jakub Hrozek 2012-12-05 14:51:23 UTC
(In reply to comment #3)
> It is some child process crashing.. How do I get a corefile or backtrace?

yum install abrt
service abrtd start

should do the trick.

> sssd prints out these over and over:
> 
> (Wed Dec  5 16:24:12 2012) [sssd] [monitor_quit] (0x0010): [10][No child
> processes] while waiting for [pac]

That's an unrelated bug fixed in a later build.

Comment 5 Jakub Hrozek 2012-12-06 09:38:25 UTC
Oh, I think I know what's going on. Can you try manually setting ldap_sasl_authid ?

I bet you're hitting #877972

Comment 6 Jakub Hrozek 2012-12-06 09:43:46 UTC

*** This bug has been marked as a duplicate of bug 878420 ***

Comment 7 Taunus 2012-12-08 00:23:42 UTC
Yes, setting the option in sssd.conf helps.