Bug 1365218
| Summary: | SSSD does not fail over to next GC | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | German Parente <gparente> |
| Component: | sssd | Assignee: | SSSD Maintainers <sssd-maint> |
| Status: | CLOSED ERRATA | QA Contact: | Steeve Goveas <sgoveas> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | medium | ||
| Version: | 6.7 | CC: | dlavu, grajaiya, jhrozek, lslebodn, mkosek, mzidek, nsoman, pbrezina, sssd-maint, tlavigne |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | sssd-1.13.3-32.el6 | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2017-03-21 09:57:13 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 1269194, 1365846 | ||
|
Description
German Parente
2016-08-08 16:17:13 UTC
sssd-1-13: * 6ff40ad8f0d604620210c9680ef8b1f9ed1e0417 * 9218ad4a750b46c8fc89a3a30c9a8411e620141d This is failing against sssd-1.13.3-53.el6.x86_64 (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [ad_get_dc_servers_done] (0x0400): Found 2 domain controllers in domain sssdad2012r2.com (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [ad_srv_plugin_dcs_done] (0x0400): About to locate suitable site (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [sdap_connect_host_send] (0x0400): Resolving host bsod2-bdc.sssdad2012r2.com (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_is_address] (0x4000): [bsod2-bdc.sssdad2012r2.com] does not look like an IP address (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying files (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_files_send] (0x0100): Trying to resolve A record of 'bsod2-bdc.sssdad2012r2.com' in files (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying files (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_files_send] (0x0100): Trying to resolve AAAA record of 'bsod2-bdc.sssdad2012r2.com' in files (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_next] (0x0200): No more address families to retry (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying DNS (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_dns_query] (0x0100): Trying to resolve A record of 'bsod2-bdc.sssdad2012r2.com' in DNS (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [schedule_request_timeout] (0x2000): Scheduling a timeout of 6 seconds (Thu Jan 12 16:46:02 2017) [sssd[be[sssdad2012r2.com]]] [schedule_timeout_watcher] (0x2000): Scheduling DNS timeout watcher (Thu Jan 12 16:46:03 2017) [sssd[be[sssdad2012r2.com]]] [check_fd_timeouts] (0x4000): Checking for DNS timeouts (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [check_fd_timeouts] (0x4000): Checking for DNS timeouts (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [unschedule_timeout_watcher] (0x4000): Unscheduling DNS timeout watcher (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_dns_parse] (0x1000): Parsing an A reply (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [request_watch_destructor] (0x0400): Deleting request watch (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [sdap_connect_host_resolv_done] (0x0400): Connecting to ldap://bsod2-bdc.sssdad2012r2.com:389 (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [sss_ldap_init_send] (0x4000): Using file descriptor [23] for LDAP connection. (Thu Jan 12 16:46:04 2017) [sssd[be[sssdad2012r2.com]]] [sss_ldap_init_send] (0x0400): Setting 6 seconds timeout for connecting <---- SNIP ----> (Thu Jan 12 16:46:16 2017) [sssd[be[sssdad2012r2.com]]] [be_resolve_server_done] (0x1000): Server resolution failed: 14 (Thu Jan 12 16:46:16 2017) [sssd[be[sssdad2012r2.com]]] [check_online_callback] (0x0100): Backend returned: (1, 0, <NULL>) [Provider is Offline] (Thu Jan 12 16:46:16 2017) [sssd[be[sssdad2012r2.com]]] [fo_reset_services] (0x1000): Resetting all servers in all services We reached a service resolution timeout. https://fedorahosted.org/sssd/ticket/3217 Try to set dns_resolver_timeout = 3 ldap_opt_timeout = 9 Sadly, no luck. After issuing iptables -A INPUT -s $PRIMARY_DC -j DROP ; iptables -A OUTPUT -s $PRIMARY_DC -j DROP causes the timeout issue to the secondary_dc sssd.conf =============== [sssd] config_file_version = 2 services = nss, pam domains = sssdad2012r2.com [nss] default_shell = /bin/bash [domain/sssdad2012r2.com] debug_level = 0xFFF0 ad_enable_gc = true id_provider = ad cache_credentials = True krb5_store_password_if_offline = True use_fully_qualified_names = True fallback_homedir = /home/%d/%u dns_resolver_timeout = 3 ldap_opt_timeout = 9 logs =============== (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [ad_get_dc_servers_done] (0x0400): Found 2 domain controllers in domain sssdad2012r2.com (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [ad_srv_plugin_dcs_done] (0x0400): About to locate suitable site (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [sdap_connect_host_send] (0x0400): Resolving host bsod2.sssdad2012r2.com (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_is_address] (0x4000): [bsod2.sssdad2012r2.com] does not look like an IP address (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying files (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_files_send] (0x0100): Trying to resolve A record of 'bsod2.sssdad2012r2.com' in files (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying files (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_files_send] (0x0100): Trying to resolve AAAA record of 'bsod2.sssdad2012r2.com' in files (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_next] (0x0200): No more address families to retry (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_step] (0x2000): Querying DNS (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [resolv_gethostbyname_dns_query] (0x0100): Trying to resolve A record of 'bsod2.sssdad2012r2.com' in DNS (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [schedule_request_timeout] (0x2000): Scheduling a timeout of 6 seconds (Fri Jan 13 11:18:14 2017) [sssd[be[sssdad2012r2.com]]] [schedule_timeout_watcher] (0x2000): Scheduling DNS timeout watcher (Fri Jan 13 11:18:15 2017) [sssd[be[sssdad2012r2.com]]] [fo_resolve_service_timeout] (0x0080): Service resolving timeout reached (Fri Jan 13 11:18:15 2017) [sssd[be[sssdad2012r2.com]]] [request_watch_destructor] (0x0400): Deleting request watch (Fri Jan 13 11:18:15 2017) [sssd[be[sssdad2012r2.com]]] [be_resolve_server_done] (0x1000): Server resolution failed: 14 (Fri Jan 13 11:18:15 2017) [sssd[be[sssdad2012r2.com]]] [check_online_callback] (0x0100): Backend returned: (1, 0, <NULL>) [Provider is Offline] (Fri Jan 13 11:18:15 2017) [sssd[be[sssdad2012r2.com]]] [check_fd_timeouts] (0x4000): Checking for DNS timeouts (Fri Jan 13 11:18:16 2017) [sssd[be[sssdad2012r2.com]]] [check_fd_timeouts] (0x4000): Checking for DNS timeouts (Fri Jan 13 11:18:16 2017) [sssd[be[sssdad2012r2.com]]] [unschedule_timeout_watcher] (0x4000): Unscheduling DNS timeout watcher (Fri Jan 13 11:18:18 2017) [sssd[be[sssdad2012r2.com]]] [be_ptask_execute] (0x0400): Back end is offline (In reply to Dan Lavu from comment #11) > Sadly, no luck. > > After issuing > > iptables -A INPUT -s $PRIMARY_DC -j DROP ; iptables -A OUTPUT -s $PRIMARY_DC > -j DROP > > causes the timeout issue to the secondary_dc > Do you hit a timeout issue with REJECT instead of DROP? Yes, there is no difference between REJECT/DROP. This is now verified against sssd-1.13.3-56.el6.x86_64 This was a client mis-configuration, only the primary name server was in /etc/resolv.conf and was unable to perform the lookup for the IP for the secondary GC when attempting to fail over. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2017-0632.html |