+++ This bug was initially created as a clone of Bug #455802 +++ From Bugzilla Helper: User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.1) Gecko/2008070208 Firefox/3.0.1 Description of problem: Since the upgrade to the following: Version: 9.3.4 Release: 6.0.2.P1.el5_2 We have had an issue with the named service randomly dieing off. In /var/log/messages we see the following errors: Jul 17 13:00:42 ns-mail named[19409]: socket.c:1649: INSIST(!sock->pending_recv) failed Jul 17 13:00:42 ns-mail named[19409]: exiting (due to assertion failure) The server is running bind-chroot and also has the caching-nameserver package installed. In addition to the local related zones, we also have 10 forwards setup in the named.conf file. This occurred on a fresh install of 5.2 which had the named service running for approximately 36 hours prior to this crash. Version-Release number of selected component (if applicable): bind-9.3.4-6.0.2.P1.el5_2 How reproducible: Sometimes Steps to Reproduce: 1.This has been random thus far. 2.A restart of the service is required as it crashes. 3. Actual Results: Expected Results: Additional info: A point to also note is that we had a 4.6 server running the latest security patched BIND and this same error occurred on it after the recent BIND related security patches. The named.conf was identical on the 4.6 server too. We then decided to migrate to 5.2, in which we have seen this same issue happen today, thus the bug report. --- Additional comment from atkac on 2008-09-12 07:15:29 EDT --- This problem is already fixed in upstream: 2406. [bug] Sockets could be closed too early, leading to inconsistent states in the socket module. [RT #18298] Problem will probably fixed together with bug #457036 by rebase to updated upstream version.
Created attachment 317192 [details] proposed patch
I've experienced this crash as well. RHEL 4.5 BIND version: bind-9.2.4-30.el4 (out of date, but I don't patch this system personally) From our logs: Oct 8 18:22:28 mlvv9n1x named[25034]: zone xxx.xxx.209.in-addr.arpa/IN: transferred serial 200307xxxx Oct 8 18:22:28 mlvv9n1x named[25034]: transfer of 'xxx.xxx.209.in-addr.arpa/IN' from xxx.xxx.xxx.132#53: end of transfer Oct 8 18:22:28 mlvv9n1x named[25034]: zone xxx.xxx.209.in-addr.arpa/IN: sending notifies (serial 200307xxxx) Oct 8 18:22:41 mlvv9n1x named[25034]: socket.c:1615: INSIST(!sock->pending_recv) failed Oct 8 18:22:41 mlvv9n1x named[25034]: exiting (due to assertion failure)
This request was evaluated by Red Hat Product Management for inclusion, but this component is not scheduled to be updated in the current Red Hat Enterprise Linux release. If you would like this request to be reviewed for the next minor release, ask your support representative to set the next rhel-x.y flag to "?".
RHEL 4.7 now. Same bind version which now appears to be up to date? Yep, still happening: Dec 3 17:30:34 mlvv9n1x named[3834]: socket.c:1615: INSIST(!sock->pending_recv) failed Dec 3 17:30:34 mlvv9n1x named[3834]: exiting (due to assertion failure) If I have to request a support ticket, then why do you provide open access to bugzilla? Please reevaluate this bug fix dwhich has caused 2 BIND crashes on our primary internal name server.
(In reply to comment #5) > If I have to request a support ticket, then why do you provide open access to > bugzilla? Please reevaluate this bug fix dwhich has caused 2 BIND crashes on > our primary internal name server. Could you fill support ticket, please? Bugs with corresponding support ticket are tracked with higher priority than bugs without support ticket. Direct access to bugzilla is mainly for people who don't use RH support.
As this happens just occasionally, I'll stay in ON_QA and SanityOnly state.
Technical note added. If any revisions are required, please edit the "Technical Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. New Contents: * under certain conditions, the named daemon could exit due to an assertion failure. The following message was logged to /var/log/messages: named: socket.c:1649: INSIST(!sock->pending_recv) failed named: exiting (due to assertion failure) This update provides a fix to the socket module which prevents this assertion from failing, thus resolving the problem.
Technical note updated. If any revisions are required, please edit the "Technical Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. Diffed Contents: @@ -1,4 +1,4 @@ -* under certain conditions, the named daemon could exit due to an assertion failure. The following message was logged to /var/log/messages: named: socket.c:1649: +Under certain conditions, the named daemon could exit due to an assertion failure. The following message was logged to /var/log/messages: named: socket.c:1649: INSIST(!sock->pending_recv) failed named: exiting (due to assertion failure)
An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2011-0223.html