Bug 211083

Summary: named no longer restarts using service
Product: [Fedora] Fedora Reporter: simon
Component: bindAssignee: Martin Stransky <stransky>
Status: CLOSED RAWHIDE QA Contact: Ben Levenson <benl>
Severity: medium Docs Contact:
Priority: medium    
Version: 5CC: bugs+fedora, tadej.j
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2006-12-06 15:31:29 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
named.conf
none
Output of "/usr/sbin/rndc stop" command none

Description simon 2006-10-17 09:04:34 UTC
Description of problem:
The recent bind-libds update no longer allows named to be restarted.
Running service named restart results in the stop failing and the start failing
with the error message that named is already running.
I have to manually kill named to stop it.


Version-Release number of selected component (if applicable):
bind-libs-9.3.3-0.1.rc2.fc5


How reproducible:
Always


Steps to Reproduce:
1. Boot up machine.
2. Named starts with NetworkManager
3. No hostname resolve so need to restart named
4. run service named restart
5. fails, so killall named  

Actual results:
Both stop and start fail

Expected results:
Named should restart with OK messages

Additional info:
This is a big issue for me due to bug #196962/#206604 which I did report while
it was still in testing.

Comment 1 Martin Stransky 2006-10-17 09:09:51 UTC
okay. btw. "service named reload" should work for you.

Could you attach content of /var/log/messages when named fails to restart?

Comment 2 simon 2006-10-17 09:36:25 UTC
There's not a lot;

Oct 17 10:33:48 echo named[1751]: rejected command channel message from
127.0.0.1#35066

The reload option does work however.

Comment 3 Martin Stransky 2006-10-17 14:20:13 UTC
Could you please attach your /etc/named.conf?

Comment 4 simon 2006-10-17 14:27:46 UTC
Created attachment 138682 [details]
named.conf

My named.conf. Should just be the default.

Comment 5 Tadej Janež 2006-10-20 17:13:21 UTC
I am experiencing the same problem on my FC5 box after updating to the latest
version of bind packages. And I also have to manually restart named due to bug
#206604. I also have the default named.conf, the same one as Simon.

Comment 6 Martin Stransky 2006-10-26 15:08:59 UTC
Could you please run "/usr/sbin/rndc stop" and attach an output here?

Comment 7 Tadej Janež 2006-10-26 20:38:38 UTC
Created attachment 139526 [details]
Output of  "/usr/sbin/rndc stop" command

Here is the output of 'sudo /usr/sbin/rndc stop' on my FC5 box.
I hope it helps.

Comment 8 Martin Stransky 2006-10-27 08:44:44 UTC
(In reply to comment #7)
> Created an attachment (id=139526) [edit]
> Output of  "/usr/sbin/rndc stop" command

I don't see any problem here. What is return code of the "/usr/sbin/rndc stop"
command? ($? shell variable). It should be zero...

Have you got any messages on console?

Comment 9 Tadej Janež 2006-10-28 20:06:46 UTC
The return code of command "sudo /usr/sbin/rndc stop" is zero and I don't get
any messages at the console (as expected).

However, running "sudo /sbin/service named stop" returns error code 1 and I get
the "Stopping named: [FAILED]" error message at the console.

Comment 10 Martin Stransky 2006-10-30 07:20:53 UTC
Please check the latest test update for FC5 (bind-9.3.3-0.2.rc2.fc5).

Comment 11 Tadej Janež 2006-11-04 10:55:27 UTC
Hi!

In the mean time, I updated my system to FC6 which now has bind-9.3.3-4.fc6
(from updates), but it still has the same problem with stopping named.

There seems to be a mistake, because updates-testing repo for FC6 contains the
packages of bind for FC5 (see
http://download.fedora.redhat.com/pub/fedora/linux/core/updates/testing/6/i386/).
So, I think I should wait for a new test update for FC6 (perhaps version
30:9.3.3-6 from cvs?)


Comment 12 Martin Stransky 2006-11-06 13:15:25 UTC
Update for FC6 is on the way.

Comment 13 Tadej Janež 2006-11-07 17:10:44 UTC
I updated bind to version 9.3.3-6 today and now I get the following error:

[tadej@tlinux-stable ~]$ sudo /sbin/service named restart
Password:
Stopping named: ..................................................no response,
killing with -TERM                                          [  OK  ]
Starting named: named: already running                     [FAILED]
[tadej@tlinux-stable ~]$ sudo /sbin/service named reload
Reloading named:                                           [FAILED]
[tadej@tlinux-stable ~]$ sudo /sbin/service named start
Starting named:                                            [  OK  ]
[tadej@tlinux-stable ~]$

Stopping named takes a very long time, but it has to be killed with -TERM, then
immediately starting named also fails. Manually staring afterwards works OK.

The relevant entries in /var/log/messages:
Nov  7 17:52:49 localhost named[1523]: rejected command channel message from
127.0.0.1#43260
Nov  7 17:54:32 localhost named[1523]: shutting down
Nov  7 17:54:32 localhost named[1523]: stopping command channel on 127.0.0.1#953
Nov  7 17:54:32 localhost named[1523]: exiting
Nov  7 17:55:10 localhost named[2511]: starting BIND 9.3.3rc3 -u named -D
Nov  7 17:55:10 localhost named[2511]: found 1 CPU, using 1 worker thread
Nov  7 17:55:10 localhost named[2511]: loading configuration from '/etc/named.conf'
Nov  7 17:55:10 localhost named[2511]: listening on IPv4 interface lo, 127.0.0.1#53
Nov  7 17:55:10 localhost named[2511]: listening on IPv4 interface eth0,
192.168.0.50#53
Nov  7 17:55:10 localhost named[2511]: command channel listening on 127.0.0.1#953
Nov  7 17:55:10 localhost named[2511]: zone 0.in-addr.arpa/IN: loaded serial 42
Nov  7 17:55:10 localhost named[2511]: zone 0.0.127.in-addr.arpa/IN: loaded
serial 1997022700
Nov  7 17:55:10 localhost named[2511]: zone 255.in-addr.arpa/IN: loaded serial 42
Nov  7 17:55:10 localhost named[2511]: zone
0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.ip6.arpa/IN:
loaded serial 1997022700
Nov  7 17:55:10 localhost named[2511]: zone localdomain/IN: loaded serial 42
Nov  7 17:55:10 localhost named[2511]: zone localhost/IN: loaded serial 42
Nov  7 17:55:10 localhost named[2511]: D-BUS dhcdbd subscription disabled.
Nov  7 17:55:10 localhost named[2511]: D-BUS service enabled.
Nov  7 17:55:10 localhost named[2511]: running
Nov  7 17:55:35 localhost NetworkManager: <WARNING>      add_ip4_config_to_named
(): Could not set forwarders for zone '.'.  Error: 'Did not receive a reply.
Possible causes include: the remote application did not send a reply, the
message bus security policy blocked the reply, the reply timeout expired, or the
network connection was broken.'.

The first message seems to indicate there is still a problem with permissions
for stopping named (I'm not using SELinux, if that has anything to do with it).

Comment 14 Martin Stransky 2006-11-10 13:56:10 UTC
Could you please check it as a root? (not under sudo)

Comment 15 Tadej Janež 2006-11-10 22:08:58 UTC
Hmm, something strange is going on here.

If I try 'service named restart' right after Fedora boots up, it seems to hang
trying to stop named. The same thing happens for root and sudo.

If I execute 'service named reload' (as sudo) it finishes immediately and the
subsequent executions of command 'service named restart' work immediatelly (both
for root and sudo).

I don't know what is 'special' about the first attempt to stop (and restart) the
named service, because subsequent attempts (after waiting for a very long time
for named to stop or executing 'service named reload' in between to 'fix' the
issue) to restart named work flawlessly.

Note: I have to restart named after startup due to bug in NetworkManager:
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=206604 (I'm waiting for
your patch to be applied to NetworkManager).

Comment 16 Juliano F. Ravasi 2006-11-20 01:51:41 UTC
I had similar problem, and solved by adding:

        inet ::1 allow { localhost; } keys { rndckey; };

to /etc/named.conf, insite the controls directive. Note the IPv6 address. This
seems to be caused due to ::1 listed as the localhost address in /etc/hosts.

Comment 17 Martin Stransky 2006-12-06 15:31:29 UTC
It should be fixed in bind-9.3.3-0.1.rc3.fc6 / bind-9.3.3-0.1.rc3.fc7.