Bug 170105 - hourly lost responses to control messages
Summary: hourly lost responses to control messages
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 3
Classification: Red Hat
Component: ntp
Version: 3.0
Hardware: i686
OS: Linux
medium
low
Target Milestone: ---
Assignee: Miroslav Lichvar
QA Contact: Brian Brock
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2005-10-07 10:31 UTC by Comsoft GmbH (MIB)
Modified: 2013-02-14 12:13 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2007-01-24 21:48:14 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Comsoft GmbH (MIB) 2005-10-07 10:31:18 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; de-DE; rv:1.7.7) Gecko/20050414 Firefox/1.0.3

Description of problem:
The following was observed on the RHEL3U4 platform running ntp-4.1.2-4.EL3.1.

We have a (middleware) program that queries ntpd every 5 seconds with a mode 6 control message to read variable "peer" of the local host. The program issues a receive timeout if  no answer is received within 3s.

This program showed receive timeouts happening at exactly the same minute and second every hour. Sniffing the network, I saw that indeed ntpd did not answer the request at those times.

I also simulated our program with a shell script loop of the command

ntpq -c "debug more" -c "debug more" -c "debug more" -c "debug more" -c "debug more" -c "timeout 1500" -c "timeout" -c "addvars peer" -c "rl" localhost

This also showed that at hourly intervals, the request was not immediately answered but was retried.
There were no syslog entries written by ntpd at these times, and looking at it with "ntpq -p" also showed no change in its peers.

Deactivating the drift file generation by commenting out the line

  driftfile /var/lib/ntp/drift

resolved the problem - no more unanswered requests.

Version-Release number of selected component (if applicable):
ntp-4.1.2-4.EL3.1

How reproducible:
Sometimes

Steps to Reproduce:
1. start ntpd with driftfile activated in ntp.conf
2. send control request to read the "peer" variable in a fast loop with low timeout 
3. observe timeout (due to no response) at regular hourly intervals

Whether it is observed at a particular hour depends (I suppose on the relative timing of the requests and whatever is keeping ntpd busy).

Actual Results:  Almost every hour, a response from ntpd to a control request was missing at a particular time (roughly accurate to the second).

Expected Results:  The control request should be answered.

Additional info:

Hardware is HP Proliant DL380G4.

Contents of ntp.conf:
---------------------
driftfile /var/lib/ntp/drift
trustedkey 1
requestkey 1
keys /etc/ntp/keys
server ntpsrv1 version 4 iburst
server ntpsrv2 version 4 iburst

Comment 2 RHEL Program Management 2006-12-17 00:00:37 UTC
Product Management has reviewed and declined this request.  You may appeal this
decision by reopening this request. 

Comment 3 Daniel Riek 2007-01-24 21:48:14 UTC
As we have to prioritize the amount of change we introduce into a RHEL release
and given the fact that RHEL3 is in transition from an extended Full Support
phase (ended with U8) to the Maintenance phase, we have very strict inclusion
criteria for 3.9.

As this problem is not considered to have a high impact on production systems
and an easy workaround is available we do not think that it meets these criteria.

If you disagree, please re-open the case via Red Hat support.

Please also note that Bugzilla is a development tool, not a Support tool.
Therefor not all comments (especially process related ones) are publicly
visible. Detailed closure reasons for example often are only communicated by the
Red Hat support organization. Therefor we ask our customers to file requests
important for their production systems via our Support service. Only then we can
ensure a consitent communication.



Note You need to log in before you can comment on or make changes to this bug.