Bug 114869 - date returns future year of 586562
Summary: date returns future year of 586562
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 3
Classification: Red Hat
Component: kernel
Version: 3.0
Hardware: x86_64
OS: Linux
medium
high
Target Milestone: ---
Assignee: Jim Paradis
QA Contact: Brian Brock
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2004-02-03 19:29 UTC by steve moss
Modified: 2013-08-06 01:03 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2004-05-12 01:08:26 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHSA-2004:188 0 normal SHIPPED_LIVE Important: Updated kernel packages available for Red Hat Enterprise Linux 3 Update 2 2004-05-11 04:00:00 UTC

Description steve moss 2004-02-03 19:29:01 UTC
From Bugzilla Helper:
User-Agent: Mozilla/4.79 [en] (X11; U; SunOS 5.8 sun4u)

Description of problem:
Incorrect date causes automounter to stop working.  At that time if
you type "date" you will see the date of:

	Wed Aug  4 09:01:29 PST 586562

The month and day will change as our calendar day moves forward, but
the year is always consistent with the reported one above.  If you
look in /var/log/messages, you will see the time stamp of the time
above.  However, if you run hwclock, the date is correct.

The correct date of the machine is:  Tue Feb  3 09:01:58 PST 2004
The reported date of the machine is: Wed Aug  4 09:01:29 PST 586562

Version-Release number of selected component (if applicable):
coreutils-4.5.3-26

How reproducible:
Sometimes

Steps to Reproduce:
1. See the automounter no longer mounts NFS mount points
2. Run the "/bin/date" command and you will see the skewed date
3. Run the "/sbin/hwclock" command to verify correct date
    

Additional info:

Comment 2 Tim Waugh 2004-02-03 22:27:15 UTC
If it is causing other problems, the error isn't in 'date' but in the
kernel's time-keeping.

Comment 3 Jim Paradis 2004-02-10 18:51:44 UTC
Could you tell me what platform(s) this problem is showing up on, and
attach boot logs from these system(s)?

I suspect that the problem is one that we saw some time ago with TSC
timer handling and that we thought we had fixed.  The reason we may
not have tripped over it until now is that nearly all current AMD64
systems have HPET timers, which have been working correctly (and far
more accurately) as far as we know.  If the RHEL kernel is not
recognizing the HPET timer on your system, I want to know why :)

If it turns out that the TSC bug is still not fixed, then getting the
HPET working will solve your immediate problem.  If it turns out to be
a problem in HPET handling (unlikely) then we'll have to address that
ASAP; in either case console logs will help enormously in finding out
exactly what's going on.


Comment 4 Jim Paradis 2004-02-10 23:36:32 UTC
After reviewing some logs sent to me by Mentor:

It's interesting that you're seeing the same thing with both
SLES8 and RHEL3... that suggests a rather fundamental problem.

When I look at roach.log, one thing jumps out at me: with only
a couple of exceptions, every time ntpdate runs it believes 
that the clock is slow by either 4293 seconds (71 min. 33 sec)
or about 140,466,900 seconds (1625 days, 18 hours, 16 min, 40 sec).
The quantum jumps up into the 587th century are occurring
because somehow the pattern of the 4293-second update is showing
up in the *high-order* longword of the 64-bit time_t.  For any
dates that we mortals might see in our lifetimes, the high-order
longword should always be zero.

I'm curious as to why the date seems to be reported as so 
consistently off, even with updates.

I notice that the strange jumps occur right when ntpdate is
run, too.  This suggests to me that there might be an issue
with *updating* the date, rather than timekeeping per se.
Frequent updating of the date along with lots of reading
thereof might trigger a latent race condition.  I'll keep
investigating.


Comment 5 Jim Paradis 2004-02-13 18:59:02 UTC
Update from customer contact: booting with the "notsc" parameter so as
to use only the HPET timer seems to be a good workaround; stability
testing is continuing.

I still want to get to the bottom of this, though.


Comment 6 Ernie Petrides 2004-03-02 17:22:27 UTC
A work-around for this problem has just been committed to the
RHEL 3 U2 patch pool (in the internal Engineering build of kernel
version 2.4.21-9.17.EL).  I'll leave this bug in "assigned" state
so that Jim can pursue a final resolution in the future.


Comment 7 John Flanagan 2004-05-12 01:08:26 UTC
An errata has been issued which should help the problem described in this bug report. 
This report is therefore being closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files, please follow the link below. You may reopen 
this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2004-188.html



Note You need to log in before you can comment on or make changes to this bug.