110678 – Kudzu causes problems with the realtime clock.

Bug 110678 - Kudzu causes problems with the realtime clock.

Summary: Kudzu causes problems with the realtime clock.

Keywords:
Status:	CLOSED DEFERRED
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	kudzu
Sub Component:
Version:	1
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	medium
Target Milestone:	---
Assignee:	Bill Nottingham
QA Contact:	David Lawrence
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2003-11-23 03:07 UTC by Sam Varshavchik
Modified:	2014-03-17 02:40 UTC (History)
CC List:	2 users (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2005-04-28 17:58:12 UTC
Type:	---
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
lspci.txt (6.76 KB, text/plain) 2003-11-23 03:08 UTC, Sam Varshavchik	no flags	Details
kernelboot.txt (13.79 KB, text/plain) 2003-11-23 03:12 UTC, Sam Varshavchik	no flags	Details
View All

Description Sam Varshavchik 2003-11-23 03:07:29 UTC

Description of problem:

I'm pointing my accusatory finger at kudzu in FC1 for causing several
instabilities on the hardware described in the attachments to this
bug. Basically, all of the following symptoms were resolved simply by
disabling kudzu.

Kudzu in RH9 did not cause the following results on the same hardware.

After yet another uneventful upgrade from RH9 to FC1 (no different
than similar previous upgrade runs on different hardware), the machine
hung after rebooting, at the "Configuring kernel parameters" prompt.

An investigation showed that hwclock, being invoked from rc.sysinit,
was hanging here:

open("/etc/adjtime", O_RDONLY|O_LARGEFILE) = 3
fstat64(3, {st_mode=S_IFREG|0644, st_size=46, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xbf370000
read(3, "0.069092 1069550582 0.000000\n106"..., 4096) = 46
close(3)                                = 0
munmap(0xbf370000, 4096)                = 0
open("/dev/rtc", O_RDONLY|O_LARGEFILE)  = 3
ioctl(3, RTC_UIE_ON, 0)                 = 0
read(3,

The read() was never coming back from the kernel.

hwclock when invoked from the halt script suffered the same fate.

After commenting out hwclock, the machine booted succesfully, but one
of the two NICs (3c59x.o, the 3c900 combo Boomerang) was refusing to
play nice:

Nov 22 14:42:14 brimstone kernel: eth0: Transmit error, Tx status
register d0.
Nov 22 14:42:14 brimstone kernel:   Flags; bus-master 1, dirty 1(1)
current 1(1)

[ ... ]

Nov 22 14:42:59 brimstone kernel: eth0: Host error, FIFO diagnostic
register 2000.
Nov 22 14:42:59 brimstone kernel: eth0: PCI bus error, bus status 008b0029

Booting an RH9 kernel did not fix some of these symptoms, so I began
suspecting a hardware failure that happen to occur at the same time as
an upgrade to FC1.

After some further investigation, I accidentally discovered that by

1) Disabling kudzu, and
2) Doing a complete cold reset, as in "Arctic Circle" cold reset --
meaning removing the power cord from the power supply, waiting, and
plugging it back in -- resolved all symptoms, brought the hardware
clock, and the misbehaving NIC card up to speed, and returned the
server to 100% production mode.

Basically, running Kudzu just once permanently screws up this hardware
I have here.  Even an ordinary power off, using the power button, does
not fix the situation.  I must remove all power from the machine, by
physically disconnecting it from the power supply, in order to restore
it.  This is completely reproducible.

Comment 1 Sam Varshavchik 2003-11-23 03:08:25 UTC

Created attachment 96141 [details]
lspci.txt

lspci output

Comment 2 Sam Varshavchik 2003-11-23 03:12:09 UTC

Created attachment 96142 [details]
kernelboot.txt

Here's how a succesful kernel boot looks like.

Comment 3 Bill Nottingham 2003-11-24 04:10:17 UTC

hwclock is something different, almost certainly.

The 3c59x is already in bugzilla.

*** This bug has been marked as a duplicate of 107389 ***

Comment 4 Bill Nottingham 2003-11-24 04:11:24 UTC

Reopening for the hwclock issue.

Is this reproducible as well? Or is just the 3c59x problem reproducible?

Comment 5 Sam Varshavchik 2003-11-24 12:06:27 UTC

hwclock funniness is also reproducible.

Comment 6 Bill Nottingham 2005-04-28 17:58:12 UTC

Closing bugs on older, no longer supported, releases. Apologies for any lack of
response. Please open a new bug if it persists on current releases.

Note You need to log in before you can comment on or make changes to this bug.