Bug 88900 - kjournald crashes regulary, system hangs
Summary: kjournald crashes regulary, system hangs
Keywords:
Status: CLOSED WORKSFORME
Alias: None
Product: Red Hat Linux
Classification: Retired
Component: kernel
Version: 8.0
Hardware: i686
OS: Linux
medium
high
Target Milestone: ---
Assignee: Stephen Tweedie
QA Contact: Brian Brock
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2003-04-15 13:54 UTC by Jonas Lundholm
Modified: 2007-04-18 16:53 UTC (History)
0 users

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2003-04-28 12:47:31 UTC
Embargoed:


Attachments (Terms of Use)
gzip of /var/log/message.2 (11.22 KB, application/gzip)
2003-04-22 11:40 UTC, Jonas Lundholm
no flags Details
output from df -k, df -i and lsmod in a textfile (1.49 KB, text/plain)
2003-04-26 11:03 UTC, Jonas Lundholm
no flags Details

Description Jonas Lundholm 2003-04-15 13:54:25 UTC
From Bugzilla Helper:
User-Agent: Mozilla/4.75 [en] (X11; U; SunOS 5.7 sun4u)

Description of problem:
1. Installed RedHat Linux 8.0 Professional, as a server running PostgreSQL with
tomcat
2. kjournald crashes regulary, approx 4 times a day
3. Support informed me of a patch/update, installed via up2date
4. kjournald still crashes, new type of crash with kernel panic message:
<0> Kernel panic: Aiee, killing interrupt handler
                                   In interrupt handler - not syncing


Support Service Request 229854

(I am new to this forum, Basic Installation Support urged me to submit a bug
report)


Version-Release number of selected component (if applicable):
kernel-2.4.18-27.8.0

How reproducible:
Sometimes

Steps to Reproduce:
1. Just happens frequently without any obvious cause. I just wait a while.

Actual Results:  kernel panic or kjournald stack trace appears on the console

Expected Results:  A stable server for my small database java application would
have been nice :)

Additional info:

Comment 1 Stephen Tweedie 2003-04-16 11:25:35 UTC
We need the actual console errors, including the stack trace and any preceding
diagnostics, to debug this.

Comment 2 Jonas Lundholm 2003-04-22 11:40:00 UTC
Created attachment 91225 [details]
gzip of /var/log/message.2

The log has both the startup-sequence and some of the crashes, but not all in
the time span covered. I think there are crashes that don't end up in the log.

Comment 3 Jonas Lundholm 2003-04-26 11:03:32 UTC
Created attachment 91310 [details]
output from df -k, df -i and lsmod in a textfile

Comment 4 Stephen Tweedie 2003-04-28 12:47:31 UTC
Not a single one of the oopses in that log are in kjournald, nor do they show
any signs of being inside ext3.  They show all the hallmarks of hardware memory
corruption.

You really need to do a hardware memory test next.  www.memtest86.com is the
best place to start.

Comment 5 Jonas Lundholm 2003-05-12 19:31:27 UTC
You are right! Thanks a lot!

After running the tests for a day one of the memory modules was signalled as
faulty. After removing the module the machine has been up and running for 4 days
with no faults. Standard memory tests didn't detect anything. kjournald was
blamed because it was involved in the first n of the panics. 

A humble tip: Why not inform RH support service of some generic error types, if
they had told me this from start, since they had the same information, you and I
would have saved some days of work.

Case closed.


Comment 6 Stephen Tweedie 2003-05-12 20:11:01 UTC
Notes, but memory errors can show up as problems _anywhere_ in the kernel, and
it's a little tricky to give exact footprints of what stands out as such a problem.



Note You need to log in before you can comment on or make changes to this bug.