Bug 453507
Summary: | kernel panic with kernel version 2.6.9-67.0.20.EL | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Jimmy Cho <jcho> | ||||||||||||
Component: | kernel | Assignee: | Vitaly Mayatskikh <vmayatsk> | ||||||||||||
Status: | CLOSED ERRATA | QA Contact: | Martin Jenner <mjenner> | ||||||||||||
Severity: | high | Docs Contact: | |||||||||||||
Priority: | high | ||||||||||||||
Version: | 4.8 | CC: | ajadhav, bernhard.furtmueller, duck, eric.eisenhart, gergnz, herrold, jan.iven, jburke, kajtzu, k.georgiou, linux, me, mishu, mmatsuya, mvaliyav, pasteur, pgervase, phaleintx, pzijlstr, qcai, rainer.traut, sputhenp, tao, tizod, vgoyal | ||||||||||||
Target Milestone: | rc | Keywords: | ZStream | ||||||||||||
Target Release: | --- | ||||||||||||||
Hardware: | i686 | ||||||||||||||
OS: | Linux | ||||||||||||||
Whiteboard: | |||||||||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||||||||
Doc Text: | Story Points: | --- | |||||||||||||
Clone Of: | Environment: | ||||||||||||||
Last Closed: | 2009-05-18 19:26:01 UTC | Type: | --- | ||||||||||||
Regression: | --- | Mount Type: | --- | ||||||||||||
Documentation: | --- | CRM: | |||||||||||||
Verified Versions: | Category: | --- | |||||||||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||||||||
Embargoed: | |||||||||||||||
Bug Depends On: | |||||||||||||||
Bug Blocks: | 455072, 455074, 461297 | ||||||||||||||
Attachments: |
|
Description
Jimmy Cho
2008-07-01 04:16:43 UTC
Created attachment 310639 [details]
extracted kernel log entries in /var/log/message
Created attachment 311326 [details]
kernel panic log
Same problem experienced 4 days after installing 2.6.9-67.0.20.ELsmp kernel on
a system that previously had no stability issues.
seeing a similar issue on two of my 4.6 boxes running MailScanner as MX filters. Same here. It panics during heavy IO via MySQL. The problem is in next_thread function, I'm assigning this issue to Vitaly. This is a race between release_task() and sys_times()->next_thread() Created attachment 311625 [details]
lock sighand->siglock in release_task()
This patch fixes the problem
Created attachment 311626 [details]
reproducer
Created attachment 311684 [details]
simpler version of patch
Unhash process with locked sighahd->siglock.
We had two of these this weekend. Previously stable systems were upgraded to the latest kernel and two locked last night, not even making 24 hours. In my opinion this bug should be urgent. *** Bug 455274 has been marked as a duplicate of this bug. *** This is to backout the patch in 4.8. This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release. Committed in 78.1.EL . RPMS are available at http://people.redhat.com/vgoyal/rhel4/ For the time being orignal two sys_times patches ( bz 435280) have been reverted back to solve the issue. Following are the reverted commits. f9c0ff860ebf6aa16fbd3bfaabd77d72c267d449 47a5118b6f0f22add09c28c10e09f93034a9b8d9 We've just had an RHEL 4.7 box crash with this one. Is there any timescale for the fix to be released through the normal channels? *** Bug 456997 has been marked as a duplicate of this bug. *** *** Bug 456993 has been marked as a duplicate of this bug. *** Updating PM score. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHSA-2009-1024.html |