Bug 576480 - RHEL 5.4 x86 edit the boot kernel " acpi=off" in default mode & use Intel Westmere-EP B0/B1 6-cores processors, SUT will halt
Summary: RHEL 5.4 x86 edit the boot kernel " acpi=off" in default mode & use Intel Wes...
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.4
Hardware: x86_64
OS: Linux
low
urgent
Target Milestone: rc
: ---
Assignee: Lenny Szubowicz
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2010-03-24 08:01 UTC by eileentseng
Modified: 2013-08-15 13:28 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2013-08-15 13:28:24 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
attach dmesg & /var/log/messages files (236.55 KB, application/octet-stream)
2010-03-25 04:08 UTC, eileentseng
no flags Details

Description eileentseng 2010-03-24 08:01:24 UTC
Description of problem:
RHEL 5.4 x86 edit the boot kernel " acpi=off" in default mode & use Intel Westmere-EP B0/B1 6-cores processors, SUT will halt (80 debug port shows "04")


How reproducible:


Steps to Reproduce:
1. Install Intel Westmere-EP B0/B1 6-Core processors
2. Boot on SUT under RHEL 5.4 x86
3. Edit boot kernel command "acpi=off"
4. System will halt. 

  
Actual results:
System halt (80 debug port shows "04")

Expected results:
system should boot up ok.

Additional info:
1. XEN mode is without this problem.
2. In default mode, if using Intel Westmere-EP B0/B1 4-core processors, it can also boot up okay.

Comment 1 Jiri Skala 2010-03-24 09:01:22 UTC
When is the system halt?
Which processor exactly - Inel Xeon Processor {X,W,L,E}<number>
Could you provide more details - e.g. boot log, /var/log/messages, dmesg

Comment 2 eileentseng 2010-03-25 04:08:17 UTC
Created attachment 402455 [details]
attach dmesg & /var/log/messages files

Here is the requested log files.  Please help.  Thanks!

Comment 3 eileentseng 2010-03-29 09:32:54 UTC
(In reply to comment #1)
> When is the system halt?
> Which processor exactly - Inel Xeon Processor {X,W,L,E}<number>
> Could you provide more details - e.g. boot log, /var/log/messages, dmesg    

(In reply to comment #1)
> When is the system halt?
After we issue command acpi=off and reboot
> Which processor exactly - Inel Xeon Processor {X,W,L,E}<number>
Intel Westmere-EP 3.46G 12M B0 130W 6.4GT/s 4Cores X5677 (AT80614005145AB Q3V5/)
> Could you provide more details - e.g. boot log, /var/log/messages, dmesg 
Already provided the log files on 3/25

Comment 4 Jiri Skala 2010-03-31 08:31:28 UTC
1. Well, this bug is filed against apmd but dmesg contains:

apm: BIOS not found.

Therefore I thing that you should replace apm usage by acpi.

2. I don't understand there is "Install Intel Westmere-EP B0/B1 6-Core processors" in the description of bug but the processor specification describes 4-cores processor. What is wrong?

Comment 5 eileentseng 2010-03-31 09:53:32 UTC
1. Intel Westmere has 4-core type and 6-core type, but only 6-core types cannot issue issue apm=off in normal mode, 4-core type is without this issue.
2. In XEN mode, no matter use 4-core or 6-core type and issue apm=off are okay, system can boot up ok.

Comment 6 eileentseng 2010-04-02 07:23:27 UTC
(In reply to comment #4)
> 1. Well, this bug is filed against apmd but dmesg contains:
> apm: BIOS not found.
> Therefore I thing that you should replace apm usage by acpi.
sorry for the typo, we issue acpi=off, not apm=off as titled shown. can you give comments on this why issue acpi=off showns bios not found?
> 2. I don't understand there is "Install Intel Westmere-EP B0/B1 6-Core
> processors" in the description of bug but the processor specification describes
> 4-cores processor. What is wrong?

Comment 7 Jiri Skala 2010-04-02 08:45:28 UTC
You use acpi=off and dmesg says apm: BIOS not found (apm on? but ...). This could be something in bios or kernel. Please, could you clarify a couple of things:

1. Usage 4-cores physical CPU works fine. Usage 6-cores physical CPU generates described issue. Is it right?

2. Could you explain what does it mean: 80 debug port shows "04"? What about BIOS issue?

3. What problem do you try to solve with apci=off? I've got info acpi=off can lead to system freezes, upstream do not guarantee stable work of the system with acpi=off. acpi=off is a workaround for some problems, and not a recommended option.

Comment 8 eileentseng 2010-04-02 08:58:30 UTC
(In reply to comment #7)
> You use acpi=off and dmesg says apm: BIOS not found (apm on? but ...). This
> could be something in bios or kernel. Please, could you clarify a couple of
> things:
> 1. Usage 4-cores physical CPU works fine. Usage 6-cores physical CPU generates
> described issue. Is it right?
Yes
> 2. Could you explain what does it mean: 80 debug port shows "04"? What about
> BIOS issue?
There is a kind debug port card that can output system behavior.  Different number means different behavior but "04" is not a code reported by BIOS.
> 3. What problem do you try to solve with apci=off? I've got info acpi=off can
> lead to system freezes, upstream do not guarantee stable work of the system
> with acpi=off. acpi=off is a workaround for some problems, and not a
> recommended option.    
We've been assuming that the correct system behavior should have system booted up ok when issuing acpi=off.  We have been using this command for system test, so according to your comments, issue this kind of command for the test is not recommended?

Comment 9 Jiri Skala 2010-04-02 09:01:53 UTC
still one question and one remark:

- which kernel version do you use?
- RHEL 5.5 is currently released ...

Comment 10 eileentseng 2010-04-02 09:40:51 UTC
(In reply to comment #9)
> still one question and one remark:
> - which kernel version do you use?
RHEL 5.4 : 2.6.18-164.el5
> - RHEL 5.5 is currently released ...  
We also tested RHEL 5.5, result remains the same.  Kernel version as follows:  
RHEL 5.5 : 2.6.18-186.el5

Comment 11 Lenny Szubowicz 2013-01-11 15:21:15 UTC
I recently inherited this open problem report. Since it's been over 2 years since there has been any update on this problem, I'd like to check if this problem is still reproducible with RHEL 5.9?

Comment 12 Lenny Szubowicz 2013-08-15 13:28:24 UTC
Given how long this problem has gone unresolved, it's relatively low severity, and the lifecycle stage that RHEL 5 is currently in, the problem reported will not be fixed in RHEL 5.

                                   -Lenny.


Note You need to log in before you can comment on or make changes to this bug.