Bug 470540

Summary: SATA RAID1 not detected during installation (ESB2, LSI firmware)
Product: Red Hat Enterprise Linux 5
Reporter: Winfrid Tschiedel <Winfrid.Tschiedel>
Component: anaconda
Assignee: Joel Andres Granados <jgranado>
Status: CLOSED DUPLICATE
QA Contact: Release Test Team <release-test-team-automation>
Severity: high
Priority: high
Version: 5.3
CC: cward, ddumas, hdegoede, heinzm, jgranado, syeghiay, tao
Target Milestone: rc
Keywords: OtherQA, Reopened
Hardware: x86_64   
OS: Linux   
Doc Type: Bug Fix
Last Closed: 2008-12-11 16:01:10 UTC
Attachments: requested output from dmraid (no flags)

Description Winfrid Tschiedel 2008-11-07 16:09:51 UTC
Description of problem:

If you have a SATA RAID1 on Intel hardware with ESB2 and LSI firmware,
the RAID disk is not detected:
Error: device /dev/mapper/ddf1_..... not found.
Hardware used in this test: FSC rx200-s4


Version-Release number of selected component (if applicable):


How reproducible:

Always


Steps to Reproduce:
1. Configure RAID1 in the SATA RAID controller (ESB2 with LSI firmware)
2. Start installation
Actual results:
After entering the root password, the installer fails with "device /dev/mapper/ddf1_..... not found".


Expected results:
Successful installation


Additional info:
No problem with RAID0 on the same system.
No problem with a Fedora 9 installation on the same system and configuration.
A Fedora 10 installation does not work either: instead of the RAID1, the two
individual disks sda and sdb are seen.

Comment 1 Winfrid Tschiedel 2008-11-07 16:46:03 UTC
The statement that RAID0 has no problem is not really true:
after the first boot the system comes up with /dev/sda instead of
/dev/mapper/ddf1_.........p#

This might have the same cause as incident 460996.

Comment 2 Heinz Mauelshagen 2008-11-13 12:58:24 UTC
Please attach what "dmraid -b ; dmraid -r ; dmraid -s ; dmraid -tay" shows in the failure case, thanks.
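
For reference, a quick gloss of those flags (as I understand them; see the dmraid man page):

dmraid -b    # list the block devices dmraid scans
dmraid -r    # list devices carrying RAID metadata and its format (e.g. ddf1)
dmraid -s    # show the discovered RAID sets and their status
dmraid -tay  # test mode: report what activating all sets would do, without doing it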

Comment 3 Winfrid Tschiedel 2008-11-13 15:00:03 UTC
Created attachment 323456 [details]
requested output from dmraid

Comment 4 Heinz Mauelshagen 2008-11-14 09:00:21 UTC
The attachment to comment #3 shows dmraid detecting the mirror OK.
Please analyze for any anaconda/python-pyblock flaw.

Comment 5 Joel Andres Granados 2008-11-14 15:15:19 UTC
Winfrid Tschiedel:
   1. Did you execute these commands (from comment #3) on a running system, or on tty2 of the anaconda installer?
   2. Have you tested with snap #3?

I have an isw RAID1 on my machine that was not detected before snap3.  When I made sure that I was installing from snap3, I was able to see the mirror.  I don't think that there is actually anything wrong with anaconda/pyblock, as the changes that went into dmraid (AFAICS) did not leak into the API.

I suggest you test with snap 3, and if you still get the same behavior, run the commands that Heinz described on tty2 and post your findings :).  Also run `dmraid -ay`; that's the test that I used, but I assume that whatever Heinz posted is good enough :)

FYI: tty2 is the second virtual terminal (Alt-F2).  There should be a shell there.
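
A minimal check from that shell, assuming the set shows up under /dev/mapper (the ddf1_<name> entry is illustrative):

dmraid -ay        # activate all detected RAID sets
ls /dev/mapper    # an entry like ddf1_<name> should appear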

Comment 6 Winfrid Tschiedel 2008-11-14 16:06:37 UTC
Thank you Joel -

Could you do me a favor: try dmraid + GPT
and tell me your experience?

Have a nice weekend,

Winfrid

Comment 7 Joel Andres Granados 2008-11-14 16:50:04 UTC
Winfrid Tschiedel:
   I created a GPT partition table on the RAID set, rebooted, and tried an install.  Anaconda correctly detected the dmraid set and gave me the option to use it for installation.
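
For anyone reproducing this, a sketch of the parted commands involved (the set name is a placeholder):

parted /dev/mapper/ddf1_<name> mklabel gpt   # write a new, empty GPT (destroys the existing table)
parted /dev/mapper/ddf1_<name> print         # verify the new label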

Comment 8 Hans de Goede 2008-11-17 23:12:37 UTC
Winfrid Tschiedel:
As already suggested by Joel, please retest this with snapshot3; we believe this is fixed in snapshot3.

Comment 9 Denise Dumas 2008-11-21 16:33:37 UTC
Winfrid, did Snapshot 3 fix the RAID detection problem for you?  Your Comment #10 in 470543 implies that it does, but I don't want to close this out until I hear from you.
Thanks,
Denise

Comment 10 Winfrid Tschiedel 2008-11-24 08:10:39 UTC
Hello Denise, 

You are right that part 1 of the installation works with RHEL 5.3,
but what does that help when firstboot afterwards sees 2 separate disks instead of a RAID1? I think we can only close this incident when 460996 is also fixed.

Comment 11 Hans de Goede 2008-12-02 11:17:33 UTC
(In reply to comment #10)
> Hello Denise, 
> 
> You are right that part 1 of the installation works with RHEL 5.3, but what
> does that help when firstboot afterwards sees 2 separate disks instead of a
> RAID1? I think we can only close this incident when 460996 is also fixed.

We've managed to track down the cause of the system seeing 2 separate disks after installation to mkinitrd (nash). We've done a new build of mkinitrd / nash: 5.1.19.6-41, which we believe fixes this (it does on our test systems).

The new nash-5.1.19.6-41 will be in RHEL 5.3 snapshot 5, which should become available for testing next Monday.

Please test this with snapshot5 when available and let us know how it goes. Thanks for your patience.
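
Once snapshot 5 is installed, the build can be confirmed with rpm:

rpm -q nash mkinitrd   # both should report 5.1.19.6-41 or later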

Comment 14 Denise Dumas 2008-12-11 14:15:06 UTC
Winfrid, you should not still be seeing this problem as of RHEL 5.3 Snapshot 5.  We believe the change described in Comment 11 fixed it; at least it did for the other 4 reports of this problem verified by partners, and in our own testing.

I do not believe the fix has made it upstream into Fedora yet, so your BZ 460996 is still open. We are focused on 5.3 at the moment, and will get to Fedora once 5.3 is stabilized.
 
To reduce BZ confusion I am closing this as a dup of 471689. If you experience the same problem again, please feel free to reopen the BZ. If you encounter a different problem, I'd appreciate it if you could open a new BZ to make it clear that it is a new problem. 
Thanks!

*** This bug has been marked as a duplicate of bug 471689 ***

Comment 15 Winfrid Tschiedel 2008-12-11 15:19:59 UTC
Hi Denise,

I had no time to do a new installation, but I have updated a system
to Snap5.

At least with this kernel I do not see any RAID.
Booting the system with 92.0.13 (the current kernel of U2) I see
/dev/mapper/pdc_......., but the strange thing is that the mounted partitions
show device /dev/sda.

Is there any way to get my RAID configuration repaired,
or should I forget this environment?

Comment 16 Hans de Goede 2008-12-11 16:01:10 UTC
Winfrid, I'm afraid that once the system has been booted with a broken initrd, the raid set is busted; you will need to re-install. Please re-open if a misinstall (after recreating the raid set in the BIOS) fails.

*** This bug has been marked as a duplicate of bug 471689 ***

Comment 17 Hans de Goede 2008-12-11 16:02:25 UTC
Winfrid, typing seems difficult today; instead of misinstall I meant re-install, of course.

Comment 18 Winfrid Tschiedel 2008-12-16 10:10:56 UTC
(In reply to comment #16)
> Winfrid, I'm afraid that once the system has been booted with a broken
> initrd, the raid set is busted; you will need to re-install. Please re-open
> if a misinstall (after recreating the raid set in the BIOS) fails.
> *** This bug has been marked as a duplicate of 471689 ***

Hello Hans,

My reinstallation on a RAID0 was successful. There is just one cosmetic problem,
already solved in Fedora 10 [Bug 470628]: during boot I get the following message:
Boot message: Could not detect stabilization, waiting 10 seconds.

Just a remark: I had a similar problem on Fedora 8 (seeing /dev/sda instead of the mapped device); there the problem went away with a new kernel
(kernel-2.6.26.6-49.fc8)

Comment 19 Hans de Goede 2008-12-16 15:49:50 UTC
(In reply to comment #18)
> Hello Hans,
> 

Hi Winfrid,

> My reinstallation on a RAID0 was successful. There is just one cosmetic
> problem, already solved in Fedora 10 [Bug 470628]: during boot I get the
> following message:
> Boot message: Could not detect stabilization, waiting 10 seconds.
> 
> Just a remark: I had a similar problem on Fedora 8 (seeing /dev/sda instead
> of the mapped device); there the problem went away with a new kernel
> (kernel-2.6.26.6-49.fc8)

Unfortunately this 10-second boot delay is caused by a bugfix for systems where the ramdisk would not wait long enough for disk scanning to complete.

The problem is that in some cases, where the system is done scanning before we start waiting, we do not detect stabilization of the found disks (because all were already found) and thus wait 10 seconds to be sure.

Due to kernel differences we cannot use the Fedora fix in RHEL-5. We've prepared a release note to explain the cause of the 10-second delay.
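
In outline, the behavior described above works roughly like this (a simplified shell sketch, not the actual nash code):

# Poll the visible disks once per second for up to 10 seconds.
# "Stabilization" means the list changed and then stopped changing;
# if it never changes while we watch (the scan finished before we
# started), we cannot tell and sit out the full 10 seconds.
prev=$(ls /sys/block)
stable=no
for i in 1 2 3 4 5 6 7 8 9 10; do
    sleep 1
    cur=$(ls /sys/block)
    if [ "$cur" != "$prev" ]; then
        stable=pending                 # devices still appearing
    elif [ "$stable" = "pending" ]; then
        stable=yes; break              # changed earlier, now settled
    fi
    prev="$cur"
done
[ "$stable" = "yes" ] || echo "Could not detect stabilization, waiting 10 seconds."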

Comment 20 Winfrid Tschiedel 2008-12-18 13:55:40 UTC
Just for your info:
I succeeded in getting my dmraid back on my updated RHEL 5 systems.
The good thing was that I still had a kernel where my dmraid basically worked:
after booting the system with vmlinuz-2.6.18-8.1.15.el5 and replacing sda with
mapper/pdc_bjfeeibeebp in /etc/mtab and /etc/blkid/blkid.tab, my dmraid config
looked good.
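
(The substitution can be scripted; a hypothetical one-liner for the rename described above, assuming the partition suffixes, e.g. sda1 vs. ...p1, map correctly on the system:)

sed -i 's|/dev/sda|/dev/mapper/pdc_bjfeeibeebp|g' /etc/mtab /etc/blkid/blkid.tab
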
After this I installed the kernel from RHEL5.3-Snap6 and updated the system
with rpm -Fvh.

All updates worked fine, except for a warning and a %post error from dmraid:

[root@rx220a Server]# rpm --force -Uvh dmraid-1.0.0.rc13-19.el5.x86_64.rpm
warning: dmraid-1.0.0.rc13-19.el5.x86_64.rpm: Header V3 DSA signature: NOKEY, key ID 897da07a
Preparing...                ########################################### [100%]
   1:dmraid                 ########################################### [100%]
error: %post(dmraid-1.0.0.rc13-19.el5.x86_64) scriptlet failed, exit status 1
[root@rx220a Server]# 

Is this something I should care about?
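
(Aside: the failing %post scriptlet itself can be displayed with rpm, which may help pinpoint where it exits non-zero:)

rpm -q --scripts dmraid   # print the installed package's scriptlets, including %post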

PS: I wish you a wonderful Xmas time.
    Tomorrow is my last working day in 2008; I will be back on Jan. 7, 2009.

Winfrid