Bug 149305 - grub-install fails on > 2 TB volumes
Summary: grub-install fails on > 2 TB volumes
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 4
Classification: Red Hat
Component: anaconda
Version: 4.0
Hardware: i386
OS: Linux
medium
high
Target Milestone: ---
: ---
Assignee: Peter Jones
QA Contact:
URL:
Whiteboard: RHEL4U3NAK
: 149064 (view as bug list)
Depends On: 203383
Blocks:
TreeView+ depends on / blocked
 
Reported: 2005-02-22 06:11 UTC by Marty Shannon
Modified: 2008-02-01 14:12 UTC (History)
8 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2008-02-01 14:11:30 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
/boot/grub/device.map (82 bytes, text/plain)
2005-04-14 20:12 UTC, Marty Shannon
no flags Details
/proc/mdstat (196 bytes, text/plain)
2005-04-14 20:13 UTC, Marty Shannon
no flags Details
/proc/partitions (454 bytes, text/plain)
2005-04-14 20:14 UTC, Marty Shannon
no flags Details
/boot/grub/grub.conf (1.00 KB, text/plain)
2005-04-14 20:15 UTC, Marty Shannon
no flags Details

Description Marty Shannon 2005-02-22 06:11:10 UTC
Description of problem:
grub-install fails, claiming that the cylinder is not supported by the
BIOS.  This failure is not detected by anaconda(!) so the result of
any install on a box with only SCSI controllers will be a non-bootable
system!

Version-Release number of selected component (if applicable):


How reproducible:
absolute: fails with both aic79xx and 3w-9xxx controllers.

Steps to Reproduce:
1. Obtain machine with no IDE/ATA/SATA drives
2. Install RHEL4
  
Actual results:
System is not bootable

Expected results:
System is bootable

Additional info:

Comment 1 Marty Shannon 2005-02-22 06:14:54 UTC
*** Bug 149064 has been marked as a duplicate of this bug. ***

Comment 3 Joshua Baker-LePain 2005-04-11 19:01:10 UTC
To give some more info, I just installed on a system with 2 3w-9xxx controllers.
 When it tries to reboot, *no* messages appear on the screen -- after the BIOS
summary screen all you get is a blinking cursor.  I booted into rescue mode
using the install CD, did a 'chroot /mnt/sysimage', and tried 'grub-install
/dev/sda'.  It returned the error "The file /boot/grub/stage1 not read correctly".

Comment 4 Joshua Baker-LePain 2005-04-12 14:33:00 UTC
I think the problem in my case is that the boot "drive" is >2TB.  Each 3ware has
12 300GB drives on it configured as RAID5 with a hot spare.  Was that your issue
as well?  If not, then I'll file it under a separate bug.  FWIW, lilo boots this
system just fine.

Comment 5 Peter Jones 2005-04-14 15:15:39 UTC
Marty, what's the exact error message grub-install is printing?

Also, what are the sizes of the disks attached?  Joshua's theory may have some
credence; devices larger than 2TB are not supported to boot off of.  This may or
may not be the problem; hard to tell without more data.

Can you please attach the files /boot/grub/grub.conf , /boot/grub/device.map ,
and /proc/partitions to this bug report?

Comment 6 Marty Shannon 2005-04-14 19:15:26 UTC
Yes, Joshua is probably correct.  Unfortunately, the machine is no longer
available for me to hack on (it's in production now).  The initial boot drive
was 11 400GB drives in RAID5 with a hot spare, so it was definitely >2TB. 
However, the primary controller is now the onboard AIC-7902 with 2 36GB drives
(in software RAID1), and grub failed on that in the very same manner (as Joshua
documented in comment #3).  Fortunately, lilo had no issue with the AIC-7902
drives -- at least running from rescue-mode -- so that's why the box is now in
production.  I *think* we'll be getting another similarly configured box, and if
I can "steal" it to do some testing for a day or so, I'll drop a note here (I'm
guessing "within about a month").

I think it is very important that anaconda catch this problem, though.  That's
the part that had me *really* frustrated.  Had I known that grub failed, I would
have immediately been able to try lilo on the 4TB 3ware array (and I'm guessing
it would have worked, but I can't be sure).

Comment 7 Joshua Baker-LePain 2005-04-14 19:33:57 UTC
Neither grub nor lilo will reliably boot from a >2TB device.  Sure, lilo will
boot once -- it will then happily trash your partition table.  The problem is
that msdos disk labels don't support >2TB, and neither grub nor lilo support gpt
labels.  I'm guessing that the issue you had above had to do with drive ordering
-- grub tried to install to the big array even though the SCSI mirror was there.

To get my system to be able to reboot reliably, I had to make sure the
motherboard BIOS tried to boot first off the internal hard drive I put in (I've
got a 2 port 3ware and a couple of WD Raptors on order) and that I put gpt
labels on my arrays as well as at least one partition.  Only then could I reboot
the system (using grub) and have it come up with all the partitions intact.

Comment 8 Marty Shannon 2005-04-14 19:55:58 UTC
In that case, trying lilo on the 4TB drive would have also been very frustrating
-- at least for the 2nd and subsequent boots.  :(  But now that I know about all
this, I'll be prepared when I assemble the filestore / near-line backup box for
home.  This issue would probably have driven me mad on my own box, as I would
not have gone for the option of any other drives.  Thanks, Joshua!

I just read up on GPT drives, and that sounds like something that the grub folks
ought to be looking at....

Anyway, there's still 2 outstanding issues here.  1: anaconda did not catch
grub's failure; 2: grub needs to ascertain the BIOS boot order & put the boot
block on the right drive (if comment #7 is right about why grub failed on the
AIC-7902 drives).

Comment 9 Marty Shannon 2005-04-14 20:12:10 UTC
Created attachment 113183 [details]
/boot/grub/device.map

Comment 10 Marty Shannon 2005-04-14 20:13:22 UTC
Created attachment 113184 [details]
/proc/mdstat

Comment 11 Marty Shannon 2005-04-14 20:14:17 UTC
Created attachment 113185 [details]
/proc/partitions

Comment 12 Marty Shannon 2005-04-14 20:15:07 UTC
Created attachment 113186 [details]
/boot/grub/grub.conf

Comment 19 David Lawrence 2006-03-10 17:21:48 UTC
NEEDINFO_PM has been deprecated. Changing status to NEEDINFO and changing
ownership to pm manager.

Comment 27 RHEL Program Management 2006-08-21 16:50:39 UTC
Development Management has reviewed and declined this request.  You may appeal
this decision by reopening this request. 

Comment 28 Rob Kenna 2006-08-21 19:59:00 UTC
Re-opening and placing in 4.6 eval queue based on Peter's comments in #26.

Comment 29 RHEL Program Management 2007-03-10 01:16:45 UTC
This bugzilla had previously been approved for engineering
consideration but Red Hat Product Management is currently reevaluating
this issue for inclusion in RHEL4.6.

Comment 33 David Lehman 2007-12-07 22:12:54 UTC
Booting from disks >2TB is not going to work in RHEL4 (or current RHEL5, for
that matter). You can work around it by exporting a second, smaller LUN to boot
from.

Comment 36 Rob Kenna 2008-02-01 14:12:30 UTC
This issuing is being closed as this is late in the RHEL 4 cycle and would be
too disruptive.  Additionally, there is a reasonable work around noted in
comment #33.


Note You need to log in before you can comment on or make changes to this bug.