Bug 249077

Summary: virt-manager installs on a non-typed disk partition, install succeed, domU fails
Product: Red Hat Enterprise Linux 5 Reporter: Chuck Morrison <chuck.morrison>
Component: xenAssignee: Hugh Brock <hbrock>
Status: CLOSED NOTABUG QA Contact:
Severity: high Docs Contact:
Priority: low    
Version: 5.1CC: dchapman, jarod, martine.silbermann, rick.hester
Target Milestone: ---   
Target Release: ---   
Hardware: ia64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-07-25 20:37:15 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 223107    
Attachments:
Description Flags
xend.log
none
xend-debug.log
none
virt-manager.log none

Description Chuck Morrison 2007-07-20 19:12:14 UTC
Description of problem:

When setting up a disk (/dev/cciss/c1d0) with partitions with parted, specifying
ext3 as a filesystem type results in no filesystem on the partition.
Virt-manager is fine with installing to this non-existent file system, but the
domU that is installed never even gets a VM Console to start after the install.

Version-Release number of selected component (if applicable):

rhel5.1 prebeta

How reproducible:

I've had it happen the 3 times I've tried it.

It happens with fully virtualized domU installs. Para-virtualized domUs are
never able to start the install process, just a blank screen on the VM Console.

Steps to Reproduce:
1. partition a hard drive with parted and specify ext3 as the filesystem type
2. create a new fully virtualized domU and use the above partition
3. try to get to a first boot on the installed system.
  
Actual results:

The VM Console just gives a black screen indefinately. CPU usage for that domU
very slowly rises over time.

Expected results:

Either virt-install should put a file system on the partition or refuse to use
it until it has one on it.

Additional info:

This was done with a fully virtualized domU already running during the install.

Comment 1 Chuck Morrison 2007-07-20 19:33:24 UTC
Also, during the install of this domU, when it initially booted to elilo, I
could see fs0 and fs1. The new install elilo files were in fs1, while fs0 held
the elilo boot files for the previous (and running at the time) domU. Normally,
with installing to a partition with an ext2 (for example) file system, only fs0
is visible

Comment 3 Hugh Brock 2007-07-23 17:41:47 UTC
Virt-manager (and python-virtinst, where the create work is done) doesn't know
anything about creating filesystems for guests or checking if they exist -- it
relies on the guest installer for that. There are many reasons you might not be
getting a console for a guest you are installing, but I doubt the filesystem, or
lack thereof, is one of them.

Can you post xend.log, xend-debug.log, and virt-manager.log from when you create
the guest?

Comment 4 Hugh Brock 2007-07-23 17:46:34 UTC
FWIW, if you are trying to get a serial console from a fully virtualized guest,
the serial console is unavailable in RHEL5.x xen. The only way to see a screen
with a fully virtualized guest is by connecting to it with vncviewer or the
virt-manager graphical console.

Comment 5 Chuck Morrison 2007-07-23 19:52:16 UTC
Created attachment 159805 [details]
xend.log

Comment 6 Chuck Morrison 2007-07-23 19:54:06 UTC
Created attachment 159806 [details]
xend-debug.log

Comment 7 Chuck Morrison 2007-07-23 19:55:03 UTC
Created attachment 159808 [details]
virt-manager.log

There are a lot of installs in these files. I don't know how to isolate them
for you.

Comment 8 Chuck Morrison 2007-07-24 14:46:41 UTC
I tried this with ext2 as well and found the same issue. So I agree it's not the
file-system type. It appears that after the initial fully virtualized guest
install succeeded, any additional install fails. Unfortunately, it hosed the
initial install as well, which would not restart after shutting it down to try
another install.

I am aware that the graphical console is needed, but it never even gets to the
EFI prompt in that console. At this point in the install (first boot), vncviewer
is not an option as I need console access to start elilo. It appears that it
never starts EFI, it claims there is no console to attach to.

Comment 9 Jarod Wilson 2007-07-24 20:26:07 UTC
From your xend-debug.log:

File "/usr/lib/python2.4/site-packages/xen/xend/image.py", line 128, in createDomain
    raise VmError('Kernel image does not exist: %s' % self.kernel)
VmError: Kernel image does not exist: /usr/lib/xen/boot/hvmloader

Looks like you're missing the package xen-ia64-guest-firmware, which provides
the hvmloader file, which is an emulated EFI for hvm guests, and required for
them to function. Its found on the supplemental packages cd. Make sure you have
that installed, and I think you should be in much better shape.

Comment 10 Chuck Morrison 2007-07-24 20:46:13 UTC
This was indeed a problem with the first install. I discovered the problem and
installed the rpm you mention. Continue down the log, you'll find more without
that particular issue. I probably did at least 5 or 6 installs after installing
the guest-firmware rpm.

Comment 11 Jarod Wilson 2007-07-24 21:14:07 UTC
Hm... All subsequent log info in xend-debug.log appears to be related to
paravirt guests, not hvm guests (based on the existence of messages about
pygrub, which is only for pv guests). Poking through xend.log now...

Comment 12 Jarod Wilson 2007-07-24 21:25:20 UTC
Actually, given that you're attempting to use partitions on a drive hooked to a
cciss controller, I'm curious if maybe bug 249104 is the real root cause of
problems. Are you seeing any "out of memory" messages in dmesg or messages
related to the cciss driver?

Comment 13 Chuck Morrison 2007-07-24 22:03:50 UTC
re: comment 11

Yes, I did try some para-virtual installs. They failed.

re: comment 12

Yes, I have seen these out of memory errors when dom0 boots.

Now, to confuse matters more, I have just completed a fresh install (dom0) on
the same machine (rx2660) and have successfully installed two fully virtualized
domUs to the same disk partitions I had the errors on previously. They still
don't have labels, according to parted. I did not attempt to make any domUs
before I installed the guest-firmware rpm.

I have not seen a repeat of the initial errors that prompted this bug report.

Comment 14 Chuck Morrison 2007-07-24 22:43:49 UTC
The main difference between the installs that prompted this report and the
latter installs (see comment 13) w/o problems that I can think of is I did not
allow any dom's (not even dom0) to use lvm this time. Every one is going
straight to disk (or virtual disk). This may be significant or not.

Comment 15 Hugh Brock 2007-07-25 13:06:50 UTC
OK... can you clear xend.log and virt-manager.log (actually virt-manager.log
will clear itself on startup) and do a single hvm guest install using LVM? If
that fails, post those logs here and we'll see what we can work out -- if it
succeeds, then it sounds like we have no bug...

Thanks!

Comment 16 Chuck Morrison 2007-07-25 13:51:53 UTC
OK, I can do that. What's the preferred way to clear xend.log ? My guess is stop
xend, blow away xend.log and restart xend. Is there a better way ?

Comment 17 Hugh Brock 2007-07-25 14:02:32 UTC
That should do it, yes. xend probably also clears it on startup, not 100% sure
on that.

Comment 18 Hugh Brock 2007-07-25 17:23:23 UTC
I was wrong, xend doesn't clear its log on startup. You'd have to delete it
manually and reboot...

Comment 19 Chuck Morrison 2007-07-25 20:32:24 UTC
I reinstalled everything to try this again with lvm included. Oddly enough, I
did not experience the failures I did last week. I installed two lvm based
virtual machines and two non-lvm virtual machines. They are all running and I've
stopped and restarted them several times.

So at this point I don't know what was happening to mess these up, but I suspect
that something I was doing triggered the bug in bz#249574 since the error
messages  I've seen point to missing domain id, which I'm assuming are the UUID.

Comment 20 Hugh Brock 2007-07-25 20:37:15 UTC
Well I'm glad it is now working. We also have a fix for #249574 on the way. Thanks!