491529 – Unable to mount filesystem: device /dev/sda1 does not exist

Summary: Unable to mount filesystem: device /dev/sda1 does not exist

Keywords:
Status:	CLOSED RAWHIDE
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	anaconda
Sub Component:
Version:	rawhide
Hardware:	All
OS:	Linux
Priority:	low
Severity:	medium
Target Milestone:	---
Assignee:	Anaconda Maintenance Team
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Duplicates (2):	491743 491945
Depends On:
Blocks:	F11Beta, F11BetaBlocker

Reported:	2009-03-22 17:16 UTC by Jesse Keating
Modified:	2013-01-10 03:27 UTC
CC List:	16 users
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2009-04-23 18:10:17 UTC
Type:	---
Embargoed:
Dependent Products:

Attachments
screenshot of error. (18.28 KB, image/png) 2009-03-22 17:16 UTC, Jesse Keating	no flags
log file (30.47 KB, text/plain) 2009-03-22 17:17 UTC, Jesse Keating	no flags
storage log (52.90 KB, text/plain) 2009-03-22 17:17 UTC, Jesse Keating	no flags
system log (32.87 KB, text/plain) 2009-03-22 17:17 UTC, Jesse Keating	no flags
program log (10.25 KB, text/plain) 2009-03-22 17:18 UTC, Jesse Keating	no flags
anaconda-logs.tgz (21.22 KB, application/octet-stream) 2009-03-23 16:24 UTC, James Laska	no flags
C-prog used to demonstrate not getting any CPU cycles (283 bytes, text/plain) 2009-03-24 14:05 UTC, Hans de Goede	no flags
bash script to run the program. (37 bytes, text/plain) 2009-03-24 14:06 UTC, Hans de Goede	no flags

Description Jesse Keating 2009-03-22 17:16:37 UTC

Created attachment 336205 [details]
screenshot of error.

When doing a kickstart install in a KVM guest, using clearpart and autopart, the install fails after formatting the filesystems and displays the error shown in the attached screenshot.  Various log files will be attached as well.  This is repeatable.

Comment 1 Jesse Keating 2009-03-22 17:17:04 UTC

Created attachment 336206 [details]
log file

Comment 2 Jesse Keating 2009-03-22 17:17:26 UTC

Created attachment 336207 [details]
storage log

Comment 3 Jesse Keating 2009-03-22 17:17:48 UTC

Created attachment 336208 [details]
system log

Comment 4 Jesse Keating 2009-03-22 17:18:12 UTC

Created attachment 336209 [details]
program log

Comment 5 James Laska 2009-03-23 16:24:08 UTC

Created attachment 336319 [details]
anaconda-logs.tgz

I'm randomly seeing this failure while testing various installation scenarios in my KVM guest.

Attaching anaconda-logs.tgz which contains...
-rw-r--r-- root/root     34456 2009-03-23 12:22 tmp/anaconda.log
-rw-r--r-- root/root      7288 2009-03-23 12:21 tmp/program.log
-rw-r--r-- root/root     56988 2009-03-23 12:21 tmp/storage.log
-rw-r--r-- root/root     32406 2009-03-23 12:23 tmp/syslog
-rwxr-xr-x root/root       912 2009-03-23 12:19 tmp/vncserver.log

Comment 6 Chris Lumens 2009-03-23 20:08:37 UTC

Does /dev/sda1 even exist in time?

Comment 7 Jesse Keating 2009-03-23 20:24:14 UTC

Hard to say.  By the time I notice the error and flip to tty2 to check, it's there.

Comment 8 Chris Lumens 2009-03-23 20:31:26 UTC

*** Bug 491743 has been marked as a duplicate of this bug. ***

Comment 9 Hans de Goede 2009-03-24 14:04:58 UTC

The problem is that in a KVM guest, sleeping may not give us many (if any) CPU cycles to actually do the whole partition table rescan, as sleep waits for an amount of real time, and in that amount of real time the guest may get very little virtual CPU.

I'll attach a small C program plus a shell script to run it, which shows this.
Run the script in a virtual machine on an idle host (note the numbers
already vary wildly, while they are quite stable on a real idle machine).

Now load the host machine using multiple processes (running md5sum /dev/urandom& ten times does the trick), then watch how much CPU the KVM guest gets while the shell script sleeps 1 *real* second.

So in short, we either need to retry with longer timeouts, or find a way to not sleep at all (or blame the kvm guys).
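
For readers without access to the attachments, the measurement described above can be approximated with the short sketch below. It is not the attached C program or script (neither is reproduced in this report); the function name count_iterations and the sample lengths are assumptions made for illustration. It counts how many busy-loop iterations complete within a fixed wall-clock interval, which roughly tracks how much CPU the guest actually received during that interval.

```python
import time


def count_iterations(wall_seconds=1.0):
    """Count busy-loop iterations completed within a wall-clock interval.

    Hypothetical helper for illustration; not the program attached to this bug.
    """
    deadline = time.monotonic() + wall_seconds
    iterations = 0
    # Spin until the wall-clock deadline passes. If the guest is starved
    # of CPU, fewer iterations fit into the same amount of real time.
    while time.monotonic() < deadline:
        iterations += 1
    return iterations


if __name__ == "__main__":
    # Take a few short samples; compare the spread on an idle host
    # against the spread while the host is under load.
    for _ in range(3):
        print(count_iterations(0.2))
```

Run inside the guest, the printed counts would be expected to drop and vary more once the host is loaded, though the exact numbers depend entirely on the machine.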

Comment 10 Hans de Goede 2009-03-24 14:05:36 UTC

Created attachment 336473 [details]
C-prog used to demonstrate not getting any CPU cycles

Comment 11 Hans de Goede 2009-03-24 14:06:08 UTC

Created attachment 336474 [details]
bash script to run the program.

Comment 12 Hans de Goede 2009-03-24 14:08:10 UTC

Note that by loading the host machine, I've seen the "measured" amount of virtual CPU time given to the virtual machine in 1 second drop by as much as a factor of 30.

So we may need to sleep up to 30 times as long.

Comment 13 Marcelo Tosatti 2009-03-24 14:52:19 UTC

Should use a notification instead of assuming a certain amount of work can 
be accomplished in a certain wallclock period?

Comment 14 Hans de Goede 2009-03-24 15:00:30 UTC

(In reply to comment #13)
> Should use a notification instead of assuming a certain amount of work can 
> be accomplished in a certain wallclock period?  

We are using events. The problem is that the kernel inside the virtual machine does not even get the time to generate the events, so the event queue is empty, so we assume the kernel is done *scanning hardware*. This is a rather hard problem, which comes down mainly to the kernel <-> userspace interface for hardware scanning not giving us enough info.

Comment 15 Marcelo Tosatti 2009-03-24 15:08:22 UTC

(In reply to comment #14)
> (In reply to comment #13)
> > Should use a notification instead of assuming a certain amount of work can 
> > be accomplished in a certain wallclock period?  
> 
> We are using events. The problem is that the kernel inside the virtual machine
> does not even get the time to generate the events, so the event queue is empty,
> so we assume the kernel is done *scanning hardware*. This is a rather hard
> problem, which comes down mainly to the kernel <-> userspace interface for
> hardware scanning not giving us enough info. 

There must be some object/state visible in userspace that you can regularly poll on?
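
As an illustration of the polling idea raised here, the sketch below waits for a device node to appear, backing off between checks and giving up after a deadline. It is only a sketch under stated assumptions: the helper name wait_for_path, the default timeout, and the backoff values are hypothetical, and this is not the fix that later went into anaconda. Note that the deadline is still wall-clock time, so on a badly starved guest it would need to be generous.

```python
import os
import time


def wait_for_path(path, timeout=30.0, interval=0.1, max_interval=2.0):
    """Poll until `path` exists or `timeout` seconds of wall-clock time pass.

    Hypothetical helper for illustration. Returns True if the path
    appeared in time, False otherwise.
    """
    deadline = time.monotonic() + timeout
    while True:
        if os.path.exists(path):
            return True
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            return False
        # Sleep briefly, never past the deadline, then back off so a
        # slow guest is not hammered with checks.
        time.sleep(min(interval, remaining))
        interval = min(interval * 2, max_interval)
```

A caller might use it as `wait_for_path("/dev/sda1", timeout=60.0)` before attempting the mount, and report a clear error if it returns False.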

Comment 16 Richard W.M. Jones 2009-03-24 15:47:57 UTC

I wrote this kernel patch a long time ago to fix this
problem ...

http://lkml.indiana.edu/hypermail/linux/kernel/0706.1/1638.html

Comment 17 Chris Lumens 2009-03-24 18:53:00 UTC

*** Bug 491945 has been marked as a duplicate of this bug. ***

Comment 18 David Lehman 2009-03-25 01:30:07 UTC

This should be fixed in anaconda-11.5.0.37-1.

Comment 19 Mark McLoughlin 2009-03-25 09:06:19 UTC

For reference, the commit was:

http://git.fedorahosted.org/git/anaconda.git?p=anaconda.git;a=commitdiff;h=0bb7d5413f

Comment 20 Adam Williamson 2009-04-22 16:54:05 UTC

This is on the preview blocker list: can any of you confirm the claimed fix?

-- 
Fedora Bugzappers volunteer triage team
https://fedoraproject.org/wiki/BugZappers

Comment 21 Jesse Keating 2009-04-23 18:10:17 UTC

I'm not able to see it anymore.

Comment 22 Chris Lumens 2009-06-08 13:39:08 UTC

*** Bug 504408 has been marked as a duplicate of this bug. ***
