Bug 901786
| Summary: | After upgrade to Fedora 18 using FedUp, system does not find any LVM volumes and fails to boot | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Carl A. Adams <carlalex> | ||||
| Component: | kernel | Assignee: | Kernel Maintainer List <kernel-maint> | ||||
| Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 18 | CC: | agk, bmarzins, bmr, carlalex, donkishoot, dwysocha, gansalmon, heinzm, itamar, jonathan, kernel-maint, l4coa3fnjplr, lvm-team, madhu.chinakonda, msnitzer, prajnoha, prockai, sam, zkabelac | ||||
| Target Milestone: | --- | ||||||
| Target Release: | --- | ||||||
| Hardware: | x86_64 | ||||||
| OS: | Linux | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2013-04-22 14:24:17 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
Carl A. Adams
2013-01-19 07:51:40 UTC
The only thing slightly weird about this system that I can think of is I have VMWare Workstation installed; but I don't see any obvious indication that this is related. Google'd around. I attempted to regenerate the initrd with dracut per defect #827651. This did not help Working on another hunch. I moved the disk from the internal SATA-III controller to the SATA-II controller. This fixed my issue, and the 3.7.X kernels now boot the system. As my disk is only a SATA-II, this really doesn't matter that much for me. The unrecognized SATA-III controller is a Marvell controller. The working SATA-II controller is Intel ICH10. Motherboard is ASUS Rampage III Gene LGA 1366 Intel X58 (http://www.newegg.com/Product/Product.aspx?Item=N82E16813131658) There are other messages from the kernel at boot-up, and there system now has a ~30 second delay near the start of the boot sequence it used to not have. Lots of complaining about IOMMU and broken bios. dmesg attached. Created attachment 683323 [details]
dmesg with errors.
(In reply to comment #3) > Working on another hunch. I moved the disk from the internal SATA-III > controller to the SATA-II controller. This fixed my issue, and the 3.7.X > kernels now boot the system. This does not seem to be an LVM problem, reassigning the component to kernel. > There are other messages from the kernel at boot-up, and there system now > has a ~30 second delay near the start of the boot sequence it used to not > have. Lots of complaining about IOMMU and broken bios. dmesg attached. Also: [ 6.226167] ata14.00: qc timeout (cmd 0xa1) [ 6.725113] ata14.00: failed to IDENTIFY (I/O error, err_mask=0x4) [ 7.532664] ata14: SATA link up 1.5 Gbps (SStatus 113 SControl 300) [ 17.517131] ata14.00: qc timeout (cmd 0xa1) [ 18.017343] ata14.00: failed to IDENTIFY (I/O error, err_mask=0x4) [ 18.017349] ata14: limiting SATA link speed to 1.5 Gbps [ 18.824216] ata14: SATA link up 1.5 Gbps (SStatus 113 SControl 310) [ 48.779540] ata14.00: qc timeout (cmd 0xa1) [ 49.279642] ata14.00: failed to IDENTIFY (I/O error, err_mask=0x4) [ 50.085527] ata14: SATA link up 1.5 Gbps (SStatus 113 SControl 310) I have this problem to, how I can fix it? This appears to be an SELinux issue. It is not that the LVM is gone, it is that the system no longer has permission to use it. On my system the entire /dev directory had no security context. Everything was labelled with a ? 1. On first reboot after Fed Up a GUI login prompt is presented, but users can't login. Even root can't login either at the GUI or at a CTRL-ALT-F2 console prompt. 2. On second reboot after Fed Up attempting to boot at init 3 the boot aborts and dumps the user to dracut# after reporting that the root logical volume is not present. From there I could see that it was in fact present, but that it had a security context of ? 3. On the third reboot after Fed Up I set selinux=0 enforcing=0 3 in GRUB to come up to a console where I could in fact login as root. 4. On the fourth reboot after Fed Up I set selinux=1 and waited for the system to relabel itself. 5. On the fifth reboot after Fed Up and selinux relabel the system came up and runs as it should. I don't think it's only an SELinux issue. I have the same problem on my old Dell PowerEdge 2450, and adding selinux=0 enforcing=0 (I stop at level 3 on this system, console only) I still had the same problem. And I always have SELinux disabled back several Fedora versions. But earlier in the boot I get: aic7xxx: no multichannel peer found! Well, the root partition is on the AIC-7899 SCSI drive. So I'm thinking that the aic7xxx module isn't getting loaded properly in the fedup boot, it really can't see the root partition. The machine boots fine into Fedora 17, and did before into Fedora 16. I'm not really sharp on grub2 yet, so I don't know how to force aic7xxx to be preloaded like when I can roll my own initrd in /boot with mkinitrd. I can't make a custom initrd in /boot for fedup because fedup doesn't put any modules under /lib for the Fedora 18 kernel. Caught in a trap - don't have the module I need to make an initrd so I can boot, but I need the module preloaded to boot up. It seems. Or maybe it's something else, but the aic7xxx error early in the boot looks ominous. You can go further but it seems not to be a solution. While on dracut cli do : # blkid check if you see some volume # lvm vgscan # blkid check if you see your VG # lvm vgchange -ay # blkid check if you see your logical volumes Next you exit (#exit) dracut and fedup continue the upgrade process but it seems not to boot after :( After you will probably have another bug where you will have to delete in /boot/grub2/grub.cfg the end of the linux boot line (all behind KEYTABLE= parameter) Are you still seeing this with the 3.8.5 kernel? This bug is being closed with INSUFFICIENT_DATA as there has not been a response in 2 weeks. If you are still experiencing this issue, please reopen and attach the relevant data from the latest kernel you are running and any data that might have been requested previously. I gave up on this. I saw no way to get the machine to boot up under Fedup when it can't see the AIC-7899 SCSI drive. I'm sitting on Fedora 17 and will probably do a full install of Fedora 18 sometime, if I care enough. |