Bug 1721395 - disk boot delay and high cpu
Summary: disk boot delay and high cpu
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 8
Classification: Red Hat
Component: device-mapper-multipath
Version: 8.4
Hardware: Unspecified
OS: Unspecified
Priority: medium
Severity: unspecified
Target Milestone: rc
Target Release: 8.5
Assignee: Ben Marzinski
QA Contact: Lin Li
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2019-06-18 08:14 UTC by Kapetanakis Giannis
Modified: 2021-09-06 15:21 UTC
CC List: 11 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-03-15 07:36:50 UTC
Type: Bug
Target Upstream Version:
Embargoed:


Attachments
VM boot strace (397.01 KB, application/gzip), 2019-06-18 08:14 UTC, Kapetanakis Giannis
VM open fd (2.51 KB, text/plain), 2019-06-18 08:15 UTC, Kapetanakis Giannis
dmesg with multipath (33.52 KB, text/plain), 2019-06-18 17:27 UTC, Kapetanakis Giannis
dmesg without multipath (33.66 KB, text/plain), 2019-06-18 17:28 UTC, Kapetanakis Giannis

Description Kapetanakis Giannis 2019-06-18 08:14:32 UTC
Created attachment 1581600 [details]
VM boot strace

Hi,

Not sure if this belongs here but I'll try anyway since vdsm is the closest match.

We've recently migrated from FC to 10 Gbps iSCSI on an EMC Unity 350F storage array.
The active controller on the EMC is connected to two switches, with a different VLAN per port. The nodes are connected to the two switches with bonding (active-backup).
multipathd runs on top of this.
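
For context, a device-mapper-multipath configuration along these lines would produce the ALUA/service-time layout shown in the multipath output below; this is only an illustrative sketch, and none of the values are taken from the affected hosts:

# /etc/multipath.conf - illustrative sketch, not the actual config from this system
defaults {
    user_friendly_names no
    find_multipaths     yes
}
devices {
    device {
        vendor               "DGC"
        product              ".*"
        path_grouping_policy "group_by_prio"
        path_selector        "service-time 0"
        prio                 "alua"
        hardware_handler     "1 alua"
        failback             "immediate"
        no_path_retry        60
    }
}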

We have problems booting VMs with IDE disks.

- Boot stops after SeaBIOS at "Booting from Hard Disk".
- The CPU of the VM process is at 100%.
- Sometimes I've also seen VMs with virtio disks stall for 10-20 seconds during boot when scanning LVM (after swap).

The setup is up to date:
kernel-3.10.0-957.21.2.el7.x86_64
vdsm-4.30.17-1.el7.x86_64
qemu-kvm-ev-2.12.0-18.el7_6.5.1.x86_64

The funny thing is that if I log out from one iSCSI (multipath) session, the boot continues:

360060160f1c04800f55b065c6c7cff11 dm-10 DGC     ,VRAID           
size=4.0T features='2 queue_if_no_path retain_attached_hw_handler' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| |- 7:0:0:0  sdf 8:80  active ready running
| `- 8:0:0:0  sdg 8:96  active ready running
`-+- policy='service-time 0' prio=10 status=enabled
  |- 9:0:0:0  sdh 8:112 active ready running
  `- 10:0:0:0 sdi 8:128 active ready running

Logging out from either sdf or sdg makes the boot continue.
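
For reference, dropping and later restoring a single iSCSI path like that is normally done with iscsiadm; the target name and portal address below are placeholders, not values from this setup:

iscsiadm -m session                                                # list active sessions
iscsiadm -m node -T iqn.1992-04.com.emc:example -p 192.0.2.10:3260 --logout
# ...and to bring the path back afterwards:
iscsiadm -m node -T iqn.1992-04.com.emc:example -p 192.0.2.10:3260 --login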

In an strace of the process I see a lot of:
- EAGAIN (Resource temporarily unavailable)
- ppoll resumed Timeout
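
A trace like the attached one can be captured roughly as follows; the flags and output file are an assumption about how it was taken, not something stated in this report:

# attach to the running qemu-kvm process, follow its threads,
# timestamp every syscall and record how long each one took
strace -f -tt -T -o vm-boot.strace -p <qemu-kvm PID>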

After the VM has booted, if I then re-login to all the iSCSI sessions, I don't see any problems.

I don't see any errors on the network interfaces, either on the node or on the switches.

Comment 1 Kapetanakis Giannis 2019-06-18 08:15:16 UTC
Created attachment 1581601 [details]
VM open fd

Comment 2 Kapetanakis Giannis 2019-06-18 17:26:21 UTC
So, the problem is not completely solved by changing the disk to virtio.

See the dmesg output of the same VM:

1) with multipath enabled and iSCSI access to both ports of the storage
2) with iSCSI access to only one port, logged out from all other ports of the storage
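
One way to compare boot times between the two runs (assuming the guest runs systemd; these commands are a general suggestion, not what was used for the attached logs):

# inside the guest, after each boot:
dmesg -T > dmesg-run.txt      # kernel log with human-readable timestamps
systemd-analyze               # total kernel + userspace boot time
systemd-analyze blame         # which units took the longest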

Comment 3 Kapetanakis Giannis 2019-06-18 17:27:57 UTC
Created attachment 1581802 [details]
dmesg with multipath

Comment 4 Kapetanakis Giannis 2019-06-18 17:28:28 UTC
Created attachment 1581803 [details]
dmesg without multipath

much faster boot time (and reboot)

Comment 5 Tal Nisan 2019-07-08 14:55:41 UTC
Hi Martin,
Can you please assign it to the relevant team? It seems to be out of the RHV Storage team's scope.

Comment 6 Kapetanakis Giannis 2019-07-09 07:45:08 UTC
Just for the record, I've already contacted Dell support about the storage and it doesn't seem to be a storage (device) problem.

Comment 7 Martin Tessun 2019-07-22 15:24:53 UTC
Moving this to multipath for now, as logging out of one mpath target seems to fix the issue.

The I/O timeouts also seem to lead to that conclusion.

Comment 9 Ben Marzinski 2020-11-04 21:43:04 UTC
Are you still able to reproduce this? Do you have the log messages from when this occurs?

Comment 10 Kapetanakis Giannis 2020-11-05 18:27:43 UTC
Yes, I still see this. It definitely happens after VM installation, on first boot.
It happens with virtio as well, but the delay is not as big.

I'm using ovirt 4.3.10 and haven't moved to 4.4 yet

What kind of logs do you need?

Comment 11 Ben Marzinski 2020-11-18 15:14:50 UTC
It's likely too late for this to make it in RHEL7. Do you know if this happens in RHEL8?

Comment 12 Kapetanakis Giannis 2020-11-23 07:26:09 UTC
Haven't tested this in RHEL8...

Wild guess: 
is there a chance this is related to kvm_intel preemption_timer?

I have similar problems with virtio disks (not only ide)

too many events=POLLIN timeouts

Comment 13 Ben Marzinski 2020-12-02 17:46:14 UTC
(In reply to Kapetanakis Giannis from comment #12)
> Haven't tested this in RHEL8...

It would be really helpful to know if this is still happening in RHEL8.  RHEL7 is closed to all but the most serious errors, and this doesn't qualify.

> Wild guess: 
> is there a chance this is related to kvm_intel preemption_timer?

Possibly. I have a hard time figuring out how the multipath device is even involved, since it works fine after boot, and from the multipath device's perspective IO is the same whether a guest is booting or not.  It's possible that there is some different sort of access pattern to the device when a guest is booting, but I don't see why having only one iSCSI session would make that better.

What process are you stracing when you see all the poll event timeouts?

> I have similar problems with virtio disks (not only ide)
> 
> too many events=POLLIN timeouts

Comment 14 Kapetanakis Giannis 2020-12-02 20:40:45 UTC
(In reply to Ben Marzinski from comment #13)
> It would be really helpful to know if this is still happening in RHEL8.
> 
> What process are you stracing when you see all the poll event timeouts?

I'm stracing qemu-kvm on the hypervisor nodes.

I also see similar delays when upgrading OpenBSD VMs. The upgrade reboots the VM with a minimal kernel and then extracts tar archives from and to the filesystem.
It hangs for a while and then continues, over and over... lots of events=POLLIN timeouts in the strace.

Anyway, I will upgrade to RHEL8 and see what the status is there.

thanks

Comment 15 Ben Marzinski 2021-01-11 20:15:06 UTC
Moving this over to RHEL8, to see if it can be reproduced there, since it is too late to get any fix into RHEL7.

Comment 17 Kapetanakis Giannis 2021-01-12 12:45:33 UTC
I'm evaluating the same thing on RHEL 8.3.2011.

The situation seems the same. A lot of:

14:36:29 ppoll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7, events=POLLIN}, {fd=8, events=POLLIN}, {fd=13, events=POLLIN}, {fd=16, events=POLLIN}, {fd=18, events=POLLIN}, {fd=29, events=POLLIN}, {fd=31, events=POLLIN}, {fd=32, events=POLLIN}, {fd=33, events=POLLIN}, {fd=36, events=POLLIN}], 12, {tv_sec=0, tv_nsec=996416}, NULL, 8) = 0 (Timeout) <0.001011>
14:36:29 ppoll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7, events=POLLIN}, {fd=8, events=POLLIN}, {fd=13, events=POLLIN}, {fd=16, events=POLLIN}, {fd=18, events=POLLIN}, {fd=29, events=POLLIN}, {fd=31, events=POLLIN}, {fd=32, events=POLLIN}, {fd=33, events=POLLIN}, {fd=36, events=POLLIN}], 12, {tv_sec=0, tv_nsec=996427}, NULL, 8) = 0 (Timeout) <0.001011>

The setup is a physical box connected via iSCSI to the same storage, with LVM.
VMs are set up with libvirt (virt-manager) on KVM.

The hangs occur while upgrading an OpenBSD VM.
see https://bugzilla.redhat.com/show_bug.cgi?id=1721395#c14

I've also tried (again) logging out of one multipath path; the timeouts stopped and the disk stalls also stopped.
So the situation is the same.

I believe you can replicate it easily.

install openbsd 6.7 (amd64)
boot
login
sysupgrade
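
For anyone trying to reproduce this, a guest created roughly like the following, backed directly by one of the multipath devices, should be enough to run the steps above; every name, path, and value here is a placeholder rather than something taken from this environment:

virt-install \
    --name openbsd-repro \
    --memory 2048 \
    --vcpus 2 \
    --disk path=/dev/mapper/<multipath WWID>,bus=virtio \
    --cdrom /var/lib/libvirt/images/install67.iso \
    --os-variant generic \
    --graphics vnc

# then, inside the guest, after installation and first login:
sysupgrade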

Comment 19 RHEL Program Management 2021-03-15 07:36:50 UTC
After evaluating this issue, there are no plans to address it further or fix it in an upcoming release.  Therefore, it is being closed.  If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened.

