RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1435198 - Udev/haldaemon consume 100 percent of the CPU for more than an hour when booting with lots of LVM objects
Summary: Udev/haldaemon consume 100 percent of the CPU for more than an hour when boot...
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: hal
Version: 6.7
Hardware: x86_64
OS: Linux
urgent
urgent
Target Milestone: rc
: ---
Assignee: Richard Hughes
QA Contact: Desktop QE
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-03-23 11:32 UTC by Greg Scott
Modified: 2020-05-14 15:49 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-08-15 21:19:56 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1424853 0 urgent CLOSED RHEV-H host activation fails after reboot on setup with large number of lvs 2021-02-22 00:41:40 UTC

Internal Links: 1424853 1451240

Description Greg Scott 2017-03-23 11:32:21 UTC
Description of problem:

RHEVH-6.7 and other RHEVH-6.n systems with lots of LVM objects take more than an hour to boot while haldaemon consumes 100 percent of the CPU and grinds through several thousand LVM objects.


Version-Release number of selected component (if applicable):

RHEVH-6.7-20160104 and others in the RHEVH-6 series.

How reproducible:
Always

Steps to Reproduce:
1. Set up a RHEV environment with 2000 active, stateless VMs and 2000 inactive VMs in a fiberchannel environment with 4 paths to the SAN.  This will add up to ((2000 X 2) + 2000) X 4 = 24000 LVM objects.  These will be a mixture of LVs, VGs, and PVs.
2. Boot one of the RHEV-H hosts
3. Wait a long time
4. It never activates by itself into a RHEV environment.

Actual results:

Once we can finally login to the console or ssh into the host, watch top and notice haldaemon consumes 100 percent of the CPU for at least an hour to grind through all those /dev/mapper objects.  After haldaemon finishes grinding, we can restart the vdsmd service by hand to activate the host into a RHEV environment.

Expected results:

The system should boot in a reasonable amount of time and activate on its own.

Additional info:

Note that RHEVH-6 and RHVH-7 are special cases of RHEL.  RHEV / RHV uses a process named VDSM on each host to filter out LVM objects it doesn't need to activate the VMs on this host. So, at boot time, haldaemon enumerates every single LVM object, and then VDSM filters out the ones it doesn't need.  Not only does this waste a huge amount of time, but haldaemon breaks vdsm by taking so long.

We worked around the problem by disabling haldaemon at boot for large customers.  With haldaemon disabled, boot times drop from one hour plus to around 15 minutes, and the VDSM process works as expected to automatically activate these hosts into their RHEV datacenter.

Haldaemon starts by default with RHEL 6. RHEV engineering needs to know what is lost by disabling haldaemon in RHEVH-6, and the best way to boot RHEL7 / RHVH-7 while carrying a similar load of tens of thousands of LVM objects in /dev/mapper.

Comment 2 Michal Sekletar 2017-03-23 12:48:36 UTC
I don't know all the details about HAL, but in the nutshell, HAL exposes devices and on DBus and provides library (libhal) that applications can use to enumerate devices, get and set device properties and so on.

By removing HAL you lose some capabilities but question is whether you need them. Anyway, HAL is a separate component in RHEL6. Reassigning...


Note You need to log in before you can comment on or make changes to this bug.