Bug 1316774 - vm failed to start with mlock
Summary: vm failed to start with mlock
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 7
Classification: Red Hat
Component: libvirt
Version: 7.3
Hardware: ppc64le
OS: Linux
Priority: high
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: Andrea Bolognani
QA Contact: Virtualization Bugs
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2016-03-11 03:46 UTC by Wayne Sun
Modified: 2017-08-01 23:51 UTC (History)

Fixed In Version: libvirt-3.1.0-1.el7
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-08-01 17:09:12 UTC
Target Upstream Version:
Embargoed:


Attachments
vm xml with mlock (2.24 KB, text/plain), 2016-03-11 07:56 UTC, Wayne Sun
vm xml with mlock (2.29 KB, text/plain), 2016-03-11 07:59 UTC, Wayne Sun


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Product Errata | RHEA-2017:1846 | 0 | normal | SHIPPED_LIVE | libvirt bug fix and enhancement update | 2017-08-01 18:02:50 UTC

Description Wayne Sun 2016-03-11 03:46:03 UTC
Description of problem:
The VM fails to start when memory locking (mlock) is configured.

Version-Release number of selected component (if applicable):
# rpm -q libvirt qemu-kvm-rhev kernel
libvirt-1.3.2-1.el7.ppc64le
qemu-kvm-rhev-2.5.0-2.el7.ppc64le
kernel-3.10.0-362.el7.ppc64le

How reproducible:
always

Steps to Reproduce:
1. Start a VM with the mlock configuration:
# virsh dumpxml avocado-vt-vm1
...
  <memoryBacking>
    <locked/>
  </memoryBacking>
...

# virsh start avocado-vt-vm1
error: Failed to start domain avocado-vt-vm1
error: internal error: process exited while connecting to monitor: mlockall: Cannot allocate memory
2016-03-11T03:14:19.638234Z qemu-kvm: locking memory failed

strace log:
61177 --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=61364, si_status=0, si_utime=0, si_stime=0} ---
61451 setrlimit(RLIMIT_MEMLOCK, {rlim_cur=20480*1024, rlim_max=20480*1024}) = 0
61450 +++ exited with 0 +++
61177 --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=61450, si_status=0, si_utime=0, si_stime=0} ---
61451 syscall_360(0x48f168d8, 0x1, 0x3ffffd31bed4, 0x48b75b70, 0x48b760d0, 0xffffffffffffffff) = -1 (errno 38)
61460 +++ exited with 1 +++
61461 +++ exited with 1 +++


Actual results:
The VM fails to start.

Expected results:
The VM starts successfully.

Additional info:

Comment 2 Jiri Denemark 2016-03-11 07:20:02 UTC
Could you waste a little bit more disk space and always attach the full XML of the domain? According to the strace log it looks like you tried to set 20 MB memory limit, which is likely not something you wanted to do.
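
As an illustration (not part of the original comment), the limit in the strace log can be decoded with simple arithmetic. A minimal Python sketch, where the helper name bytes_to_mib is hypothetical:

```python
# Decode the setrlimit() argument seen in the strace log:
# rlim_cur=20480*1024, expressed in bytes.
def bytes_to_mib(n_bytes: int) -> float:
    """Convert a byte count to MiB (hypothetical helper, not part of libvirt)."""
    return n_bytes / (1024 * 1024)


limit_bytes = 20480 * 1024
print(limit_bytes, bytes_to_mib(limit_bytes))  # 20971520 20.0
```

So the process was limited to locking 20 MiB, far less than a typical guest's memory.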

Comment 3 Wayne Sun 2016-03-11 07:56:09 UTC
Created attachment 1135133 [details]
vm xml with mlock

I attached the VM XML; it contains only the mlock configuration.
Yes, the 20 MB limit is not expected.

Comment 4 Wayne Sun 2016-03-11 07:59:24 UTC
Created attachment 1135134 [details]
vm xml with mlock

Sorry, I pasted the wrong one in the last comment; please check the updated one.

Comment 5 Jiri Denemark 2016-03-11 08:16:22 UTC
I think the documentation is pretty clear at http://libvirt.org/formatdomain.html#elementsMemoryBacking

...
locked
    When set and supported by the hypervisor, memory pages belonging to the domain will be locked in host's memory and the host will not be allowed to swap them out. For QEMU/KVM this requires hard_limit memory tuning element to be used and set to the maximum memory configured for the domain plus any memory consumed by the QEMU process itself.

I'd say it's a user error, but I'm not sure whether we perhaps relaxed the requirement to set hard_limit for Power... Andrea?

Comment 6 Andrea Bolognani 2016-03-21 13:34:41 UTC
(In reply to Jiri Denemark from comment #5)
> I think the documentation is pretty clear at
> http://libvirt.org/formatdomain.html#elementsMemoryBacking
> 
> ...
> locked
>     When set and supported by the hypervisor, memory pages belonging to the
> domain will be locked in host's memory and the host will not be allowed to
> swap them out. For QEMU/KVM this requires hard_limit memory tuning element
> to be used and set to the maximum memory configured for the domain plus any
> memory consumed by the QEMU process itself.
> 
> I'd say it's a user error, but I'm not sure whether we perhaps relaxed the
> requirement to set hard_limit for Power... Andrea?

Not at all, the requirement is the same as x86 and as documented.

The call to setrlimit() the reporter is seeing is because, on
ppc64, some memory has to be locked regardless of whether the
<memoryBacking><locked> element is present or not.

20 MiB makes sense here because only some small caches and lookup
tables need to be locked all the time - the bulk of the guest
memory doesn't have such a requirement.

So I agree it's user error; on the other hand, maybe we should
take this chance to improve libvirt so that some kind of
meaningful error message is reported whenever the user has
specified <memoryBacking><locked> but <memtune><hard_limit> is
not present in the guest configuration? With much lower
severity and priority, of course :)

Comment 8 Andrea Bolognani 2017-02-06 17:31:35 UTC
Patch posted upstream:

  https://www.redhat.com/archives/libvir-list/2017-February/msg00180.html

Comment 9 Andrea Bolognani 2017-02-07 11:38:51 UTC
v2 patch posted upstream:

  https://www.redhat.com/archives/libvir-list/2017-February/msg00214.html

Comment 10 Jaroslav Suchanek 2017-02-07 12:15:31 UTC
(In reply to Andrea Bolognani from comment #9)
> v2 patch posted upstream:
> 
>   https://www.redhat.com/archives/libvir-list/2017-February/msg00214.html

Was it accepted?

Comment 11 Andrea Bolognani 2017-02-07 13:17:48 UTC
Nope :)

But v3 has just been posted upstream:

  https://www.redhat.com/archives/libvir-list/2017-February/msg00227.html

Comment 12 Andrea Bolognani 2017-02-07 17:45:06 UTC
The fix has been pushed upstream.

commit c2e60ad0e5124482942164e5fec088157f5e716a
Author: Andrea Bolognani <abologna>
Date:   Mon Feb 6 17:54:49 2017 +0100

    qemu: Forbid <memoryBacking><locked> without <memtune><hard_limit>
    
    In order for memory locking to work, the hard limit on memory
    locking (and usage) has to be set appropriately by the user.
    
    The documentation mentions the requirement already: with this
    patch, it's going to be enforced by runtime checks as well,
    by forbidding a non-compliant guest from being defined as well
    as edited and started.
    
    Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1316774

v3.0.0-123-gc2e60ad
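
The rule the commit enforces can be illustrated outside libvirt. The following is a minimal Python sketch of the same condition, not the actual libvirt implementation (which is written in C); the function name and the two trimmed-down domain definitions are hypothetical and exist only for illustration:

```python
import xml.etree.ElementTree as ET


def locked_without_hard_limit(domain_xml: str) -> bool:
    """Return True if <memoryBacking><locked/> is set but
    <memtune><hard_limit> is missing (hypothetical helper)."""
    root = ET.fromstring(domain_xml)
    has_locked = root.find("./memoryBacking/locked") is not None
    has_hard_limit = root.find("./memtune/hard_limit") is not None
    return has_locked and not has_hard_limit


# Illustrative, trimmed-down domain definitions (not the attached XML).
BAD = """<domain type='kvm'>
  <memoryBacking><locked/></memoryBacking>
</domain>"""

GOOD = """<domain type='kvm'>
  <memtune><hard_limit unit='KiB'>2000000</hard_limit></memtune>
  <memoryBacking><locked/></memoryBacking>
</domain>"""

print(locked_without_hard_limit(BAD))   # True
print(locked_without_hard_limit(GOOD))  # False
```

A check like this could be run against the output of virsh dumpxml before starting a guest on a libvirt version that predates the fix.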

Comment 14 Dan Zheng 2017-03-06 09:47:47 UTC
Test packages:

libvirt-3.1.0-1.el7.ppc64le
qemu-kvm-rhev-2.8.0-5.el7.ppc64le
kernel-3.10.0-578.el7.ppc64le

Steps:
1. Configure the guest XML with the following parts:

...
  <memory unit='KiB'>1048576</memory>
  <currentMemory unit='KiB'>1048576</currentMemory>
  <memtune>
    <hard_limit unit='KiB'>2000000</hard_limit>
  </memtune>
  <memoryBacking>
    <locked/>
  </memoryBacking>

...
  <os>
    <type arch='ppc64le' machine='pseries-rhel7.4.0'>hvm</type>
    <boot dev='hd'/>
  </os>

...

2. Start the guest; it starts up successfully.
3. Log in to the guest; normal operations work without problems.
# virsh start dd
Domain dd started

# virsh console dd
Connected to domain dd
Escape character is ^]
CF000012
CF000015ch
Linux ppc64le
#1 SMP Wed Oct 1
Red Hat Enterprise Linux Server 7.3 (Maipo)
Kernel 3.10.0-514.el7.ppc64le on an ppc64le

localhost login: root
Password: 
[root@localhost ~]# ls
original-ks.cfg
[root@localhost ~]# pwd
/root
[root@localhost ~]# 

# virsh destroy dd
Domain dd destroyed


So the test passes.
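
As an illustration of the configuration used in this test, the headroom between the hard limit and the guest memory can be computed from the XML. This is a minimal Python sketch: the wrapping <domain> element and the helper name are assumptions, and it assumes both values use KiB as in the snippet above.

```python
import xml.etree.ElementTree as ET

# Memory-related elements from the test configuration, wrapped in a
# <domain> root element (the wrapper is an assumption for parsing).
SNIPPET = """<domain>
  <memory unit='KiB'>1048576</memory>
  <memtune>
    <hard_limit unit='KiB'>2000000</hard_limit>
  </memtune>
</domain>"""


def headroom_kib(domain_xml: str) -> int:
    """Return hard_limit minus memory, assuming both are in KiB
    (hypothetical helper)."""
    root = ET.fromstring(domain_xml)
    memory = int(root.findtext("./memory"))
    hard_limit = int(root.findtext("./memtune/hard_limit"))
    return hard_limit - memory


print(headroom_kib(SNIPPET))  # 951424
```

The 951424 KiB (about 929 MiB) left above the guest memory is what remains for the QEMU process itself, in line with the documented requirement that hard_limit covers the guest's maximum memory plus QEMU's own consumption.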

Comment 15 errata-xmlrpc 2017-08-01 17:09:12 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2017:1846

