Bug 1408371 - DHCP server is not ready on time to respond to VMs requests
Summary: DHCP server is not ready on time to respond to VMs requests
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Virtualization Tools
Classification: Community
Component: libvirt
Version: unspecified
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
Assignee: Libvirt Maintainers
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-12-23 07:07 UTC by Yaniv Kaul
Modified: 2017-01-03 19:41 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 1411025 (view as bug list)
Environment:
Last Closed: 2017-01-03 19:41:08 UTC
Embargoed:


Attachments (Terms of Use)

Description Yaniv Kaul 2016-12-23 07:07:22 UTC
Description of problem:
When disabling the boot menu, and with a VM that loads quickly (disabled grub's 5 seconds timeout), it appears that libvirt's DHCP server may not be ready to respond to DHCP requests.

I am running multiple VMs, concurrently, each with 4 NICs - and some of them fail to get an IP at a time. By simply restoring the boot menu, this problem goes away.

Version-Release number of selected component (if applicable):
libvirt-2.2.0-2.fc25.x86_64

How reproducible:
Sometimes.

Steps to Reproduce:
1. Launch multiple VMs, with quick boot.

Actual results:
Some interfaces do not get an IP.

Expected results:
All IPs get an IP.

Comment 1 Laine Stump 2017-01-03 19:06:43 UTC
If this is a setup using a libvirt-managed virtual network, dnsmasq is started and running all the way back when the libvirt network is started (which usually happens when libvirtd is first started after booting the host), so it's not likely that the problem is the DHCP server (which is in dnsmasq) not being ready. A more common problem is that the dhcp client in the guest times out while the guest's tap interface (its connection to the libvirt network's bridge) is still in the STP forward delay period (during which any guest network traffic is discarded rather than being forwarded). If you have STP enabled in the network, does it have delay='0'? If you reboot the guest (without shutting it down, which would also detach/reattach the tap device thus restarting the STP delay) does it then work?

If your setup isn't using a libvirt-managed network, then neither the DHCP server nor the STP forwarding delay are under libvirt's control, so the solution lies elsewhere.

Comment 2 Yaniv Kaul 2017-01-03 19:41:08 UTC
Closing - I think we have found an issue that might explain it - we had duplicate IPv6 addresses configured for the GWs of different networks.

In any case, it does not reproduce now that we've fixed that - so hopefully it was that.


Note You need to log in before you can comment on or make changes to this bug.