Bug 360741 (Roldyx)
Summary: | ICMP crash when live migrating | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Rodrigo roldan <rroldan> | ||||
Component: | xen | Assignee: | Rik van Riel <riel> | ||||
Status: | CLOSED DUPLICATE | QA Contact: | Virtualization Bugs <virt-bugs> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | low | ||||||
Version: | 5.1 | CC: | alain.richard, clalance, syeghiay, xen-maint | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2009-04-10 17:13:29 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 492190 | ||||||
Attachments: |
|
Description
Rodrigo roldan
2007-10-31 19:01:13 UTC
Created attachment 248531 [details]
xend.log Dom0
I detected the problem, it's a rare one. When having a bad mdadm config, we have connection issues. The solution: repair /etc/mdadm.conf or wait for a source code update. Under Centos 5.2 (RHEL 5.2), I get the same behavior : once a domU is migrated from one server to an other, ping stop after just the first packet. Looking more deeply in that issue, I found out that effectively it hangs on "gettimeofday" with an EAGAIN error. In fact the problem is not with ping, but with some part of the clock code. When the problem is present, I get always the same time with the date command : [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# date mer jui 9 14:33:42 CEST 2008 [root@auto127 ~]# After some time (from 5 minutes to 15 minutes), the date works again and so is ping. During the hang, cat /proc/uptime is increasing normaly. Our setup is : dom0 servers under Centos 5.2, 2.6.18-92.1.1.el5xen, clock synchronised with ntpd. domU under Centos 5.2, 2.6.18-92.1.1.el5xen, no ntpd daemon. /proc/sys/xen/independent_wallclock is 0 in dom0 and domU. this is 100% reproductible under our setup. Ah, OK. Note that there are still some possible lingering problems with live migrate ARP stuff, but if your clock is stopping, then it is almost certainly BZ 426861 which we've been tracking. Chris Lalancette To the original reporter, does time stop when this happens as it does to the other reporter? It's been half a year since this bug has seen activity. I believe it is a duplicate of bug 426861 and will close it as such. Feel free to reopen if the bug continues to manifest itself after upgrading to RHEL 5.3. *** This bug has been marked as a duplicate of bug 426861 *** |