Bug 1162844
| Summary: | Network connectivity drops after vdsm runs | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | [Retired] oVirt | Reporter: | Adam Litke <alitke> | ||||||||
| Component: | vdsm | Assignee: | Ido Barkan <ibarkan> | ||||||||
| Status: | CLOSED DUPLICATE | QA Contact: | Gil Klein <gklein> | ||||||||
| Severity: | high | Docs Contact: | |||||||||
| Priority: | unspecified | ||||||||||
| Version: | 3.5 | CC: | alitke, asegurap, bazulay, bugs, danken, ecohen, gklein, ibarkan, iheim, lsurette, mgoldboi, osvoboda, rbalakri, s.kieske, yeylon | ||||||||
| Target Milestone: | --- | ||||||||||
| Target Release: | 3.5.1 | ||||||||||
| Hardware: | Unspecified | ||||||||||
| OS: | Unspecified | ||||||||||
| Whiteboard: | network | ||||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||||
| Doc Text: | Story Points: | --- | |||||||||
| Clone Of: | Environment: | ||||||||||
| Last Closed: | 2014-11-18 15:30:11 UTC | Type: | Bug | ||||||||
| Regression: | --- | Mount Type: | --- | ||||||||
| Documentation: | --- | CRM: | |||||||||
| Verified Versions: | Category: | --- | |||||||||
| oVirt Team: | Network | RHEL 7.3 requirements from Atomic Host: | |||||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||||
| Embargoed: | |||||||||||
| Attachments: |
|
||||||||||
|
Description
Adam Litke
2014-11-11 20:35:04 UTC
Created attachment 956412 [details]
supervdsm.log
Created attachment 956413 [details]
vdsm.log
As suggested by Ondřej Svoboda, I tried disabling NetworkManager and that did not seem to resolve the problem. Adam, did you lose connection after VDSM and superVDSM restarted (not necessarily)? Could you look in engine logs? supervdsm.log MainThread::DEBUG::2014-11-11 14:19:31,399::supervdsmServer::451::SuperVdsm.Server::(main) Terminated normally vdsm.log MainThread::DEBUG::2014-11-11 14:19:26,600::vdsm::58::vds::(sigtermHandler) Received signal 15 There are a couple of not really nice warnings in supervdsm.log when VDSM creates the management network (bridge expected too early -- looks harmless; libvirt network not there -- I don't like this) and also further on (sourceroutethread trying to add the same route over and over again). What puzzles me though is that lines such as the one below indicate some kind of DHCP activity. sourceRoute::DEBUG::2014-11-11 15:24:28,939::sourceroutethread::38::root::(process_IN_CLOSE_WRITE_filePath) Responding to DHCP response in /var/run/vdsm/sourceRoutes/1415737468 I CC'd Toni and Ido. Guys, can you see anything in the logs? I mean, the last observation seems to contradict the connection loss. Created attachment 956855 [details]
engine log
I didn't see anything fishy in the engine.log but here it is for completeness
(In reply to Ondřej Svoboda from comment #4) > Adam, > > did you lose connection after VDSM and superVDSM restarted (not > necessarily)? Could you look in engine logs? Unless I'm missing something, could this be the new soft-fencing in response to storage connection failures? > > supervdsm.log > MainThread::DEBUG::2014-11-11 > 14:19:31,399::supervdsmServer::451::SuperVdsm.Server::(main) Terminated > normally > > vdsm.log > MainThread::DEBUG::2014-11-11 > 14:19:26,600::vdsm::58::vds::(sigtermHandler) Received signal 15 > > There are a couple of not really nice warnings in supervdsm.log when VDSM > creates the management network (bridge expected too early -- looks harmless; > libvirt network not there -- I don't like this) and also further on > (sourceroutethread trying to add the same route over and over again). > > What puzzles me though is that lines such as the one below indicate some > kind of DHCP activity. > > sourceRoute::DEBUG::2014-11-11 > 15:24:28,939::sourceroutethread::38::root::(process_IN_CLOSE_WRITE_filePath) > Responding to DHCP response in /var/run/vdsm/sourceRoutes/1415737468 At this point I may have logged into the box and executed "dhclient ovirtmgmt" in order to rescue the connection. After talking with Adam and asking him to try the patch for https://bugzilla.redhat.com/1142082 the issue has not happened again. Please Adam, if by Monday it still keeps the address mark this bug as duplicate of the one above. Adam - Just to be on the safe side what OS did you run on your mini-dells ? Bug 1116004 was fixed for RHEL 7.1 and 7.0.z (Bug 1148345) (In reply to Barak from comment #9) > Adam - Just to be on the safe side what OS did you run on your mini-dells ? I tried with CentOS 7 and Fedora 20. > Bug 1116004 was fixed for RHEL 7.1 and 7.0.z (Bug 1148345) *** This bug has been marked as a duplicate of bug 1116004 *** |