Bug 1208376 - vdsm unpersists unified persistence network devices on RHEV-H
Summary: vdsm unpersists unified persistence network devices on RHEV-H
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: vdsm
Version: 3.5.1
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 3.5.4
Assignee: Ido Barkan
QA Contact: Michael Burman
URL:
Whiteboard: network
: 1212714 (view as bug list)
Depends On: 1203422 1249396 1249397
Blocks:
TreeView+ depends on / blocked
 
Reported: 2015-04-02 06:38 UTC by haiyang,dong
Modified: 2016-02-10 19:59 UTC (History)
22 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2015-09-14 09:26:50 UTC
oVirt Team: Network
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
attached var logs for backup 6.5 version (11.41 MB, application/x-tar)
2015-04-02 06:38 UTC, haiyang,dong
no flags Details
attached sosreport logs for back version 6.5 (6.15 MB, application/x-xz)
2015-04-02 06:43 UTC, haiyang,dong
no flags Details

Description haiyang,dong 2015-04-02 06:38:27 UTC
Created attachment 1009975 [details]
attached var logs for backup 6.5 version

Description of problem:
network configuration files of the backup 6.5 version missed after upgrade rhevh 6.5 via rhevm portal to rhevh6.6

Version-Release number of selected component (if applicable):
RHEVH 6.5 20150115
rhev-hypervisor6-6.6-20150327.0
ovirt-node-3.2.2-1.el6.noarch
RHEV-M VT14.1 (3.5.1-0.2.el6ev)

How reproducible:
100%

Steps to Reproduce:
1. Installed RHEVH 6.5 20150115 with dhcp network,
2. Add rhevh via rhevm portal into 3.4 compatibility version in rhevm 
3.5.1-0.2.el6ev
4. Upgrade rhevh 6.5 via rhevm portal to rhev-hypervisor6-6.6-20150327.0
5. login the new version 6.6 and check the notwork status
6. Reboot the new version 6.6 and enter into backup 6.5 version and check
the notwork status

Actual results:
After 5, Could found the network configuration files and it's status is up.
After 6, network configuration files of the backup 6.5 version missed and it's status is down.

Expected results:
Could found the network configuration files and it's status is up for step5 and step6

Additional info:

Comment 1 haiyang,dong 2015-04-02 06:43:33 UTC
Created attachment 1009976 [details]
attached sosreport logs for back version 6.5

Comment 2 Fabian Deutsch 2015-04-02 13:13:03 UTC
Yaniv, could this be a side effect of the unified persistence?

RHEV-H 6.5 is using RHEV 3.4.z
RHEV-H 6.6 is using RHEV 3.5.z

IIUIC the once 3.5.z is booted, all network configuration get's migrated to the unified persistence, which includes removing the ifcfg files. If a user now boots back into RHEV-H 6.5/3.4.z, then no ifcfg files are around - and that vdsm does not understand the unified persistence.

Is this correct?

Comment 3 Fabian Deutsch 2015-04-02 19:59:15 UTC
Haiyang, is this a dupe of bug 1194068?

Comment 4 haiyang,dong 2015-04-08 05:35:23 UTC
(In reply to Fabian Deutsch from comment #3)
> Haiyang, is this a dupe of bug 1194068?

Seems it 's not duplicated of bug 1194068:
1. bug 1194068 talked about the network interfaces on RHEV hypervisors that have been upgraded are being set to ONBOOT=no from "ONBOOT=yes"
2. Also from https://bugzilla.redhat.com/show_bug.cgi?id=1206536#c2, we known that bug 1194068 has been fixed in rhev-hypervisor6-6.6-20150327.0, but we find bug 1208376 in rhev-hypervisor6-6.6-20150327.0 version

Comment 5 Fabian Deutsch 2015-04-13 20:48:04 UTC
Thanks Haiyang.

But this looks very similar to bug 1209486.
Can you confirm this, Haiyang?

Comment 6 haiyang,dong 2015-04-14 02:58:41 UTC
(In reply to Fabian Deutsch from comment #5)
> Thanks Haiyang.
> 
> But this looks very similar to bug 1209486.
> Can you confirm this, Haiyang?

seems it's not similar to bug 1209486.
If it's similar to bug 1209486, according to bug 1209486's Actual results,
For my test step 5 "login the new version 6.6 and check the notwork status", it should be there is no networks exists on host, no ifcfg files for those networks. host has no ip.

but the actual test results is:
After 5, Could found the network configuration files and it's status is up.

Bug 1208376 issue just happen on my test step 6 "Reboot the new version 6.6 and enter into backup 6.5 version and check the notwork status".

Comment 7 haiyang,dong 2015-04-14 03:09:06 UTC
(In reply to haiyang,dong from comment #6)
> (In reply to Fabian Deutsch from comment #5)
> > Thanks Haiyang.
> > 
> > But this looks very similar to bug 1209486.
> > Can you confirm this, Haiyang?
> 
> seems it's not similar to bug 1209486.
> If it's similar to bug 1209486, according to bug 1209486's Actual results,
> For my test step 5 "login the new version 6.6 and check the notwork status",
> it should be there is no networks exists on host, no ifcfg files for those
> networks. host has no ip.
> 
> but the actual test results is:
> After 5, Could found the network configuration files and it's status is up.

Seems the root cause happen in here:
After step 5 "login the new version 6.6 and check the notwork status" , i could find the network configuration files in 
[root@dhcp-8-211 admin]# ls /etc/sysconfig/network-scripts/ifcfg-*
/etc/sysconfig/network-scripts/ifcfg-em1
/etc/sysconfig/network-scripts/ifcfg-lo
/etc/sysconfig/network-scripts/ifcfg-rhevm

but configuration files "ifcfg-em1/ifcfg-rhevm" missed in 
[root@dhcp-8-211 admin]# ls /config/etc/sysconfig/network-scripts/ifcfg-*
/config/etc/sysconfig/network-scripts/ifcfg-lo

it's the reason why after step6 "Reboot the new version 6.6 and enter into backup 6.5 version and check the notwork status", found network configuration files of the backup 6.5 version missed and it's status is down.

> 
> Bug 1208376 issue just happen on my test step 6 "Reboot the new version 6.6
> and enter into backup 6.5 version and check the notwork status".

Comment 8 Fabian Deutsch 2015-04-21 15:22:59 UTC
(In reply to haiyang,dong from comment #0)
…
> RHEVH 6.5 20150115
^^ RHEV-H 3.4.z

> rhev-hypervisor6-6.6-20150327.0
^^ RHEV-H 3.5.1


In RHEV 3.5, vdsm will not persist the ifcfg anymore, this breaks the rollback to the old version, because 3.4.z does not know how to bring up the networks from the unified persistence.

I'd think about persisting the ifcfg files again, even for networks owned by vdsm.

Comment 9 Fabian Deutsch 2015-04-21 15:24:13 UTC
Ido, maybe you have an insight if it makes sense to persist the ifcfg files. IMO it might be safer.

Comment 10 Ido Barkan 2015-04-22 04:37:33 UTC
IMO it will be safer. It will be also make the system more predictable (similar to rhel) and the code simpler. It will also make our customers, QE and GSS less surprised. This is not a trivial change, but should be done soon.

Comment 11 Lior Vernia 2015-04-22 05:48:17 UTC
In my opinion we should solve as much of this mess as possible while we still remember what's going on :) Dan, feel free to override any of these decisions.

Comment 12 Lior Vernia 2015-04-26 13:21:49 UTC
*** Bug 1212714 has been marked as a duplicate of this bug. ***

Comment 16 Michael Burman 2015-07-27 14:37:23 UTC
Fabian,

Just to be sure, i'm going to test this report with latest rhev-h 6.7 and vdsm 4.16.23-1 and not with rhev-h 6.6 like described in original report, so please ACK this.

Scenario that will be tested here is:
Upgrade rhev-h 6.5 3.4.z >> rhev-h latest 6.7 3.5.4(vdsm 4.16.23-1)
verify network configurations are not broken and verify the network configurations on the 6.5 backup as well.

Comment 17 Fabian Deutsch 2015-07-28 08:33:13 UTC
Michael, I can ack that you test it with the 6.7 build.

However, I can not ack the flow, as I'm not into this bug - but I'd recommend to also do a reboot of the machine afterwards, to ensure that the networks also come up right after reboot.

Comment 18 Anatoly Litovsky 2015-08-06 14:05:18 UTC
i am not sure step 6 in the original bug is possible anymore

Comment 19 Michael Burman 2015-08-06 14:24:30 UTC
I have tested all steps, except step 6, like Tolik wrote in comment 18 ^^, i can't back up to 6.5. No such option.

Upgrade from vdsm-4.14.13-2.el6ev >> vdsm-4.16.24-2.el6ev.x86_64
RHEV Hypervisor - 6.5 - 20140930.1.el6ev >> rhev-hypervisor6-6.7-20150804.0.el6ev

Host is up after reboots, all networks exists and up. Setup Networks configuration is done after second reboot and server rebooted for third time with success.

Fabian, is this can be considered as verified ?
Thanks,

Comment 20 Fabian Deutsch 2015-08-06 15:25:45 UTC
IMO it is not yet verified.

In the setup of comment 19 - you just need to boot back into the initially installed rhevh (RHEV Hypervisor - 6.5 - 20140930.1.el6ev)

It is important that the networks will come up then as well.

Comment 21 Meni Yakove 2015-08-10 07:27:56 UTC
How do we boot from the backup?

Comment 22 Fabian Deutsch 2015-08-10 14:40:16 UTC
1. Reboot the host
2. After the BIOS an "countdown" appears, hit <Tab>
3. The grub menu appears, select the backup entry

Comment 23 Meni Yakove 2015-08-12 07:26:44 UTC
Verified on https://brewweb.devel.redhat.com/taskinfo?taskID=9670665


Note You need to log in before you can comment on or make changes to this bug.