Bug 1507433 - oadm migrate storage failed in upgrade process
Summary: oadm migrate storage failed in upgrade process
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 3.7.0
Hardware: x86_64
OS: Linux
unspecified
medium
Target Milestone: ---
: 3.7.0
Assignee: Ben Bennett
QA Contact: Meng Bo
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-10-30 09:18 UTC by ge liu
Modified: 2018-01-08 19:30 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
undefined
Clone Of:
Environment:
Last Closed: 2017-12-18 13:23:26 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2017:3464 0 normal SHIPPED_LIVE Red Hat OpenShift Container Platform 3.7 bug fix and enhancement update 2017-12-18 18:22:05 UTC

Description ge liu 2017-10-30 09:18:52 UTC
Description of problem:

upgrade ocp v3.7.0-0.143.2 to v3.7.0-0.184.0, 'oadm migrate storage' met error below:

["error: clusternetworks/default: ClusterNetwork \"default\" is invalid: [network: Invalid value: \"\": network must be identical to clusterNetworks[0].cidr, hostsubnetlength: Invalid value: 0x0: hostsubnetlength must be identical to clusterNetworks[0].hostSubnetLength]", "summary: total=1031 errors=1 ignored=0 unchanged=964 migrated=66", "info: to rerun only failing resources, add --include=clusternetworks", "error: 1 resources failed to migrate"]

run the command by manual, still met error:

# oadm migrate storage --confirm --include=clusternetworks
error:      clusternetworks/default: ClusterNetwork "default" is invalid: [network: Invalid value: "": network must be identical to clusterNetworks[0].cidr, hostsubnetlength: Invalid value: 0x0: hostsubnetlength must be identical to clusterNetworks[0].hostSubnetLength]
summary: total=1 errors=1 ignored=0 unchanged=0 migrated=0
info: to rerun only failing resources, add --include=clusternetworks
error: 1 resources failed to migrate


How reproducible:
Always

Steps to Reproduce:

1. steup ocp HA env in aws cluster, upgrade cluster version from v3.7.0-0.143.2 to v3.7.0-0.184.0

==> met error as description

2. run the command by manual, still met error:

# oadm migrate storage --confirm --include=clusternetworks

Actual results:

oadm migrate storage failed in upgrade process

Expected results:

oadm migrate storage succeed in upgrade process

Comment 1 Scott Dodson 2017-10-30 19:56:17 UTC
Perhaps this will be addressed via https://github.com/openshift/openshift-ansible/pull/5864

Comment 2 liujia 2017-10-31 08:49:14 UTC
Hit the issue when do minor version upgrade.

Comment 3 Ben Bennett 2017-10-31 14:53:05 UTC
It would not since the old installer didn't write the new form, and somehow they have the new form (without the old). 


However, this is an upgrade from a dev release to a dev release.

The problem is that we did not require that for some releases in the 3.7 dev cycle we didn't set:

networkConfig:
  clusterNetworkCIDR: 10.128.0.0/14

And we didn't require it.  We realized there were problems with that and changed the code so that it was required.  This would not be an issue for any upgrade from a released version to a released version.

So, I think you need to manually fix it, and it's not really something we need to fix.

Comment 4 ge liu 2017-11-01 06:27:39 UTC
@bbennett, could u help to list the manual steps that could fix this problem? 
and I also think about that online env have requirement that upgrade from a dev release to a dev release, does it met this problem also?
thx

Comment 5 ge liu 2017-11-01 06:37:03 UTC
@bbennett, we have another bug(1489168) need to do upgrade from a dev release to a dev release, so the manual workaround is necessary to us if we want not fix this problem, thx

Comment 6 liujia 2017-11-03 10:52:08 UTC
Verified on openshift-ansible-3.7.0-0.191.0.git.0.bc2ff60.el7.noarch


Do upgrade from v3.7.0-0.190.0 to v3.7.0-0.191.0, upgrade succeed.

Comment 10 errata-xmlrpc 2017-12-18 13:23:26 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3464


Note You need to log in before you can comment on or make changes to this bug.