Created attachment 1360646 [details] the ceph upgrade log Description of problem: The upgrade failed with when ceph-ansible failed to start rgw container. The error is: fatal: [192.168.24.13]: FAILED! => {"changed": false, "failed": true, "msg": "AnsibleUndefinedVariable: No first item, sequence was empty."} The rest of the ceph cluster was upgraded and containerized. Version-Release number of selected component (if applicable): ceph-ansible-3.0.14-1.el7cp.noarch puppet-tripleo-7.4.3-10.el7ost.noarch openstack-tripleo-common-containers-7.6.3-6.el7ost.noarch openstack-tripleo-puppet-elements-7.0.1-2.el7ost.noarch openstack-tripleo-validations-7.4.2-1.el7ost.noarch python-tripleoclient-7.3.3-6.el7ost.noarch openstack-tripleo-heat-templates-7.0.3-16.el7ost.noarch openstack-tripleo-ui-7.4.3-4.el7ost.noarch openstack-tripleo-common-7.6.3-6.el7ost.noarch openstack-tripleo-image-elements-7.0.1-1.el7ost.noarch How reproducible: 100% Steps to Reproduce: 1. Deploy an overcloud with dedicated node for rados gateway in OSP11 2. upgrade it to OSP12 Actual results: The deployment failed with the rados gateway fail to start in a container. Expected results: All of the ceph cluster is running on containers Additional info:
Is this related with https://bugzilla.redhat.com/show_bug.cgi?id=1519055 ?? Hi Giulio! Can you give us more information about this issue?
Looks like a real issue in ceph-ansible, moving to Ceph product
Is there a reproducer outside of OSP?
Looks like a blocker for 2.5, but OSPd-driven Ceph 3 upgrade from 11 to 12 is not a valid upgrade path for 3.0. Giulio/Yogev please explain if you disagree with the above. My guess is you set target=3 because this is a Ceph-Ansible 3 issue. Setting target=2.5
It looks like we have a similar issue with mons in BZ #1519055 ; probably a duplicate
Nothing has be done on this. I'm the assignee but I won't be working on this. We need to triage this and assign it to someone else and then move this to ASSIGNED.
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days