Created attachment 1360646[details]
the ceph upgrade log
Description of problem:
The upgrade failed with when ceph-ansible failed to start rgw container. The error is:
fatal: [192.168.24.13]: FAILED! => {"changed": false, "failed": true, "msg": "AnsibleUndefinedVariable: No first item, sequence was empty."}
The rest of the ceph cluster was upgraded and containerized.
Version-Release number of selected component (if applicable):
ceph-ansible-3.0.14-1.el7cp.noarch
puppet-tripleo-7.4.3-10.el7ost.noarch
openstack-tripleo-common-containers-7.6.3-6.el7ost.noarch
openstack-tripleo-puppet-elements-7.0.1-2.el7ost.noarch
openstack-tripleo-validations-7.4.2-1.el7ost.noarch
python-tripleoclient-7.3.3-6.el7ost.noarch
openstack-tripleo-heat-templates-7.0.3-16.el7ost.noarch
openstack-tripleo-ui-7.4.3-4.el7ost.noarch
openstack-tripleo-common-7.6.3-6.el7ost.noarch
openstack-tripleo-image-elements-7.0.1-1.el7ost.noarch
How reproducible:
100%
Steps to Reproduce:
1. Deploy an overcloud with dedicated node for rados gateway in OSP11
2. upgrade it to OSP12
Actual results:
The deployment failed with the rados gateway fail to start in a container.
Expected results:
All of the ceph cluster is running on containers
Additional info:
Looks like a real issue in ceph-ansible, moving to Ceph product
Comment 7Ken Dreyer (Red Hat)
2017-11-30 20:17:44 UTC
Is there a reproducer outside of OSP?
Comment 8Federico Lucifredi
2017-11-30 23:26:48 UTC
Looks like a blocker for 2.5, but OSPd-driven Ceph 3 upgrade from 11 to 12 is not a valid upgrade path for 3.0.
Giulio/Yogev please explain if you disagree with the above. My guess is you set target=3 because this is a Ceph-Ansible 3 issue.
Setting target=2.5
Nothing has be done on this. I'm the assignee but I won't be working on this. We need to triage this and assign it to someone else and then move this to ASSIGNED.
Comment 24Red Hat Bugzilla
2023-09-14 04:12:54 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days
Created attachment 1360646 [details] the ceph upgrade log Description of problem: The upgrade failed with when ceph-ansible failed to start rgw container. The error is: fatal: [192.168.24.13]: FAILED! => {"changed": false, "failed": true, "msg": "AnsibleUndefinedVariable: No first item, sequence was empty."} The rest of the ceph cluster was upgraded and containerized. Version-Release number of selected component (if applicable): ceph-ansible-3.0.14-1.el7cp.noarch puppet-tripleo-7.4.3-10.el7ost.noarch openstack-tripleo-common-containers-7.6.3-6.el7ost.noarch openstack-tripleo-puppet-elements-7.0.1-2.el7ost.noarch openstack-tripleo-validations-7.4.2-1.el7ost.noarch python-tripleoclient-7.3.3-6.el7ost.noarch openstack-tripleo-heat-templates-7.0.3-16.el7ost.noarch openstack-tripleo-ui-7.4.3-4.el7ost.noarch openstack-tripleo-common-7.6.3-6.el7ost.noarch openstack-tripleo-image-elements-7.0.1-1.el7ost.noarch How reproducible: 100% Steps to Reproduce: 1. Deploy an overcloud with dedicated node for rados gateway in OSP11 2. upgrade it to OSP12 Actual results: The deployment failed with the rados gateway fail to start in a container. Expected results: All of the ceph cluster is running on containers Additional info: