Red Hat Bugzilla – Bug 1590560
ceph upgrade/deployment fails with "Error response from daemon: No such container: ceph-create-keys"
Last modified: 2018-07-09 13:46:17 EDT
Description of problem:
FFU: ceph upgrade fails during the fast forward process with "Error response from daemon: No such container: ceph-create-keys"

Version-Release number of selected component (if applicable):
ceph-ansible-3.1.0-0.1.rc8.el7cp.noarch

How reproducible:
100%

Steps to Reproduce:
1. Deploy OSP10 with 3 controllers + 2 compute + 3 ceph osd nodes
2. Run through the fast forward upgrade procedure
3. Run the ceph upgrade step:

openstack overcloud ceph-upgrade run \
    --templates /usr/share/openstack-tripleo-heat-templates \
    --stack qe-Cloud-0 \
    -e /usr/share/openstack-tripleo-heat-templates/environments/cinder-backup.yaml \
    -e /usr/share/openstack-tripleo-heat-templates/environments/services/sahara.yaml \
    -e /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml \
    -e /home/stack/virt/internal.yaml \
    -e /usr/share/openstack-tripleo-heat-templates/environments/network-isolation.yaml \
    -e /home/stack/virt/network/network-environment.yaml \
    -e /home/stack/virt/enable-tls.yaml \
    -e /home/stack/virt/inject-trust-anchor.yaml \
    -e /home/stack/virt/public_vip.yaml \
    -e /usr/share/openstack-tripleo-heat-templates/environments/tls-endpoints-public-ip.yaml \
    -e /home/stack/virt/hostnames.yml \
    -e /home/stack/virt/debug.yaml \
    -e /home/stack/cli_opts_params.yaml \
    -e /home/stack/ceph-ansible-env.yaml \
    --ceph-ansible-playbook '/usr/share/ceph-ansible/infrastructure-playbooks/switch-from-non-containerized-to-containerized-ceph-daemons.yml,/usr/share/ceph-ansible/infrastructure-playbooks/rolling_update.yml' \
    --container-registry-file /home/stack/virt/docker-images.yaml

Actual results:
Fails

Expected results:
Completes fine.

Additional info:
Attaching ceph-install-workflow.log.
Note that this also shows up during an OSP13 minor update, where the workaround of downgrading to ceph-ansible-3.1.0-0.1.rc3 worked:

sudo yum downgrade -y http://download.eng.bos.redhat.com/brewroot/packages/ceph-ansible/3.1.0/0.1.rc3.el7cp/noarch/ceph-ansible-3.1.0-0.1.rc3.el7cp.noarch.rpm
We believe this can be hit on any ceph-ansible run (including fresh deployments) with more than one compute node (or more than one node in a custom role that behaves as a ceph client, i.e. hosts a service consuming ceph).
Verified on ceph-ansible-3.1.0-0.1.rc9.el7cp.noarch.
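For anyone triaging a similar failure, a quick check of the installed ceph-ansible build against the builds named in this report may help. The following is only a sketch: the helper names `rc_number` and `check_build` are hypothetical, the classification covers just the 3.1.0 release candidates mentioned here (rc8 failing, rc3 used as a workaround, rc9 verified), and treating builds later than rc9 as fixed is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: pull the release-candidate number out of a
# ceph-ansible package name, e.g.
#   ceph-ansible-3.1.0-0.1.rc8.el7cp.noarch -> 8
# Prints nothing if the name has no ".rcN." component.
rc_number() {
    printf '%s\n' "$1" | sed -n 's/.*\.rc\([0-9][0-9]*\)\..*/\1/p'
}

# Hypothetical helper: classify a build using only what this report states.
#   rc8          -> affected      (build the bug was reported against)
#   rc3          -> workaround    (downgrade target that worked)
#   rc9 or later -> fix-verified  (rc9 verified; later builds are an assumption)
#   anything else -> unknown
check_build() {
    cb_rc=$(rc_number "$1")
    if [ -z "$cb_rc" ]; then
        echo "unknown"
    elif [ "$cb_rc" -ge 9 ]; then
        echo "fix-verified"
    elif [ "$cb_rc" -eq 8 ]; then
        echo "affected"
    elif [ "$cb_rc" -eq 3 ]; then
        echo "workaround"
    else
        echo "unknown"
    fi
}

# Usage on the undercloud (requires rpm, so left commented out here):
#   check_build "$(rpm -q ceph-ansible)"
```

A result of "unknown" says nothing either way; it only means the build is not one this report discusses.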
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2018:2086