Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1609008

Summary: [FFU] Rhos 10->13: ffwd-converge step fails with no ceph-mon container error
Product: Red Hat OpenStack Reporter: Archit Modi <amodi>
Component: openstack-mistralAssignee: RHOS Maint <rhos-maint>
Status: CLOSED NOTABUG QA Contact: nlevinki <nlevinki>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 13.0 (Queens)CC: amodi, jfrancoa, jjoyce, johfulto, jschluet, jstransk, pgrist, rbrady, slinaber, tvignaud
Target Milestone: ---Keywords: ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-09-25 00:24:40 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
tail executor.log none

Description Archit Modi 2018-07-26 17:26:48 UTC
Created attachment 1470845 [details]
tail executor.log

Description of problem: While running ffwd-converge step manually, the step completed with UPDATE_FAILED and error found in /var/log/mistral/* as attached

Steps to Reproduce:
1. Deploy rhos-10 with 3 controller, 2 compute and 3 ceph and start ffwd-upgrade process
2. At the last step of ffwd-upgrade process, run overcloud_upgrade_converge.sh
3. Stack overcloud UPDATE_FAILED with error

Actual results:
 Stack overcloud UPDATE_FAILED

overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:
  resource_type: OS::TripleO::WorkflowSteps
  physical_resource_id: 2f7a06ce-d698-4ec4-afc6-9fe4f115ec21
  status: CREATE_FAILED
  status_reason: |
    resources.WorkflowTasks_Step2_Execution: ERROR
Heat Stack update failed.
Heat Stack update failed.

Expected results:
UPDATE_COMPLETE/Success

Additional info:
Controller-0 was found to be offline

Comment 6 Jiri Stransky 2018-08-01 12:14:47 UTC
Error response from daemon: No such container: ceph-mon-controller-2

The error is in ceph-ansible output ^. I wonder if Ceph was upgraded or not before we tried to do converge? I mean the `ceph-upgrade run` command in particular.

https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/13/html/fast_forward_upgrades/assembly-upgrading_the_overcloud#upgrading_all_ceph_storage_nodes

Comment 13 Paul Grist 2018-09-25 00:24:40 UTC
This one is no longer reproducible and may have been caused by an environmental issue with memory resources.

Thanks for the follow up Archit.