Bug 1877815

Summary: [osp13][update] Ceph update run fail during z12 to phase2 update during "inspect ceph mon container"
Product: Red Hat OpenStack
Reporter: Sofer Athlan-Guyot <sathlang>
Component: openstack-tripleo-heat-templates
Assignee: Francesco Pantano <fpantano>
Status: CLOSED ERRATA
QA Contact: David Rosenfeld <drosenfe>
Severity: urgent
Priority: urgent
Version: 13.0 (Queens)
CC: astupnik, dsavinea, gfidente, jpretori, mburns, spower
Target Milestone: z13
Keywords: Reopened, Triaged, ZStream
Target Release: 13.0 (Queens)
Hardware: Unspecified
OS: Unspecified
Fixed In Version: openstack-tripleo-heat-templates-8.4.1-68.el7ost
Doc Type: No Doc Update
Last Closed: 2020-10-28 18:24:14 UTC
Type: Bug
Bug Depends On: 1846830

Description Sofer Athlan-Guyot 2020-09-10 13:57:01 UTC
Description of problem: Update of OSP13z12 to passed_phase2.

We had already completed the update run for all roles, and during the "openstack overcloud ceph-upgrade run" step we hit this error:

2020-09-09 05:17:26 | 2020-09-09 09:13:29Z [qe-Cloud-0-AllNodesDeploySteps-wn7sduykql3z.WorkflowTasks_Step2_Execution]: CREATE_IN_PROGRESS  state changed
2020-09-09 05:17:26 | 2020-09-09 09:17:16Z [qe-Cloud-0-AllNodesDeploySteps-wn7sduykql3z.WorkflowTasks_Step2_Execution]: CREATE_FAILED  resources.WorkflowTasks_Step2_Execution: Failure caused by error in tasks: ceph_base_ansible_workflow
2020-09-09 05:17:26 | 
2020-09-09 05:17:26 |   ceph_base_ansible_workflow [task_ex_id=eff1eefe-0567-4d87-9270-dfbeace69f9a] -> Failure caused by error in tasks: ceph_install
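
For triage, the chain of failing workflow tasks can be pulled out of a log like the one above with a few lines of Python. This is only a minimal sketch: the function name `failed_task_chain` is hypothetical, and the pattern assumes the exact "Failure caused by error in tasks: <name>" wording seen in this excerpt; other wordings would not match.

```python
import re

# Assumption: failures are reported with exactly this wording, as in the excerpt above.
FAILURE_RE = re.compile(r"Failure caused by error in tasks: (\S+)")


def failed_task_chain(log_text):
    """Return the failed task names in order of appearance, without duplicates."""
    chain = []
    for name in FAILURE_RE.findall(log_text):
        if name not in chain:
            chain.append(name)
    return chain


# Two lines copied from the log excerpt in this report.
sample = """\
2020-09-09 05:17:26 | 2020-09-09 09:17:16Z [qe-Cloud-0-AllNodesDeploySteps-wn7sduykql3z.WorkflowTasks_Step2_Execution]: CREATE_FAILED  resources.WorkflowTasks_Step2_Execution: Failure caused by error in tasks: ceph_base_ansible_workflow
2020-09-09 05:17:26 |   ceph_base_ansible_workflow [task_ex_id=eff1eefe-0567-4d87-9270-dfbeace69f9a] -> Failure caused by error in tasks: ceph_install
"""

print(failed_task_chain(sample))
```

On the two lines above this yields the outer workflow task first and the inner `ceph_install` task second, which matches the nesting shown in the log.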


In the Ansible output it seems to manifest like this, although this looks like partial output:


 "TASK [ceph-docker-common : inspect ceph mon container] *************************"

            \\"IPv6Gateway\\": \\"\\",\
            \\"MacAddress\\": \\"\\",\
            \\"Networks\\": {\
                \\"host\\": {\
                    \\"IPAMConfig\\": null,\
                    \\"Links\\": null,\
                    \\"Aliases\\": null,\
                    \\"NetworkID\\": \\"08c91309f2cbefbdbdc567f29b0654e8f2c1ff640fedba06564e2b4cdb3e3578\\",\
                    \\"EndpointID\\": \\"75b399b7fa547e3eb6ce346c615b491da9738bf5b735286260f08ad5256f5172\\",\
                    \\"Gateway\\": \\"\\",\
                    \\"IPAddress\\": \\"\\",\
                    \\"IPPrefixLen\\": 0,\
                    \\"IPv6Gateway\\": \\"\\",\
                    \\"GlobalIPv6Address\\": \\"\\",\
                    \\"GlobalIPv6PrefixLen\\": 0,\\Heat Stack update failed.
2020-09-08 23:22:39 | Heat Stack update failed.
2020-09-08 23:22:39 | n                    \\"MacAddress\\": \\"\\"\
                }\
            }\
        }\
    }\
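
The escaped fragment above is easier to read once the `\\"` sequences and the trailing backslashes are removed. The following is a rough sketch of such a cleanup, assuming the escaping is exactly what appears in this paste (a doubled backslash before each quote and a single backslash ending each line); the helper name `unescape_line` is hypothetical. It does not try to repair the truncated or interleaved parts of the output.

```python
def unescape_line(line):
    """Undo the escaping seen in the pasted Ansible output for one line."""
    line = line.rstrip()
    # Drop the single backslash that terminates each pasted line.
    if line.endswith("\\"):
        line = line[:-1]
    # Turn every backslash-backslash-quote sequence back into a plain quote.
    return line.replace('\\\\"', '"')


# Two lines copied from the excerpt; the raw string keeps backslashes literally.
sample = r'''
                    \\"IPPrefixLen\\": 0,\
                    \\"MacAddress\\": \\"\\"\
'''

for raw_line in sample.splitlines():
    if raw_line.strip():
        print(unescape_line(raw_line))
```

Because the captured output is incomplete, the cleaned lines are meant for reading only and will not parse as a full JSON document.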


Does this ring a bell, and could you help evaluate the severity of the issue?

This issue doesn't manifest in z11, z10 ...

Comment 3 Yogev Rabl 2020-09-14 13:23:38 UTC

*** This bug has been marked as a duplicate of bug 1846830 ***

Comment 16 errata-xmlrpc 2020-10-28 18:24:14 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 13.0 director bug fix advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4388

Comment 17 Alex Stupnikov 2020-12-30 12:35:56 UTC
It looks like the fix for this bug introduced a regression; bug #1911620 was reported.