Bug 1909011

Summary: [ceph-ansible] : switch from rpm to containerized - waiting for quorum task failing saying container not found despite container exists
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Vasishta <vashastr>
Component: Ceph-AnsibleAssignee: Guillaume Abrioux <gabrioux>
Status: CLOSED ERRATA QA Contact: Manasa <mgowri>
Severity: high Docs Contact:
Priority: unspecified    
Version: 4.2CC: aschoen, ceph-eng-bugs, gmeno, mgowri, nthomas, tserlin, ykaul
Target Milestone: ---   
Target Release: 4.2z2   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-ansible-4.0.54-1.el8cp, ceph-ansible-4.0.54-1.el7cp Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-06-15 17:13:09 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Vasishta 2020-12-18 07:43:35 UTC
Description of problem:
"waiting for the monitor to join the quorum" Task failing saying No such container: ceph-mon-magna072 despite container exists post playbook failure. We might have to induce some delay before checking quorum status after starting monitor service

Version-Release number of selected component (if applicable):
ceph-ansible-4.0.40-1.el7cp.noarch

How reproducible:
Tried thrice on same setup, facing thrice

Steps to Reproduce:
1. Configure baremetal cluster 
2. Try run switch-from-non-containerized-to-containerized-ceph-daemons.yml


Actual results:
<magna072> (1, '\n{"changed": true, "end": "2020-12-18 07:20:35.682361", "stdout": "", "cmd": ["docker", "exec", "ceph-mon-magna072", "ceph", "--cluster", "ceph", "quorum_status", "--format", "json"], "failed": true, "delta": "0:00:00.034791", "stderr": "Error response from daemon: No such container: ceph-mon-magna072",

Expected results:
Playbook should not fail if monitor is up ans running

Additional info:
BTW monitor was already containerized in previous run I'm not sure whether playbook should have skipped restarting the monitor

Comment 3 Guillaume Abrioux 2021-04-13 12:58:42 UTC
*** Bug 1945030 has been marked as a duplicate of this bug. ***

Comment 11 errata-xmlrpc 2021-06-15 17:13:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat Ceph Storage 4.2 Security and Bug Fix Update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2445