Bug 1812238
Summary: | OSP16 ceph update failed: ID ceph-mon-controller-2 found: no such container | ||||||
---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Sofer Athlan-Guyot <sathlang> | ||||
Component: | Ceph-Ansible | Assignee: | Guillaume Abrioux <gabrioux> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Yogev Rabl <yrabl> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 4.0 | CC: | aschoen, ceph-eng-bugs, dsavinea, gfidente, gmeno, nthomas, ykaul | ||||
Target Milestone: | rc | ||||||
Target Release: | 4.2 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2020-04-06 17:03:30 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 1760354 | ||||||
Attachments: |
|
Description
Sofer Athlan-Guyot
2020-03-10 19:51:15 UTC
Created attachment 1669060 [details]
ceph-ansible.tar.xz
command, vars and logs from ceph-ansible run
According to the logs provided by Giulio, the controller-2 node isn't able to join the quorum after the RHCS 4 update. 2020-03-09 16:30:30,496 p=422404 u=root | TASK [container | waiting for the containerized monitor to join the quorum...] *** 2020-03-09 16:30:30,497 p=422404 u=root | task path: /usr/share/ceph-ansible/infrastructure-playbooks/rolling_update.yml:275 2020-03-09 16:30:30,497 p=422404 u=root | Monday 09 March 2020 16:30:30 +0000 (0:00:00.131) 0:06:08.088 ********** 2020-03-09 16:30:31,021 p=422404 u=root | FAILED - RETRYING: container | waiting for the containerized monitor to join the quorum... (5 retries left). 2020-03-09 16:30:46,406 p=422404 u=root | FAILED - RETRYING: container | waiting for the containerized monitor to join the quorum... (4 retries left). 2020-03-09 16:31:01,829 p=422404 u=root | FAILED - RETRYING: container | waiting for the containerized monitor to join the quorum... (3 retries left). 2020-03-09 16:31:17,190 p=422404 u=root | FAILED - RETRYING: container | waiting for the containerized monitor to join the quorum... (2 retries left). 2020-03-09 16:31:32,624 p=422404 u=root | FAILED - RETRYING: container | waiting for the containerized monitor to join the quorum... (1 retries left). 2020-03-09 16:31:48,029 p=422404 u=root | fatal: [controller-2]: FAILED! => changed=true attempts: 5 cmd: - podman - exec - ceph-mon-controller-2 - ceph - --cluster - ceph - -m - 172.17.3.20 - -s - --format - json delta: '0:00:00.089607' end: '2020-03-09 16:31:47.996623' msg: non-zero return code rc: 125 start: '2020-03-09 16:31:47.907016' stderr: 'Error: no container with name or ID ceph-mon-controller-2 found: no such container' stderr_lines: <omitted> stdout: '' stdout_lines: <omitted> Would it be possible to get the ceph-mon-controller-2 container logs ? (or ceph-mon@controller-2 systemd service) |