Bug 2172063 - deployed ceph fails to apply spec
Summary: deployed ceph fails to apply spec
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-tripleoclient
Version: 17.1 (Wallaby)
Hardware: Unspecified
OS: Unspecified
Priority: low
Severity: medium
Target Milestone: ga
Target Release: 17.1
Assignee: Manoj Katari
QA Contact: David Rosenfeld
URL:
Whiteboard:
Depends On:
Blocks: 2193132
Reported: 2023-02-21 10:58 UTC by Manoj Katari
Modified: 2023-08-16 01:14 UTC

Fixed In Version: python-tripleoclient-16.5.1-1.20230505010953.534fe49.el9ost tripleo-ansible-3.3.1-1.20230506001021.b6d4b94.el9ost
Doc Type: Bug Fix
Doc Text:
Before this update, the `openstack overcloud ceph deploy` command could fail during the `apply spec` operation if the chrony NTP service was down. With this update, the chrony NTP service is enabled before the `apply spec` operation.
Clone Of:
Clones: 2193132
Environment:
Last Closed: 2023-08-16 01:13:59 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
OpenStack gerrit | 875051 | 0 | None | master: MERGED | tripleo-operator-ansible: Add ntp parameters to ceph deploy (Ic46154e88245e27d78bf646ef9bd008cb7cabb8c) | 2023-05-17 15:39:10 UTC
OpenStack gerrit | 875053 | 0 | None | master: MERGED | tripleo-quickstart-extras: use ntp parameters during ceph deploy (Ia35d06e2ec693f31e8eadf03fdcc15744858a2a2) | 2023-05-17 15:39:16 UTC
OpenStack gerrit | 875642 | 0 | None | master: MERGED | tripleo-docs: Document Configuring ntp during deployed ceph (Ie5aea0149693aa83cd1c8115c12fc4d395d1bf8b) | 2023-05-17 15:39:22 UTC
OpenStack gerrit | 879300 | 0 | None | stable/wallaby: MERGED | tripleo-ansible: Deployed Ceph time synchronization (I5d4a6a2f141582c18846057237b0858becc6dacd) | 2023-05-17 15:39:28 UTC
OpenStack gerrit | 879301 | 0 | None | stable/wallaby: MERGED | python-tripleoclient: Deployed Ceph time synchronization (I85b692c21269f1bb69dabd54c1c2212cb03e2afe) | 2023-05-17 15:39:34 UTC
Red Hat Issue Tracker | OSP-22540 | 0 | None | None | None | 2023-02-21 10:59:12 UTC
Red Hat Product Errata | RHEA-2023:4577 | 0 | None | None | None | 2023-08-16 01:14:23 UTC

Description Manoj Katari 2023-02-21 10:58:21 UTC
Description of problem:

The 'openstack overcloud ceph deploy' command fails during the cephadm 'Apply spec' task:

FATAL | Apply spec | controller-0 | error={"changed": true, "cmd": ["podman", "run", "--rm", "--net=host", "--ipc=host", "--volume", "/etc/ceph:/etc/ceph:z", "--volume", "/home/ceph-admin/assimilate_ceph.conf:/home/assimilate_ceph.conf:z", "--volume", "/home/ceph-admin/specs/ceph_spec.yaml:/home/ceph_spec.yaml:z", "--entrypoint", "ceph", "undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhceph:5-359", "--fsid", "a31e89b4-6770-5ace-a27d-25c7a6ceca7f", "-c", "/etc/ceph/ceph.conf", "-k", "/etc/ceph/ceph.client.admin.keyring", "orch", "apply", "--in-file", "/home/ceph_spec.yaml"], "delta": "0:00:04.483712", "end": "2023-02-01 09:42:00.286307", "msg": "non-zero return code", "rc": 22, "start": "2023-02-01 09:41:55.802595", "stderr": "Error EINVAL: Host ceph-0 (192.168.24.32) failed check(s): ['No time synchronization is active']", "stderr_lines": ["Error EINVAL: Host ceph-0 (192.168.24.32) failed check(s): ['No time synchronization is active']"], "stdout": "", "stdout_lines": []}
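
For triage, the host and the failed check can be pulled out of the stderr string above. The following is only a minimal sketch: the regular expression is an assumption derived from the single error line quoted in this report, and `parse_failed_checks` is a hypothetical helper, not part of tripleo-ansible or cephadm.

```python
import re

# Assumption: the message layout matches the one stderr line quoted in this
# bug report. Other cephadm versions may word the message differently.
HOST_CHECK_RE = re.compile(
    r"Host (?P<host>\S+) \((?P<addr>[^)]+)\) failed check\(s\): \[(?P<checks>.*)\]"
)


def parse_failed_checks(stderr):
    """Return host, address and failed checks, or None if nothing matches."""
    match = HOST_CHECK_RE.search(stderr)
    if match is None:
        return None
    # Each failed check is a single-quoted string inside the brackets.
    checks = re.findall(r"'([^']*)'", match.group("checks"))
    return {
        "host": match.group("host"),
        "addr": match.group("addr"),
        "checks": checks,
    }


# The stderr value copied from the failure above.
stderr = (
    "Error EINVAL: Host ceph-0 (192.168.24.32) failed check(s): "
    "['No time synchronization is active']"
)
result = parse_failed_checks(stderr)
print(result)
```

Here the only failed check on ceph-0 is the time synchronization one, which is what points at the NTP service rather than at the spec file itself.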



Comment 3 Manoj Katari 2023-05-18 10:11:32 UTC
The FIV you mentioned, "python3-tripleoclient-16.5.1-1.20230421001504.78730a3.el9ost.noarch" (same as https://brewweb.engineering.redhat.com/brew/buildinfo?buildID=2475132), does not have the NTP fix.

We should use a compose with FIV "python-tripleoclient-16.5.1-1.20230505010953.534fe49.el9osttrunk".

Comment 4 Manoj Katari 2023-05-18 11:06:32 UTC
Here is the latest compose, 17.1-RHEL-9/RHOS-17.1-RHEL-9-20230517.n.1, which has the fix.

Comment 8 Filip Hubík 2023-05-23 17:05:35 UTC
I can confirm that the "Fixed in" packages python-tripleoclient-16.5.1-1.20230505010953.534fe49.el9ost and tripleo-ansible-3.3.1-1.20230506001021.b6d4b94.el9ost provide the --ntp-server parameter and that, with it set up correctly, the deployment passed this stage. I am not sure whether the feature really delivers working time synchronization between nodes; that would require deeper checking with more context.
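
One possible way to do the deeper check mentioned above is to read the "Leap status" field from `chronyc tracking` on each node. This is a rough sketch, assuming chrony is the time service in use on the nodes. The helper names and the two sample outputs are illustrative and were not captured from this environment.

```python
import subprocess


def leap_status(tracking_output):
    """Return the value of the 'Leap status' field, or None if it is absent."""
    for line in tracking_output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "Leap status":
            return value.strip()
    return None


def is_synchronised(tracking_output):
    """chrony reports 'Not synchronised' when it has no usable time source."""
    status = leap_status(tracking_output)
    return status is not None and status != "Not synchronised"


def read_tracking():
    """Run 'chronyc tracking' on the local node (needs chrony installed)."""
    completed = subprocess.run(
        ["chronyc", "tracking"], capture_output=True, text=True, check=True
    )
    return completed.stdout


# Illustrative, shortened outputs; NOT taken from the nodes in this bug.
SYNCED_SAMPLE = """Reference ID    : C0A80001 (ntp.example.com)
Stratum         : 3
Leap status     : Normal
"""
UNSYNCED_SAMPLE = """Reference ID    : 00000000 ()
Stratum         : 0
Leap status     : Not synchronised
"""

print(is_synchronised(SYNCED_SAMPLE))
print(is_synchronised(UNSYNCED_SAMPLE))
```

On a real node, `is_synchronised(read_tracking())` would need to be run on every Ceph host, since the check that failed in this bug is evaluated per host.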

Comment 19 errata-xmlrpc 2023-08-16 01:13:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Release of components for Red Hat OpenStack Platform 17.1 (Wallaby)), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2023:4577
