Bug 2166883 - OVN cluster containers are not updated during update.
Summary: OVN cluster containers are not updated during update.
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-heat-templates
Version: 17.1 (Wallaby)
Hardware: Unspecified
OS: Unspecified
urgent
urgent
Target Milestone: beta
: 17.1
Assignee: Sofer Athlan-Guyot
QA Contact: Archana Singh
URL:
Whiteboard:
: 2182682 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-02-03 10:57 UTC by Sofer Athlan-Guyot
Modified: 2023-08-21 15:54 UTC (History)
5 users (show)

Fixed In Version: openstack-tripleo-heat-templates-14.3.1-1.20230423001017.004ef6e.el9ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-08-16 01:13:41 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Launchpad 2004575 0 None None None 2023-02-03 11:01:07 UTC
OpenStack gerrit 873602 0 None MERGED Add update of OVN cluster container. 2023-04-18 07:19:10 UTC
Red Hat Issue Tracker OSP-22030 0 None None None 2023-02-03 10:59:48 UTC
Red Hat Product Errata RHEA-2023:4577 0 None None None 2023-08-16 01:14:03 UTC

Description Sofer Athlan-Guyot 2023-02-03 10:57:41 UTC
Description of problem: During a openstack overcloud run the all the
overcloud's containers should be updated.

But here the OVN database containers are not updated:

    osp17.0 -> 17.1 update
    [tripleo-admin@controller-1 ~]$ sudo podman ps | grep ovn
    503897e01169 undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhosp17-openstack-ovn-nb-db-server:17.0_20220908.1 kolla_start 3 weeks ago Up 24 hours ovn_cluster_north_db_server
    6723bdc08d43 undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhosp17-openstack-ovn-sb-db-server:17.0_20220908.1 kolla_start 3 weeks ago Up 24 hours ovn_cluster_south_db_server
    bf80025be019 undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhosp17-openstack-ovn-northd:17.0_20220908.1 kolla_start 3 weeks ago Up 24 hours (healthy) ovn_cluster_northd
    8afe1e0aa88f undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhosp17-openstack-nova-novncproxy:17.1_20230130.1 kolla_start 20 hours ago Up 20 hours (healthy) nova_vnc_proxy
    f5fccc07d2ee undercloud-0.ctlplane.redhat.local:8787/rh-osbs/rhosp17-openstack-ovn-controller:17.1_20230130.1 kolla_start 20 hours ago Up 20 hours (healthy) ovn_controller

Afected containers:
 - ovn-nb-db-server;
 - ovn-sb-db-server;
 - ovn-northd

Since wallaby, OVN databases are not under pacemaker control, and uses
the cluster feature of the dbs. So the "pacemaker" code that was
handling the update is not there anymore.

Those containers are bootstrapped using docker_container step 0 which
is not run as part of the common deploy step during the update, which
goes from step 1 to 6.

The configuration is then done, not using puppet but ansible, meaning
we don't have any docker_container step >= 1 to trigger an container
update in the current update framework.

For an update to happen here we must have explicit step in the
"update_tasks" part of the playbook.


Version-Release number of selected component (if applicable): 17.1.


How reproducible: all the time.

We don't have downstream CI failure on this as currently we don't have
code that check that the container have changed. We are working on
adding that check to the CI. Upstream CI doesn't include OVN cluster
database in its setup. We are looking into having that covered as
well.

This was discovered during a manual check by Mikolaj Ciecierski.

Comment 3 Sofer Athlan-Guyot 2023-02-03 18:39:36 UTC
Note, there's a clone of this bug for OSP17 there: Bug #2166883

Comment 4 Sofer Athlan-Guyot 2023-02-08 07:53:55 UTC
Hi, wrong link in the previous for OSP17.0, here it is Bug #2166969

Comment 5 dabarzil 2023-03-29 12:33:10 UTC
*** Bug 2182682 has been marked as a duplicate of this bug. ***

Comment 6 Sofer Athlan-Guyot 2023-04-03 12:23:19 UTC
Review tested and working without any cut in control plane.

Comment 23 errata-xmlrpc 2023-08-16 01:13:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Release of components for Red Hat OpenStack Platform 17.1 (Wallaby)), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2023:4577


Note You need to log in before you can comment on or make changes to this bug.