Description of problem: While performing an update to 16.2z3 we experienced a 20 second blackout of the OVN dataplane while performing the OVN update step (openstack overcloud external-update run --stack overcloud --tags ovn). The outage happened on all compute nodes at the same time, breaking clusters hosted on the overcloud. Documentation: https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/16.2/html-single/keeping_red_hat_openstack_platform_updated/index """ 3.3. Optional: Updating the ovn-controller container on all overcloud servers If you deployed your overcloud with the Modular Layer 2 Open Virtual Network mechanism driver (ML2/OVN), update the ovn-controller container to the latest RHOSP 16.2 version. The update occurs on every overcloud server that runs the ovn-controller container. Important The following procedure updates the ovn-controller containers on servers that are assigned the Compute role before it updates the ovn-northd service on servers that are assigned the Controller role. """ But it seems that ovn-controller running on Controllers were updated too, it seems the ovsdb-server where affected as well Version-Release number of selected component (if applicable): OSP 16.2 How reproducible: Unsure Steps to Reproduce: 1. # openstack overcloud external-update run --stack overcloud --tags ovn 2. 3. Actual results: Outage for 20 seconds Expected results: no outage
Updating status here, based on yesterday's information, this BZ should include https://review.opendev.org/c/openstack/tripleo-heat-templates/+/860473/ to allow for automatic update, otherewise manual intervention is needed
I've replaced that patch with 3 others that are in the review process: https://review.opendev.org/q/topic:ovn-ofctrl-wait-before-clear Only the THT and puppet-ovn are technically required for now. The ansible one is for keeping feature parity.
Thanks for the update Terry - good to have the topic to track all 3 of them (and backports)
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Release of components for Red Hat OpenStack Platform 16.2.4), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2022:8794