Bug 2032139 - paunch doesn't handle containers left in an incomplete status because of timeout
Summary: paunch doesn't handle containers left in an incomplete status because of timeout
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-paunch
Version: 16.1 (Train)
Hardware: Unspecified
OS: Unspecified
Priority: medium
Severity: medium
Target Milestone: z9
Target Release: 16.1 (Train on RHEL 8.2)
Assignee: Takashi Kajinami
QA Contact: nlevinki
URL:
Whiteboard:
Depends On: 2092726
Blocks:
 
Reported: 2021-12-14 06:51 UTC by Takashi Kajinami
Modified: 2022-12-07 20:26 UTC (History)

Fixed In Version: python-paunch-5.3.3-1.20220715123744.ed2c015.el8ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Clones: 2092726
Environment:
Last Closed: 2022-12-07 20:25:32 UTC
Target Upstream Version:
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Launchpad 1955056 0 None None None 2022-01-17 13:39:03 UTC
OpenStack gerrit 837591 0 None stable/train: MERGED paunch: Ensure systemd services are activated (I7a1b5514cb4b80999d2fc884d19b6e62f94ba3c5) 2022-07-15 12:14:07 UTC
Red Hat Issue Tracker OSP-11820 0 None None None 2021-12-14 06:57:14 UTC
Red Hat Product Errata RHBA-2022:8795 0 None None None 2022-12-07 20:26:01 UTC

Description Takashi Kajinami 2021-12-14 06:51:47 UTC
Description of problem:

Currently, paunch looks up the existing container and compares its image ID and CONFIG_HASH (and some additional environment variables, if needed)
to determine whether it should recreate the container.
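
The comparison described above can be sketched roughly as follows. This is an illustrative sketch only, not paunch's actual code: `ContainerState`, `needs_recreate`, and `extra_env_keys` are hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable


@dataclass
class ContainerState:
    """Hypothetical snapshot of a container (not a paunch class)."""
    image_id: str
    env: Dict[str, str] = field(default_factory=dict)


def needs_recreate(existing: ContainerState,
                   desired: ContainerState,
                   extra_env_keys: Iterable[str] = ()) -> bool:
    """Return True when the existing container differs from the desired one.

    Only the image ID, CONFIG_HASH and the optional extra environment
    variables are compared, mirroring the behaviour described in this bug.
    """
    if existing.image_id != desired.image_id:
        return True
    keys = ("CONFIG_HASH",) + tuple(extra_env_keys)
    return any(existing.env.get(k) != desired.env.get(k) for k in keys)
```

Note that a check of this kind says nothing about whether the container's systemd service exists or is running, which is the gap this bug is about.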

When a deployment times out while starting containers, paunch can leave containers in one of the following incomplete states:
 - The container was recreated and registered to systemd, but was not enabled/started
 - The container was recreated, registered to systemd, and enabled, but was not started
 - The container was recreated, but was not registered to systemd.

In such a situation, paunch cannot detect these incomplete containers, and the next deployment completes successfully
without any errors, leaving these containers unfixed.
We should ensure that paunch registers the systemd service and starts these containers properly.
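
A minimal sketch of the kind of check proposed here, mapping each incomplete state to the steps still needed. This is not the merged fix: `probe_unit` and `remediation_steps` are hypothetical helpers, and the unit directory `/etc/systemd/system` is an assumption. Only `systemctl is-enabled` and `systemctl is-active` are real commands; both exit with 0 on success.

```python
import os
import subprocess


def probe_unit(unit, unit_dir="/etc/systemd/system"):
    """Return (registered, enabled, active) for a systemd unit name.

    Assumption: the unit file lives directly under unit_dir.
    """
    registered = os.path.exists(os.path.join(unit_dir, unit))

    def ok(verb):
        # `systemctl is-enabled` / `is-active` exit 0 when the check passes.
        result = subprocess.run(["systemctl", verb, "--quiet", unit])
        return result.returncode == 0

    enabled = registered and ok("is-enabled")
    active = registered and ok("is-active")
    return registered, enabled, active


def remediation_steps(registered, enabled, active):
    """Return the ordered steps needed to bring a container's service up."""
    if not registered:
        # Nothing can be enabled or started until the unit exists.
        return ["register", "enable", "start"]
    steps = []
    if not enabled:
        steps.append("enable")
    if not active:
        steps.append("start")
    return steps
```

For example, `remediation_steps(*probe_unit("example.service"))` would return an empty list for a healthy service and the missing steps otherwise; the unit name here is a placeholder.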


Version-Release number of selected component (if applicable):
python3-paunch-5.3.3-1.20210412123423

Comment 1 Keigo Noha 2022-06-02 06:12:00 UTC
Hi Kajinami-san,

It looks like upstream has merged the change into the ussuri and train branches.
Is it possible to open a downstream gerrit change to proceed with the backport to RHOSP16.x?

Best Regards,
Keigo Noha

Comment 3 Takashi Kajinami 2022-06-02 06:27:02 UTC
(In reply to Keigo Noha from comment #1)
> Hi Kajinami-san,
> 
> It looks like upstream has merged the change into the ussuri and train branches.
> Is it possible to open a downstream gerrit change to proceed with the backport to
> RHOSP16.x?
> 
> Best Regards,
> Keigo Noha

Hi

The fix was merged to upstream stable/train but has not yet been imported to RHOSP16.2.
Once it is imported into RHOSP16.2, we can consider backporting the fix to RHOSP16.1, depending
on the requirement.

I'll use this bug to track the fix in RHOSP16.2, which is targeted to z4 atm.

Comment 5 Takashi Kajinami 2022-06-02 06:45:15 UTC
> I'll use this bug to track the fix in RHOSP16.2, which is targeted to z4 atm.

Instead of that, I've kept this bug for RHOSP16.1 and cloned this bug for RHOSP16.2.

Comment 15 errata-xmlrpc 2022-12-07 20:25:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 16.1.9 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:8795

