Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2218455

Summary:	HAProxy fails to restart during update from 16.1.8 to 16.2.5 - task "copy certificate, chgrp, restart haproxy"
Product:	Red Hat OpenStack	Reporter:	Eric Nothen <enothen>
Component:	openstack-tripleo-heat-templates	Assignee:	Luca Miccini <lmiccini>
Status:	CLOSED ERRATA	QA Contact:	Joe H. Rahme <jhakimra>
Severity:	medium	Docs Contact:
Priority:	medium
Version:	16.2 (Train)	CC:	jmarcian, lmiccini, mariel, mburns
Target Milestone:	z6	Keywords:	Triaged
Target Release:	16.2 (Train on RHEL 8.4)
Hardware:	x86_64
OS:	Linux
Whiteboard:
Fixed In Version:	openstack-tripleo-heat-templates-11.6.1-2.20230808225213.9adcac6.el8ost	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2023-11-08 19:19:16 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Eric Nothen 2023-06-29 08:08:41 UTC

Description of problem:

HAproxy fails to restart after update, eventually causing failure of update job in controller-0 when updating from 16.1.8 to 16.2.5

Version-Release number of selected component (if applicable):
16.2.5

How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

At the time HAproxy fails to restart, all of the rpms have been updated in the controller, and all of the new container images have been pre-fetched.

The error is mostly the same:

$ grep FATAL 0020-mistral.tar.gz/var/log/containers/mistral/package_update.log | grep -c "does not exist"
19

~~~
{
  "ansible_loop_var": "item",
  "changed": true,
  "cmd": "set -e\nif podman ps -f \"id=01c2b754fee8\" --format \"{{.Names}}\" | grep -q \"^haproxy-bundle\"; then\n  tar -c /etc/pki/tls/private/overcloud_endpoint.pem | podman exec -i 01c2b754fee8 tar -C / -xv\nelse\n  podman cp /etc/pki/tls/private/overcloud_endpoint.pem 01c2b754fee8:/etc/pki/tls/private/overcloud_endpoint.pem\nfi\npodman exec --user root 01c2b754fee8 chgrp haproxy /etc/pki/tls/private/overcloud_endpoint.pem\npodman kill --signal=HUP 01c2b754fee8\n",
  "delta": "0:00:00.569902",
  "end": "2023-06-28 12:29:29.541934",
  "failed_when_result": true,
  "item": "01c2b754fee8",
  "msg": "non-zero return code",
  "rc": 125,
  "start": "2023-06-28 12:29:28.972032",
  "stderr": "Error: container \"01c2b754fee8\" does not exist",
  "stderr_lines": [
    "Error: container \"01c2b754fee8\" does not exist"
  ],
  "stdout": "",
  "stdout_lines": []
}
~~~

But there's one that's slightly different:
~~~
{
  "ansible_loop_var": "item",
  "changed": true,
  "cmd": "set -e\nif podman ps -f \"id=0185c92bf4a9\" --format \"{{.Names}}\" | grep -q \"^haproxy-bundle\"; then\n  tar -c /etc/pki/tls/private/overcloud_endpoint.pem | podman exec -i 0185c92bf4a9 tar -C / -xv\nelse\n  podman cp /etc/pki/tls/private/overcloud_endpoint.pem 0185c92bf4a9:/etc/pki/tls/private/overcloud_endpoint.pem\nfi\npodman exec --user root 0185c92bf4a9 chgrp haproxy /etc/pki/tls/private/overcloud_endpoint.pem\npodman kill --signal=HUP 0185c92bf4a9\n",
  "delta": "0:00:00.696201",
  "end": "2023-06-28 12:29:58.321678",
  "failed_when_result": true,
  "item": "0185c92bf4a9",
  "msg": "non-zero return code",
  "rc": 255,
  "start": "2023-06-28 12:29:57.625477",
  "stderr": "Error: OCI runtime error: exec failed: container_linux.go:380: starting container process caused: process_linux.go:130: executing setns process caused: exit status 1",
  "stderr_lines": [
    "Error: OCI runtime error: exec failed: container_linux.go:380: starting container process caused: process_linux.go:130: executing setns process caused: exit status 1"
  ],
  "stdout": "",
  "stdout_lines": []
}
~~~

Comment 20 errata-xmlrpc 2023-11-08 19:19:16 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 16.2.6 (Train) bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:6307