Bug 1896099

Summary: OSP16.2 tripleo_heat_engine healthcheck failure
Product: Red Hat OpenStack
Reporter: David Rosenfeld <drosenfe>
Component: openstack-tripleo-common
Assignee: Alex Schultz <aschultz>
Status: CLOSED ERRATA
QA Contact: David Rosenfeld <drosenfe>
Severity: medium
Docs Contact:
Priority: medium
Version: 16.2 (Train)
CC: aschultz, mburns, slinaber
Target Milestone: ---
Keywords: Triaged
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version: openstack-tripleo-common-11.4.1-2.20201113004920.883b01e.el8ost
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2021-09-15 07:09:58 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description David Rosenfeld 2020-11-09 18:01:28 UTC
Description of problem: The undercloud tripleo_heat_engine_healthcheck.service fails after the undercloud is installed.

sudo systemctl list-units --failed --plain --no-legend --no-pager |grep "healthcheck.service" |awk '{print $1}' | sort
tripleo_heat_engine_healthcheck.service
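
For repeated checks, the pipeline above can be wrapped in a small shell function. This is only a sketch: the function name failed_healthchecks is hypothetical, and it reads the systemctl listing on stdin so that it can also be tried against saved output.

```shell
#!/bin/sh
# Hypothetical helper (not part of TripleO): print the names of failed
# healthcheck units, one per line, sorted.
# Expects the output of
#   systemctl list-units --failed --plain --no-legend --no-pager
# on stdin, where the first column is the unit name.
failed_healthchecks() {
    awk '$1 ~ /healthcheck\.service$/ { print $1 }' | sort
}

# Usage on an undercloud (requires systemd, so it is left commented out):
# sudo systemctl list-units --failed --plain --no-legend --no-pager | failed_healthchecks
```

Matching on the first column only avoids picking up "healthcheck.service" from a unit description, which the plain grep would also match.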


Version-Release number of selected component (if applicable): RHOS-16.2-RHEL-8-20201105.n.0


How reproducible: Seen in all undercloud deployments.


Steps to Reproduce:
1. Deploy an undercloud.
2. Execute: sudo systemctl list-units --failed --plain --no-legend --no-pager |grep "healthcheck.service" |awk '{print $1}' | sort

Actual results: tripleo_heat_engine_healthcheck fails


Expected results: tripleo_heat_engine_healthcheck does not fail


Additional info:

Comment 1 Alex Schultz 2020-11-09 18:57:50 UTC
Nov 07 23:14:41 undercloud-0.redhat.local healthcheck_heat_engine[46821]: Error: exec failed: container_linux.go:349: starting container process caused "exec: \"/openstack/healthcheck\": stat /openstack/healthcheck: no such file or directory": OCI runtime command not found error


It looks like the healthcheck script (/openstack/healthcheck) is missing from the container image.
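
One way to confirm this directly is to test for the script inside the running container. The sketch below makes assumptions the report does not state: that the container runtime is podman, and that the container is named heat_engine (inferred from the healthcheck_heat_engine log tag). The function name has_healthcheck and the CONTAINER_CLI variable are hypothetical.

```shell
#!/bin/sh
# Assumption: podman is the container runtime; override CONTAINER_CLI if not.
CONTAINER_CLI="${CONTAINER_CLI:-podman}"

# Hypothetical helper: succeed if /openstack/healthcheck exists and is
# executable inside the container named by $1.
has_healthcheck() {
    "$CONTAINER_CLI" exec "$1" test -x /openstack/healthcheck
}

# Usage, run as root on the undercloud (container name is an assumption):
# if has_healthcheck heat_engine; then
#     echo "healthcheck script present"
# else
#     echo "healthcheck script missing"
# fi
```

A non-zero result here would be consistent with the "no such file or directory" error in the journal line above, although it could also mean the container is not running.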

Comment 4 David Rosenfeld 2021-05-24 13:51:24 UTC
The tripleo_heat_engine_healthcheck has been fixed:

sudo systemctl list-units --failed --plain --no-legend --no-pager |grep "healthcheck.service" |awk '{print $1}' | sort
tripleo_ironic_inspector_dnsmasq_healthcheck.service
tripleo_memcached_healthcheck.service

The two failed healthchecks are being handled by different BZs. Moving to verified.
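
A verification like this can be scripted by checking whether one specific unit still appears in the failed list. A minimal sketch, assuming the same systemctl listing format as above; the function name unit_is_failed is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: succeed (exit 0) if the unit named by $1 appears in
# the first column of a failed-unit listing read from stdin, else exit 1.
unit_is_failed() {
    awk -v unit="$1" '$1 == unit { found = 1 } END { exit !found }'
}

# Usage on an undercloud (requires systemd, so it is left commented out):
# if sudo systemctl list-units --failed --plain --no-legend --no-pager \
#         | unit_is_failed tripleo_heat_engine_healthcheck.service; then
#     echo "still failing"
# else
#     echo "no longer in the failed list"
# fi
```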

Comment 6 errata-xmlrpc 2021-09-15 07:09:58 UTC
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform (RHOSP) 16.2 enhancement advisory), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2021:3483