Bug 2038897 - [RHOSP16.2] [DCN] [STF] metrics_qdr containers failed to start with bind address error [NEEDINFO]
Summary: [RHOSP16.2] [DCN] [STF] metrics_qdr containers failed to start with bind addr...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-heat-templates
Version: 16.2 (Train)
Hardware: All
OS: Linux
Priority: high
Severity: urgent
Target Milestone: z2
Target Release: 16.2 (Train on RHEL 8.4)
Assignee: Ollie Walsh
QA Contact: Marian Krcmarik
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2022-01-10 13:52 UTC by Meiyan Zheng
Modified: 2022-12-01 10:04 UTC (History)

Fixed In Version: openstack-tripleo-heat-templates-11.6.1-2.20220116004909.64b2e88.el8ost
Doc Type: Bug Fix
Doc Text:
This update fixes a bug that caused the `metrics_qdr` service to appear in the HAProxy configuration of a Service Telemetry Framework (STF) deployment on a distributed compute node (DCN).
Clone Of:
Environment:
Last Closed: 2022-03-23 22:30:16 UTC
Target Upstream Version:
Embargoed:
jamsmith: needinfo? (leif.hortlund)



Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 824145 0 None master: MERGED tripleo-heat-templates: Disable metrics_qdr binding in HAProxyEdge (I5f3b678b30f6375844c0a4b094432c9be9d8e444) 2022-02-07 21:40:14 UTC
OpenStack gerrit 824254 0 None stable/train: MERGED tripleo-heat-templates: Disable metrics_qdr binding in HAProxyEdge (I5f3b678b30f6375844c0a4b094432c9be9d8e444) 2022-02-07 21:40:20 UTC
Red Hat Issue Tracker OSP-12077 0 None None None 2022-01-10 13:55:33 UTC
Red Hat Product Errata RHSA-2022:0995 0 None None None 2022-03-23 22:30:41 UTC

Description Meiyan Zheng 2022-01-10 13:52:03 UTC
Description of problem:

RHOSP 16.2 DCN with storage (Ceph), integrated with STF

  3 x DistributedComputeHCI
  7 x DistributedComputeHCIScaleOut  

The metrics_qdr service is configured in haproxy.cfg on DistributedComputeHCIScaleOut, so haproxy listens on 0.0.0.0:5666. As a result, the metrics_qdr container fails to start with a bind address error.
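
The bind conflict can be reproduced in isolation. The sketch below is an illustration only, not taken from the deployment: it assumes Linux-like socket semantics, uses a kernel-assigned port instead of 5666, and the function name is hypothetical. One socket listens on the wildcard address, as a `bind :5666` line in haproxy.cfg would, and a second socket then tries to bind a specific address on the same port.

```python
import errno
import socket


def wildcard_bind_blocks_specific(host="127.0.0.1"):
    """Return the errno raised when a second socket binds host:port while
    another socket already listens on 0.0.0.0:port, or None on success."""
    first = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    second = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        # Port 0 lets the kernel pick a free port, so the sketch does not
        # depend on 5666 being available on the machine running it.
        first.bind(("0.0.0.0", 0))
        first.listen(1)
        port = first.getsockname()[1]
        try:
            second.bind((host, port))
        except OSError as exc:
            return exc.errno
        return None
    finally:
        first.close()
        second.close()


if __name__ == "__main__":
    err = wildcard_bind_blocks_specific()
    print(errno.errorcode.get(err, err))
```

The second bind is expected to be refused because the wildcard listener already covers every local IPv4 address on that port.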

Here is the configuration in haproxy.cfg on the DistributedComputeHCIScaleOut node:
      ============================
      ~
      listen metrics_qdr
        bind :5666 transparent
        option tcp-check
        option tcplog
        tcp-check connect port 5666
        server controller01~
        server controller02~
        server controller03~
      ============================

According to /usr/share/openstack-tripleo-heat-templates/deployment/haproxy/haproxy-edge-container-puppet.yaml, it seems that only the glance-api service should be set in haproxy.cfg, but the metrics_qdr service is also configured in haproxy.cfg on the DistributedComputeHCIScaleOut node.
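
To check a node for the unexpected section, a small parser can list which haproxy sections bind a given port. This is a minimal sketch, not a tool shipped with RHOSP: the function name is hypothetical, the `glance_api` entry and its 192.0.2.10:9292 address in the sample are made up for illustration, and only the `metrics_qdr` lines mirror the excerpt above.

```python
def sections_binding_port(cfg_text, port):
    """Return the names of haproxy sections that contain a bind on `port`."""
    section_keywords = {"global", "defaults", "listen", "frontend", "backend"}
    matches = []
    current = None
    for raw in cfg_text.splitlines():
        # Drop trailing comments, then split the line into tokens.
        tokens = raw.split("#", 1)[0].split()
        if not tokens:
            continue
        if tokens[0] in section_keywords:
            current = tokens[1] if len(tokens) > 1 else tokens[0]
            continue
        if tokens[0] == "bind" and len(tokens) > 1 and current is not None:
            # The address may look like ":5666", "*:5666" or "192.0.2.10:5666".
            bind_port = tokens[1].rsplit(":", 1)[-1]
            if bind_port == str(port) and current not in matches:
                matches.append(current)
    return matches


# Hypothetical sample; on a real node, read the node's haproxy.cfg instead.
SAMPLE = """\
listen glance_api
  bind 192.0.2.10:9292 transparent
listen metrics_qdr
  bind :5666 transparent
  option tcp-check
"""

if __name__ == "__main__":
    print(sections_binding_port(SAMPLE, 5666))
```

An empty result for port 5666 on a DistributedComputeHCIScaleOut node would indicate that haproxy no longer claims the port.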


Version-Release number of selected component (if applicable):

RHOSP 16.2


How reproducible:


Steps to Reproduce:
1. Disable the public virtual IP.
2. Deploy RHOSP 16.2 integrated with STF in a DCN environment.
 

Actual results:

The metrics_qdr service is configured in haproxy.cfg on DistributedComputeHCIScaleOut, so haproxy listens on 0.0.0.0:5666. As a result, the metrics_qdr container fails to start with a bind address error.


Expected results:

The metrics_qdr container should start without problems.
metrics_qdr should not be configured in haproxy.cfg on the DistributedComputeHCIScaleOut node.

Additional info:

Comment 1 Donghwi Cha 2022-01-10 21:51:49 UTC
Hi Meiyan. Thanks for creating this BZ on my behalf.

I have included Oliver Walsh, who helped by providing a workaround for this issue:

parameter_defaults:
    DistributedComputeScaleOutExtraConfig:
        tripleo::haproxy::metrics_qdr: false
    DistributedComputeHCIScaleOutExtraConfig:
        tripleo::haproxy::metrics_qdr: false


According to his insight, the issue is in haproxy-edge-container-puppet.yaml, at the link below:

>> https://github.com/openstack/tripleo-heat-templates/blob/stable/train/deployment/haproxy/haproxy-edge-container-puppet.yaml#L109

I really appreciate all of your help and hope this issue can be permanently resolved in the next version.
Meanwhile, I will go with the workaround that Oliver shared.

Comment 21 errata-xmlrpc 2022-03-23 22:30:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat OpenStack Platform 16.2 (openstack-tripleo-heat-templates) security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0995

