Bug 2038897 - [RHOSP16.2] [DCN] [STF] metrics_qdr containers failed to start with bind address error
Summary: [RHOSP16.2] [DCN] [STF] metrics_qdr containers failed to start with bind addr...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-heat-templates
Version: 16.2 (Train)
Hardware: All
OS: Linux
high
urgent
Target Milestone: z2
: 16.2 (Train on RHEL 8.4)
Assignee: Ollie Walsh
QA Contact: Marian Krcmarik
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-01-10 13:52 UTC by Meiyan Zheng
Modified: 2023-09-18 04:30 UTC (History)
12 users (show)

Fixed In Version: openstack-tripleo-heat-templates-11.6.1-2.20220116004909.64b2e88.el8ost
Doc Type: Bug Fix
Doc Text:
This update fixes a bug that caused the `metrics_qdr` service to appear in the HAProxy configuration of an Service Telemetry Framework (STF) deployment on a distributed compute node (DCN) node.
Clone Of:
Environment:
Last Closed: 2022-03-23 22:30:16 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 824145 0 None master: MERGED tripleo-heat-templates: Disable metrics_qdr binding in HAProxyEdge (I5f3b678b30f6375844c0a4b094432c9be9d8e444) 2022-02-07 21:40:14 UTC
OpenStack gerrit 824254 0 None stable/train: MERGED tripleo-heat-templates: Disable metrics_qdr binding in HAProxyEdge (I5f3b678b30f6375844c0a4b094432c9be9d8e444) 2022-02-07 21:40:20 UTC
Red Hat Issue Tracker OSP-12077 0 None None None 2022-01-10 13:55:33 UTC
Red Hat Product Errata RHSA-2022:0995 0 None None None 2022-03-23 22:30:41 UTC

Description Meiyan Zheng 2022-01-10 13:52:03 UTC
Description of problem:

RHOSP 16.2 DCN with Storage(Ceph), integrating with STF 

  3 x DistributedComputeHCI
  7 x DistributedComputeHCIScaleOut  

metrics_qdr service configured in haproxy.cfg on DistributedComputeHCIScaleOut and listening 0.0.0.0:5666, causing metrics_qdr container failed with bind address error. 

Here is the configuration in haproxy.cfg on DistributedComputeHCIScaleOut node. 
      ============================
      ~
      listen metrics_qdr
        bind :5666 transparent
        option tcp-check
        option tcplog
        tcp-check connect port 5666
        server controller01~
        server controller02~
        server controller03~
      ============================

From /usr/share/openstack-tripleo-heat-templates/deployment/haproxy/haproxy-edge-container-puppet.yaml, seems only glance-api services should be set in haproxy.cfg, but somehow metric_qdr service also configured in haproxy.cfg on the DistributedComputeHCIScaleOut node. 


Version-Release number of selected component (if applicable):

RHOSP 16.2


How reproducible:


Steps to Reproduce:
1. disable public virtual IP
2. Deploy RHOSP 16.2 integrating with STF in DCN environment 
 

Actual results:

metrics_qdr service configured in haproxy.cfg on DistributedComputeHCIScaleOut and listening 0.0.0.0:5666, causing metrics_qdr container failed with bind address error. 


Expected results:

metrics_qdr container should be started without problem 
metrics_qdr should not be configured in haproxy.cfg on DistributedComputeHCIScaleOut node. 

Additional info:

Comment 1 Donghwi Cha 2022-01-10 21:51:49 UTC
Hi Meiyan. Thanks for creating BZ on behalf of me. 

I included Oliver Walsh who helped with providing workaround for this issue. 

parameter_defaults:
    DistributedComputeScaleOutExtraConfig:
        tripleo::haproxy::metrics_qdr: false
    DistributedComputeHCIScaleOutExtraConfig:
        tripleo::haproxy::metrics_qdr: false


According to his insight, the issue is in haproxy-edge-container-puppet.yaml shown the link below

>> https://github.com/openstack/tripleo-heat-templates/blob/stable/train/deployment/haproxy/haproxy-edge-container-puppet.yaml#L109.

I really appreciate all of your help and hope this issue can be permanently resolved in the next version,
meanwhile I will go with the workaround that Oliver shared.

Comment 21 errata-xmlrpc 2022-03-23 22:30:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat OpenStack Platform 16.2 (openstack-tripleo-heat-templates) security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0995

Comment 22 Red Hat Bugzilla 2023-09-18 04:30:00 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days


Note You need to log in before you can comment on or make changes to this bug.