Bug 1777204

Summary: [IPI][Baremetal] Many haproxy processes hanging around due to monitor not closing connections.
Product: OpenShift Container Platform Reporter: Yossi Boaron <yboaron>
Component: Machine Config OperatorAssignee: Yossi Boaron <yboaron>
Status: CLOSED ERRATA QA Contact: Victor Voronkov <vvoronko>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 4.2.0CC: amurdaca, asegurap, augol, bperkins, rbartal, smilner, wsun
Target Milestone: ---   
Target Release: 4.4.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-05-04 11:17:42 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Yossi Boaron 2019-11-27 07:03:37 UTC
Description of problem:

Duplication of https://bugzilla.redhat.com/show_bug.cgi?id=1771566 for Baremetal environment

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 3 Victor Voronkov 2020-03-16 12:20:13 UTC
Verified on 4.4.0-0.ci-2020-03-11-095511

[core@master-0 ~]$ ps aux | grep /sbin/haproxy
USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
1001        3297  0.0  0.2 146784 73120 ?        S    Mar12   0:08 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 17 16
1001        3301  0.1  0.3 1317268 122652 ?      Sl   Mar12  10:44 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 17 16

=== rebooting master-1:

cat /etc/haproxy/haproxy.cfg
...
backend masters
   option  httpchk GET /healthz HTTP/1.0
   option  log-health-checks
   balance roundrobin
   server master-2.ocp-edge-cluster.qe.lab.redhat.com. fd2e:6f44:5dd8:c956::112:6443 weight 1 verify none check check-ssl inter 3s fall 2 rise 3
   server master-0.ocp-edge-cluster.qe.lab.redhat.com. fd2e:6f44:5dd8:c956::150:6443 weight 1 verify none check check-ssl inter 3s fall 2 rise 3


[core@master-0 ~]$ ps aux | grep /sbin/haproxy
1001     1319149  0.0  0.2 146796 72944 ?        S    12:02   0:00 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 38 35
1001     1319154  0.4  0.2 1314684 76856 ?       Sl   12:02   0:00 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 38 35

=== master-1 rebooted:

[core@master-0 ~]$ ps aux | grep /sbin/haproxy
1001     1327960  0.0  0.2 146788 72788 ?        S    12:04   0:00 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 61 58
1001     1327965  0.3  0.2 1314680 79340 ?       Sl   12:04   0:00 /usr/sbin/haproxy -W -db -f /etc/haproxy/haproxy.cfg -p /var/lib/haproxy/run/haproxy.pid -x /var/lib/haproxy/run/haproxy.sock -sf 61 58

[core@master-0 ~]$ tail -30f /etc/haproxy/haproxy.cfg 
  retries 3
  timeout http-request 10s
  timeout queue        1m
  timeout connect      10s
  timeout client       86400s
  timeout server       86400s
  timeout tunnel       86400s
...

Comment 5 errata-xmlrpc 2020-05-04 11:17:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0581