Bug 1663423

Summary: Router cannot sync routes until it is recreated
Product: OpenShift Container Platform
Reporter: wangzhida <zhiwang>
Component: Networking
Assignee: Ram Ranganathan <ramr>
Networking sub component: router
QA Contact: Hongan Li <hongli>
Status: CLOSED DUPLICATE
Docs Contact:
Severity: urgent
Priority: urgent
CC: aos-bugs, bbennett, cewang, dageoffr, dmace, hongli, jinjli, jolee, openshift-bugs-escalate, ramr, scuppett, sfu, weliang, zhiwang
Version: 3.11.0
Target Milestone: ---
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2019-01-22 14:45:25 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description wangzhida 2019-01-04 09:19:57 UTC
Description of problem:

The customer has an OCP 3.11 environment with 3 masters, 30 infra nodes, and 150+ nodes (holding 1900+ pods).
They are running router pods with replica=30 across the infra nodes.

The issue: when they create a route, some of the routers fail to sync it to their haproxy.conf. After the affected router pod is deleted, the recreated pod behaves normally.

The log of an affected router pod keeps reporting the following error:
E1228 11:57:38.516286       1 router_controller.go:250] another route has claimed this host
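
The error text points at a host-uniqueness check: a route is refused when its host is already recorded as owned by a different route. The following is a minimal sketch of that kind of check, not the router's actual code. It assumes the router keeps an in-memory table mapping each host to the route that claimed it and lets the older route win. The names Route, HostClaims, admit and release are hypothetical. It only illustrates how a claim left behind in memory could keep rejecting a new route until the table is rebuilt from scratch, which is what happens when a pod is recreated.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Route:
    namespace: str
    name: str
    host: str
    created: datetime


@dataclass
class HostClaims:
    """Hypothetical in-memory table of which route owns each host."""
    owners: dict = field(default_factory=dict)

    @staticmethod
    def _key(route: Route) -> tuple:
        return (route.namespace, route.name)

    def admit(self, route: Route) -> bool:
        """Return True if the route may use its host, False if another route holds it."""
        owner = self.owners.get(route.host)
        if owner is None or self._key(owner) == self._key(route):
            # Host is free, or this is an update of the current owner.
            self.owners[route.host] = route
            return True
        if route.created < owner.created:
            # Assumption: the older route wins and displaces the newer one.
            self.owners[route.host] = route
            return True
        # Corresponds to "another route has claimed this host".
        return False

    def release(self, route: Route) -> None:
        """Drop the claim when the owning route is deleted."""
        owner = self.owners.get(route.host)
        if owner is not None and self._key(owner) == self._key(route):
            del self.owners[route.host]


# A long-running router that never processed the deletion of the old route.
stale = HostClaims()
old = Route("ns1", "old", "app.example.com", datetime(2018, 12, 1))
new = Route("ns2", "new", "app.example.com", datetime(2018, 12, 28))
stale.admit(old)
rejected = not stale.admit(new)   # claim still held by the old route

# A recreated router starts with an empty table and only sees the new route.
fresh = HostClaims()
admitted = fresh.admit(new)
```

In this sketch the long-running instance rejects the new route while the fresh instance admits it. Whether the real router behaves this way in the reported case is not established here.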


Concerns from the customer:

1. They need detailed information about how the router syncs routes from the API. They suspect the root cause is that the router does not treat the API as the default input. For example, the API does send the new route to the routers, but a router refuses to apply it.

2. Does it make sense to simply use replica=30 to run HA routers in the cluster?


Because of Bugzilla's limits, please download the sosreport from case 02282929 #5 (password: redhat).


Thank you.

Comment 1 wangzhida 2019-01-09 01:39:53 UTC
Any update?

Comment 8 Ben Bennett 2019-01-14 14:58:07 UTC
@weibin: Can you try to reproduce this please?

Comment 35 Stephen Cuppett 2019-01-22 14:45:25 UTC

*** This bug has been marked as a duplicate of bug 1660598 ***