Bug 1466732
Summary: | cluster server temporarily unavailable and then recovered | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Weihua Meng <wmeng> |
Component: | Node | Assignee: | Derek Carr <decarr> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Weihua Meng <wmeng> |
Severity: | low | Docs Contact: | |
Priority: | medium | ||
Version: | 3.6.0 | CC: | aos-bugs, ccoleman, decarr, eparis, jokerman, lxia, mmccomas, wmeng |
Target Milestone: | --- | ||
Target Release: | 3.6.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-11-21 17:35:02 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Weihua Meng
2017-06-30 10:54:45 UTC
Can we get the master logs for "10.240.0.21" while you see this problem? Connection refused is a bit odd... I am not sure how to proceed further without logs. i suspect this is a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1465361 does the problem continue with the changes made in referenced bz? the logs are not same. may be same root cause. I will retry when that bug verified. is this an HA deployment? were the masters behind a loadbalancer? You can gzip attachments before uploading to send larger chunks. It is not a HA cluster. After investigation, it happens when run "openshift start master controllers --config=/etc/origin/master/master-config.yaml" on master, which means there might be two openshift master processes running at the same time, resulting in errors. and when one process exited, the cluster recovered. Not meet this issue recently. |