Bug 1746406 - Cannot get all pakcagemanifest resources due to 504 timeout
Summary: Cannot get all pakcagemanifest resources due to 504 timeout
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: OLM
Version: 4.2.0
Hardware: Unspecified
OS: Unspecified
urgent
high
Target Milestone: ---
: 4.2.z
Assignee: Nick Hale
QA Contact: Jian Zhang
URL:
Whiteboard:
Depends On:
Blocks: 1728536
TreeView+ depends on / blocked
 
Reported: 2019-08-28 11:53 UTC by Evan Cordell
Modified: 2020-02-03 16:50 UTC (History)
13 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 1728536
Environment:
Last Closed: 2020-01-23 11:05:32 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2020:0062 0 None None None 2020-01-23 11:06:00 UTC

Comment 1 Evan Cordell 2019-08-28 22:27:31 UTC
Has this issue been seen on a 4.2 cluster before? Do we still have one available for investigation?

Comment 6 Dan Geoffroy 2019-09-17 13:04:42 UTC
Moving to 4.3 as not overly concerned that this is a primary customer use case.

Comment 9 Jian Zhang 2019-11-18 03:26:11 UTC
Hi, Nick

Yes, it only occurred in the cluster which one of the nodes located in Yizhuang District, Beijing. 

Per comment 5, 
> But, after I reschedule the package pods on this new worker, it failed to get the packagemanifests. Is it as expected? @Evan
mac:~ jianzhang$ oc get pods  -o wide
NAME                               READY   STATUS    RESTARTS   AGE     IP            NODE                                      NOMINATED NODE   READINESS GATES
catalog-operator-dbf74866f-9mgvr   1/1     Running   0          5h23m   10.129.0.25   qe-wewang-https3-w49n7-control-plane-0    <none>           <none>
olm-operator-5bd79c59ff-cfsnj      1/1     Running   0          5h23m   10.129.0.29   qe-wewang-https3-w49n7-control-plane-0    <none>           <none>
packageserver-55ccd876d6-5dhsl     1/1     Running   0          85s     10.129.2.8    dell-per630-04.rhts.eng.pek2.redhat.com   <none>           <none>
packageserver-55ccd876d6-hc4sl     1/1     Running   0          2m39s   10.129.2.7    dell-per630-04.rhts.eng.pek2.redhat.com   <none>           <none>
mac:~ jianzhang$ oc get packagemanifest
Error from server (Timeout): the server was unable to return a response in the time allotted, but may still be processing the request (get packagemanifests.packages.operators.coreos.com)
mac:~ jianzhang$ oc logs packageserver-55ccd876d6-5dhsl 
Error from server: Get https://10.73.2.5:10250/containerLogs/openshift-operator-lifecycle-manager/packageserver-55ccd876d6-5dhsl/packageserver: proxyconnect tcp: x509: certificate signed by unknown authority
But, after the package pods rescheduled to the master, it works well, LGTM.


Do you think is it an issue? If not, I think I can verify it.

Comment 12 Jian Zhang 2019-11-29 06:13:41 UTC
Hi, Nick

I'm not sure what's the difference. But, we did always met this issue in that cluster, which "run a worker in Yizhuang District, Beijing". The system of that worker is RHEL 7.7, not the RHCOS. But, it's OK now, verify it, thanks!

Comment 14 errata-xmlrpc 2020-01-23 11:05:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0062


Note You need to log in before you can comment on or make changes to this bug.