Bug 1698033

Summary: community-operators Readiness probe errored: command timed out, exit code -1
Product: OpenShift Container Platform
Reporter: Nikolaos Leandros Moraitis <nmoraiti>
Component: OLM
Assignee: Evan Cordell <ecordell>
Status: CLOSED DUPLICATE
QA Contact: Jian Zhang <jiazha>
Severity: urgent
Priority: unspecified
Version: unspecified
CC: aos-bugs, aravindh, jokerman, mfojtik, mifiedle, mmccomas, wking
Target Release: 4.1.0
Hardware: Unspecified
OS: Unspecified
Last Closed: 2019-04-09 17:15:20 UTC
Type: Bug

Comment 1 Michal Fojtik 2019-04-09 13:31:06 UTC
Apr 09 11:57:01.789 W ns/openshift-marketplace pod/community-operators-8dfb4dc87-pfd6z Readiness probe errored: rpc error: code = Unknown desc = command error: command timed out, stdout: , stderr: , exit code -1 (5 times)

Comment 3 W. Trevor King 2019-04-09 17:15:20 UTC
OK, it sounds like the slow community-operators pod is a red herring.  There were no failing or pending cluster operators here:

  $ curl -s https://storage.googleapis.com/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade-4.0/1017/artifacts/e2e-aws-upgrade/clusteroperators.json | jq '.items[] | .failing = ([.status.conditions[] | select(.type == "Failing")][0]) | .pending = ([.status.conditions[] | select(.type == "Pending")][0]) | select(.failing.status == "True" or .pending.status == "True")'
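
For readers less familiar with jq, the filter above can be sketched in Python roughly as follows. This is only an illustration: the SAMPLE document is hypothetical (it is shaped like clusteroperators.json but is not data from this CI run), and unhealthy_operators is a made-up helper name.

```python
import json

# Hypothetical sample shaped like clusteroperators.json; NOT data from the CI run.
SAMPLE = json.loads("""
{"items": [
  {"metadata": {"name": "dns"},
   "status": {"conditions": [{"type": "Failing", "status": "False"},
                             {"type": "Available", "status": "True"}]}},
  {"metadata": {"name": "marketplace"},
   "status": {"conditions": [{"type": "Pending", "status": "True"}]}}
]}
""")


def unhealthy_operators(doc):
    """Return names of operators whose Failing or Pending condition is "True"."""
    names = []
    for item in doc.get("items", []):
        conditions = item.get("status", {}).get("conditions", [])
        # Keep only the first condition of each type, like the [0] in the jq filter.
        status_by_type = {}
        for cond in conditions:
            status_by_type.setdefault(cond.get("type"), cond.get("status"))
        if (status_by_type.get("Failing") == "True"
                or status_by_type.get("Pending") == "True"):
            names.append(item["metadata"]["name"])
    return names


if __name__ == "__main__":
    print(unhealthy_operators(SAMPLE))
```

An empty result corresponds to the jq command printing nothing, which is what "no failing cluster operators" means here.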

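But the version rollout was not complete (13 of the 25 operators still reported 4.0.0-0.ci-2019-04-09-033744 while the other 12 reported 4.0.0-0.ci-2019-04-09-111643), which is probably why the test failed: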

  $ curl -s https://storage.googleapis.com/origin-ci-test/logs/release-openshift-origin-installer-e2e-aws-upgrade-4.0/1017/artifacts/e2e-aws-upgrade/clusteroperators.json | jq -r '.items[] | .version = (.status.versions[] | select(.name == "operator").version) | .version + " " + .metadata.name' | sort
  4.0.0-0.ci-2019-04-09-033744 authentication
  4.0.0-0.ci-2019-04-09-033744 cluster-autoscaler
  4.0.0-0.ci-2019-04-09-033744 console
  4.0.0-0.ci-2019-04-09-033744 image-registry
  4.0.0-0.ci-2019-04-09-033744 ingress
  4.0.0-0.ci-2019-04-09-033744 marketplace
  4.0.0-0.ci-2019-04-09-033744 monitoring
  4.0.0-0.ci-2019-04-09-033744 node-tuning
  4.0.0-0.ci-2019-04-09-033744 openshift-controller-manager
  4.0.0-0.ci-2019-04-09-033744 openshift-samples
  4.0.0-0.ci-2019-04-09-033744 service-catalog-apiserver
  4.0.0-0.ci-2019-04-09-033744 service-catalog-controller-manager
  4.0.0-0.ci-2019-04-09-033744 storage
  4.0.0-0.ci-2019-04-09-111643 cloud-credential
  4.0.0-0.ci-2019-04-09-111643 dns
  4.0.0-0.ci-2019-04-09-111643 kube-apiserver
  4.0.0-0.ci-2019-04-09-111643 kube-controller-manager
  4.0.0-0.ci-2019-04-09-111643 kube-scheduler
  4.0.0-0.ci-2019-04-09-111643 machine-api
  4.0.0-0.ci-2019-04-09-111643 machine-config
  4.0.0-0.ci-2019-04-09-111643 network
  4.0.0-0.ci-2019-04-09-111643 openshift-apiserver
  4.0.0-0.ci-2019-04-09-111643 operator-lifecycle-manager
  4.0.0-0.ci-2019-04-09-111643 operator-lifecycle-manager-catalog
  4.0.0-0.ci-2019-04-09-111643 service-ca
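
The same grouping can be sketched in Python to list which operators have not yet reached a given version. This is a minimal sketch under assumptions: the SAMPLE document is a hypothetical three-operator excerpt that mirrors versions from the listing above, and operators_by_version and lagging_operators are illustrative helper names, not part of any OpenShift tooling.

```python
from collections import defaultdict

# Hypothetical excerpt shaped like clusteroperators.json; the three
# name/version pairs mirror lines from the listing above.
SAMPLE = {
    "items": [
        {"metadata": {"name": "marketplace"},
         "status": {"versions": [
             {"name": "operator", "version": "4.0.0-0.ci-2019-04-09-033744"}]}},
        {"metadata": {"name": "dns"},
         "status": {"versions": [
             {"name": "operator", "version": "4.0.0-0.ci-2019-04-09-111643"}]}},
        {"metadata": {"name": "network"},
         "status": {"versions": [
             {"name": "operator", "version": "4.0.0-0.ci-2019-04-09-111643"}]}},
    ]
}


def operators_by_version(doc):
    """Map each reported "operator" version to a sorted list of operator names."""
    groups = defaultdict(list)
    for item in doc.get("items", []):
        for entry in item.get("status", {}).get("versions", []):
            if entry.get("name") == "operator":
                groups[entry["version"]].append(item["metadata"]["name"])
    return {version: sorted(names) for version, names in groups.items()}


def lagging_operators(doc, target):
    """Return sorted names of operators whose version differs from target."""
    return sorted(
        name
        for version, names in operators_by_version(doc).items()
        if version != target
        for name in names
    )


if __name__ == "__main__":
    print(lagging_operators(SAMPLE, "4.0.0-0.ci-2019-04-09-111643"))
```

Passing the target version explicitly avoids guessing which of the two version strings is the upgrade target.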

I think that is just bug 1694226, so I'm closing this as a dup.

*** This bug has been marked as a duplicate of bug 1694226 ***

Comment 4 Red Hat Bugzilla 2023-09-14 05:26:39 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days