Bug 1814821

Summary: community operator catalog image crashloop when applying CatalogSource
Product: OpenShift Container Platform Reporter: Evan Cordell <ecordell>
Component: OLMAssignee: Evan Cordell <ecordell>
OLM sub component: OLM QA Contact: yhui
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: high CC: acarter, augol, cdoan, cvultur, jfan, mcornea, sasha, vdinh
Version: 4.4Keywords: TestBlocker
Target Milestone: ---   
Target Release: 4.4.0   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1814457 Environment:
Last Closed: 2020-05-04 11:46:37 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1814457    
Bug Blocks: 1814777    

Comment 4 yhui 2020-03-24 05:57:26 UTC
1, Create a 4.4 cluster that the fixed PR merged in.

[hui@localhost 1814821]$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.4.0-0.nightly-2020-03-23-115620   True        False         78m     Cluster version is 4.4.0-0.nightly-2020-03-23-115620

[hui@localhost 1814821]$ oc exec catalog-operator-64cfdb5b6f-v7nrk -n openshift-operator-lifecycle-manager -- olm --version
OLM version: 0.14.2
git commit: 6a7973251408b9d4d0ee42e12e98af527d6b56a0

2, Create a CatalogSource object with this "dhubchris/community-operators:v1" image. Its pod works well.

[hui@localhost 1814821]$ cat cs-1805410.yaml 
apiVersion: operators.coreos.com/v1alpha1
kind: CatalogSource
metadata:
  name: bug-operator
  namespace: openshift-marketplace
spec:
  sourceType: grpc
  image: dhubchris/community-operators:v1
  displayName: Bug Operators
  publisher: Red Hat

[hui@localhost 1814821]$ oc get pods -n openshift-marketplace
NAME                                    READY   STATUS    RESTARTS   AGE
bug-operator-74cxh                      1/1     Running   0          48s
certified-operators-bdbb7dd8b-5m8kl     1/1     Running   0          77m
community-operators-6649d867c7-8lg65    1/1     Running   0          77m
marketplace-operator-7875c55475-x4gld   1/1     Running   0          78m
redhat-marketplace-7bdd5b4d48-k255l     1/1     Running   0          77m
redhat-operators-5bd7f86b9f-tt757       1/1     Running   0          77m

[hui@localhost 1814821]$ oc get packagemanifest |grep -i bug
hyperfoil-bundle                             Bug Operators         79s
enmasse                                      Bug Operators         79s
must-gather-operator                         Bug Operators         79s
...

3, Check the Request CPU/Memory of this pod, no Limits. LGTM, verify it.
[hui@localhost 1814821]$ oc get pods -n openshift-marketplace bug-operator-74cxh -o yaml
apiVersion: v1
kind: Pod
metadata:
  annotations:
    k8s.v1.cni.cncf.io/networks-status: |-
      [{
          "name": "openshift-sdn",
          "interface": "eth0",
          "ips": [
              "10.130.0.38"
          ],
          "dns": {},
          "default-route": [
              "10.130.0.1"
          ]
      }]
    openshift.io/scc: anyuid
  creationTimestamp: "2020-03-24T05:25:18Z"
  generateName: bug-operator-
  labels:
    olm.catalogSource: bug-operator
  name: bug-operator-74cxh
  namespace: openshift-marketplace
  ownerReferences:
  - apiVersion: operators.coreos.com/v1alpha1
    blockOwnerDeletion: false
    controller: false
    kind: CatalogSource
    name: bug-operator
    uid: 749f0fa2-adb5-4481-90ca-8f46937a42a0
  resourceVersion: "40020"
  selfLink: /api/v1/namespaces/openshift-marketplace/pods/bug-operator-74cxh
  uid: 9d2fa453-ef22-458e-8c3a-c3e880728380
spec:
  containers:
  - image: dhubchris/community-operators:v1
    imagePullPolicy: IfNotPresent
    livenessProbe:
      exec:
        command:
        - grpc_health_probe
        - -addr=localhost:50051
      failureThreshold: 3
      initialDelaySeconds: 10
      periodSeconds: 10
      successThreshold: 1
      timeoutSeconds: 1
    name: registry-server
    ports:
    - containerPort: 50051
      name: grpc
      protocol: TCP
    readinessProbe:
      exec:
        command:
        - grpc_health_probe
        - -addr=localhost:50051
      failureThreshold: 3
      initialDelaySeconds: 5
      periodSeconds: 10
      successThreshold: 1
      timeoutSeconds: 5
    resources:
      requests:
        cpu: 10m
        memory: 50Mi
    securityContext:
      capabilities:
        drop:
        - MKNOD
    terminationMessagePath: /dev/termination-log
    terminationMessagePolicy: File
    volumeMounts:
    - mountPath: /var/run/secrets/kubernetes.io/serviceaccount
      name: default-token-txnph
      readOnly: true
  dnsPolicy: ClusterFirst
  enableServiceLinks: true
  imagePullSecrets:
  - name: default-dockercfg-gmt5c
  nodeName: ip-10-0-133-15.us-east-2.compute.internal
  nodeSelector:
    beta.kubernetes.io/os: linux
  priority: 0
  restartPolicy: Always
  schedulerName: default-scheduler
  securityContext:
    seLinuxOptions:
      level: s0:c8,c2
  serviceAccount: default
  serviceAccountName: default
  terminationGracePeriodSeconds: 30
  tolerations:
  - operator: Exists
  volumes:
  - name: default-token-txnph
    secret:
      defaultMode: 420
      secretName: default-token-txnph
...

Comment 6 errata-xmlrpc 2020-05-04 11:46:37 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0581