Bug 2079788

Summary: Operator restarts while applying the acm-ice example
Product: OpenShift Container Platform Reporter: Constantin Vultur <cvultur>
Component: Special Resource OperatorAssignee: Patryk Matuszak <pmatusza>
Status: CLOSED ERRATA QA Contact: Constantin Vultur <cvultur>
Severity: urgent Docs Contact:
Priority: high    
Version: 4.11CC: bthurber
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-10 11:09:16 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Constantin Vultur 2022-04-28 09:09:23 UTC
Description of problem:
Operator restarts after applying an SpecialResourceModule
Logs point to this error: 'fatal error: concurrent map read and map write'


Version-Release number of selected component (if applicable):
master/release-4.11

How reproducible:


Steps to Reproduce:
1. Deploy Cluster with ACM installed and few spokes
2. Deploy SRO from bundle from master/release-4.11
3. Make sure CRD specialresourcemodules.sro.openshift.io is installed
3. Apply the ACM ice example that includes few spokes

Actual results:
Operator restart. In logs this line can be seen:

fatal error: concurrent map read and map write

Further oc create / oc delete of same file worked OK. 
This error appeared on the first time the CR was applied


The acm-ice.yaml had this content:
apiVersion: sro.openshift.io/v1beta1
kind: SpecialResourceModule
metadata:
  name: acm-ice
spec:
  namespace: acm-ice
  chart:
    name: acm-ice
    version: 0.0.1
    repository:
      name: chart
      url: cm://acm-ice/acm-ice-chart
  set:
    kind: Values
    apiVersion: sro.openshift.io/v1beta1
    buildArgs:
      - name: DRIVER_VER
        value: "1.6.4"
    registry: registry.ocp-edge-cluster-cdv-0.qe.lab.redhat.com:5000/sro
  watch:
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcp-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcptwo-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcpthree-0


Expected results:
No operator crash while applying a CR


Additional info:

Comment 4 Constantin Vultur 2022-05-19 16:37:44 UTC
Verified this with latest bundle from release-4.11 and did not see any restarts. 

Marking as Verified.

Comment 6 errata-xmlrpc 2022-08-10 11:09:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069