Bug 2079788

Summary:	Operator restarts while applying the acm-ice example
Product:	OpenShift Container Platform	Reporter:	Constantin Vultur <cvultur>
Component:	Special Resource Operator	Assignee:	Patryk Matuszak <pmatusza>
Status:	CLOSED ERRATA	QA Contact:	Constantin Vultur <cvultur>
Severity:	urgent	Docs Contact:
Priority:	high
Version:	4.11	CC:	bthurber
Target Milestone:	---
Target Release:	4.11.0
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2022-08-10 11:09:16 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Constantin Vultur 2022-04-28 09:09:23 UTC

Description of problem:
Operator restarts after applying an SpecialResourceModule
Logs point to this error: 'fatal error: concurrent map read and map write'


Version-Release number of selected component (if applicable):
master/release-4.11

How reproducible:


Steps to Reproduce:
1. Deploy Cluster with ACM installed and few spokes
2. Deploy SRO from bundle from master/release-4.11
3. Make sure CRD specialresourcemodules.sro.openshift.io is installed
3. Apply the ACM ice example that includes few spokes

Actual results:
Operator restart. In logs this line can be seen:

fatal error: concurrent map read and map write

Further oc create / oc delete of same file worked OK. 
This error appeared on the first time the CR was applied


The acm-ice.yaml had this content:
apiVersion: sro.openshift.io/v1beta1
kind: SpecialResourceModule
metadata:
  name: acm-ice
spec:
  namespace: acm-ice
  chart:
    name: acm-ice
    version: 0.0.1
    repository:
      name: chart
      url: cm://acm-ice/acm-ice-chart
  set:
    kind: Values
    apiVersion: sro.openshift.io/v1beta1
    buildArgs:
      - name: DRIVER_VER
        value: "1.6.4"
    registry: registry.ocp-edge-cluster-cdv-0.qe.lab.redhat.com:5000/sro
  watch:
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcp-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcptwo-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcpthree-0


Expected results:
No operator crash while applying a CR


Additional info:

Comment 4 Constantin Vultur 2022-05-19 16:37:44 UTC

Verified this with latest bundle from release-4.11 and did not see any restarts. 

Marking as Verified.

Comment 6 errata-xmlrpc 2022-08-10 11:09:16 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069