Bug 2079788 - Operator restarts while applying the acm-ice example
Summary: Operator restarts while applying the acm-ice example
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Special Resource Operator
Version: 4.11
Hardware: Unspecified
OS: Unspecified
high
urgent
Target Milestone: ---
: 4.11.0
Assignee: Patryk Matuszak
QA Contact: Constantin Vultur
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-04-28 09:09 UTC by Constantin Vultur
Modified: 2022-08-10 11:09 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-08-10 11:09:16 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift special-resource-operator pull 175 0 None open Bug 2079788: Handle concurrent map access 2022-04-28 16:49:22 UTC
Red Hat Product Errata RHSA-2022:5069 0 None None None 2022-08-10 11:09:32 UTC

Description Constantin Vultur 2022-04-28 09:09:23 UTC
Description of problem:
Operator restarts after applying an SpecialResourceModule
Logs point to this error: 'fatal error: concurrent map read and map write'


Version-Release number of selected component (if applicable):
master/release-4.11

How reproducible:


Steps to Reproduce:
1. Deploy Cluster with ACM installed and few spokes
2. Deploy SRO from bundle from master/release-4.11
3. Make sure CRD specialresourcemodules.sro.openshift.io is installed
3. Apply the ACM ice example that includes few spokes

Actual results:
Operator restart. In logs this line can be seen:

fatal error: concurrent map read and map write

Further oc create / oc delete of same file worked OK. 
This error appeared on the first time the CR was applied


The acm-ice.yaml had this content:
apiVersion: sro.openshift.io/v1beta1
kind: SpecialResourceModule
metadata:
  name: acm-ice
spec:
  namespace: acm-ice
  chart:
    name: acm-ice
    version: 0.0.1
    repository:
      name: chart
      url: cm://acm-ice/acm-ice-chart
  set:
    kind: Values
    apiVersion: sro.openshift.io/v1beta1
    buildArgs:
      - name: DRIVER_VER
        value: "1.6.4"
    registry: registry.ocp-edge-cluster-cdv-0.qe.lab.redhat.com:5000/sro
  watch:
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcp-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcptwo-0
    - path: "$.metadata.labels.openshiftVersion"
      apiVersion: cluster.open-cluster-management.io/v1
      kind: ManagedCluster
      name: snodhcpthree-0


Expected results:
No operator crash while applying a CR


Additional info:

Comment 4 Constantin Vultur 2022-05-19 16:37:44 UTC
Verified this with latest bundle from release-4.11 and did not see any restarts. 

Marking as Verified.

Comment 6 errata-xmlrpc 2022-08-10 11:09:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069


Note You need to log in before you can comment on or make changes to this bug.