Bug 1856413
Summary: | AMQ Online CSV sticks at Pending since OpenShift 4.4.11 | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Keith Wall <kwall> | ||||||||
Component: | OLM | Assignee: | Evan Cordell <ecordell> | ||||||||
OLM sub component: | OLM | QA Contact: | Jian Zhang <jiazha> | ||||||||
Status: | CLOSED DUPLICATE | Docs Contact: | |||||||||
Severity: | high | ||||||||||
Priority: | high | CC: | bluddy, rahmed, vdinh | ||||||||
Version: | 4.4 | ||||||||||
Target Milestone: | --- | ||||||||||
Target Release: | 4.6.0 | ||||||||||
Hardware: | Unspecified | ||||||||||
OS: | Unspecified | ||||||||||
Whiteboard: | |||||||||||
Fixed In Version: | Doc Type: | If docs needed, set a value | |||||||||
Doc Text: | Story Points: | --- | |||||||||
Clone Of: | Environment: | ||||||||||
Last Closed: | 2020-07-16 15:21:28 UTC | Type: | Bug | ||||||||
Regression: | --- | Mount Type: | --- | ||||||||
Documentation: | --- | CRM: | |||||||||
Verified Versions: | Category: | --- | |||||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||||
Embargoed: | |||||||||||
Attachments: |
|
Description
Keith Wall
2020-07-13 14:51:22 UTC
Relates to second aspect of this case: https://access.redhat.com/support/cases/#/case/02697817 Created attachment 1700854 [details]
CSV - 4.4.10 -working
Created attachment 1700855 [details]
CSV - 4.4.11 -failing
I've attached two dumps of the CSV showing the difference when AMQ Online 1.4.4 is installed on OCP 4.4.10 (working) and OCP 4.4.11 (failing - sticks in Pending state). Hey team, Can anyone please have a look at the case, The client is not happy. To quickly help the customer, it should be enough to create a ClusterRole/RoleBinding that grants {"verbs":["get","list","watch"],"apiGroups":["iot.enmasse.io"],"resources":["iotprojects"]} to the iot-tenant-service serviceacounnt manually. To further debug this, it would be helpful if we could see the installplan for the 4.4.11 install. Created attachment 1701035 [details]
amq-online-1.4.4-installplan on OpenShift 4.4.11
Evan I can confirm that manually applying the ClusterRoleBinding/ClusterRole for the affected ServiceAccount from AMQ Online's install bundle does workaround the issue. This works around the issue for AMQ Online 1.4.4. The same approach works for AMQ Online 1.5.0 which had two Service Accounts exhibiting the PresentNotSatisfied symptom. Can I help with any more artefacts to help you establish root cause? Any progress in understanding root cause? The workaround has been given to the client, but this still means AMQ Online is broken when installed from OLM for all other users. I've confirmed that this is a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1855088 Here's the excerpt from `clusterPermissions` on AMQ's CSV { "serviceAccountName": "iot-protocol-adapter", "rules": [ { "verbs": [ "get", "list", "watch" ], "apiGroups": [ "iot.enmasse.io" ], "resources": [ "iotprojects" ] } ] }, { "serviceAccountName": "iot-tenant-service", "rules": [ { "verbs": [ "get", "list", "watch" ], "apiGroups": [ "iot.enmasse.io" ], "resources": [ "iotprojects" ] } ] } this triggers the linked bug because both ClusterRoles have the same set of permissions. Any cluster that has hit this issue can be fixed by manually creating the missing ClusterRoles. We can work around this issue in AMQ manifests by adding an extra, dummy permissionrule to any ClusterRoles with duplicated rules. (perhaps access to a specific resource that the serivceaccount should already have access to?) And the fix in OLM will come via 1855088 *** This bug has been marked as a duplicate of bug 1855088 *** Thanks Evan for the explanation. |