Bug 1925614 - Error: InstallPlan.operators.coreos.com not found
Summary: Error: InstallPlan.operators.coreos.com not found
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: OLM
Version: 4.6.z
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: ---
: 4.8.0
Assignee: Nick Hale
QA Contact: Bruno Andrade
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-02-05 16:47 UTC by Asheth
Modified: 2022-10-11 09:30 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Cause: Info log level used for transient errors. Consequence: Noisy olm-operator logs for the default configuration. Fix: Change transient error log level to debug. Result: Less logs noise for the debug configuration.
Clone Of:
Environment:
Last Closed: 2021-07-27 22:41:35 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
Output of the command $ oc get subscription -A (2.43 KB, text/plain)
2021-02-05 16:47 UTC, Asheth
no flags Details
$ oc get subscription -A -o yaml (71.02 KB, text/plain)
2021-02-17 17:13 UTC, Asheth
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Github operator-framework operator-lifecycle-manager pull 2011 0 None open Bug 1925614: chore(adoption): use debug level to log missing components 2021-02-17 16:22:32 UTC
Red Hat Product Errata RHSA-2021:2438 0 None None None 2021-07-27 22:42:17 UTC

Description Asheth 2021-02-05 16:47:13 UTC
Created attachment 1755263 [details]
Output of the command $ oc get subscription -A

Description of problem:

OLM pod logs repetitively show ""msg":"component not found","error":"InstallPlan.operators.coreos.com \"install-5bwb6\" not found"

Version-Release number of selected component (if applicable):

4.6.15


Actual results:

OLM pod logs repetitively show ""msg":"component not found","error":"InstallPlan.operators.coreos.com \"install-5bwb6\" not found"

Expected results:

These error messages should be fixed.

Comment 6 Asheth 2021-02-11 09:44:53 UTC
This problem is still there in OCP 4.6.16.

Comment 7 Nick Hale 2021-02-16 03:18:29 UTC
From the logs (https://bugzilla.redhat.com/attachment.cgi?id=1755267), it looks like there are two distinct issues of various severity:

1. (low severity) error log when the InstallPlan a Subscription references no longer exists
2. (higher severity) high-frequency reconciliation of CSVs and Operator adoption (where the error log in 1 is generated)

We can fairly easily suppress the log in issue 1; issue 2 is very similar to another bug, which has a verified fix that unfortunately hasn't been released to 4.6.z yet according to bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1924257 (In RELEASE_PENDING as of 02/15)

I tested today against 4.7 and was unable to trigger the high-frequency reconciliation (i.e. hot-loop). Tomorrow I'll attempt to reproduce against 4.6.16 and then again on the release-4.6 branch of OLM (which already contains the patch). If I can reproduce the hot-loop in 4.6.16 but not release-4.6, it means the two are not related and I'll open a PR with a simple patch to suppress issue 1; if I can (or can't) reproduce the hot-loop in _both_, it means that the root cause is not the same, and I'll need to do more digging.

Comment 8 Nick Hale 2021-02-17 14:47:00 UTC
Yesterday I was able to get some similar behavior by following the repro instructions in https://bugzilla.redhat.com/attachment.cgi?id=1755267 and deleting an InstallPlan referenced by a (any) Subscription.

Given that the hot-looping patch will (seemingly) be introduced in the next release, I'm reducing the priority/severity of this issue to low and I've opened a PR that will reduce log noise in olm-operator for non-debug settings.

Comment 10 Asheth 2021-02-17 17:13:26 UTC
Created attachment 1757619 [details]
$ oc get subscription -A -o yaml

Comment 13 Bruno Andrade 2021-02-19 16:05:46 UTC
Created an operator, delete the install plan, and logs now do not show the messages reported on the Bug. LGTM, marking as VERIFIED.

oc exec olm-operator-564748df57-nxzwb -n openshift-operator-lifecycle-manager -- olm --version                                                                                                                                      
OCP: 4.8.0-0.nightly-2021-02-18-175130
OLM version: 0.17.0
git commit: ad769c0747d0a7dd5bf501e7425b255535949dc8

Comment 20 errata-xmlrpc 2021-07-27 22:41:35 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438


Note You need to log in before you can comment on or make changes to this bug.