Description of problem:

We saw this failure in CI:

    "startTime": "2020-06-24T15:44:09Z",
    "containerStatuses": [
      {
        "name": "controller-manager",
        "state": {
          "waiting": {
            "reason": "ImageInspectError",
            "message": "Failed to inspect image \"registry.svc.ci.openshift.org/ci-op-t9zj0l6i/stable@sha256:b645cb8528a3fc8c94462ddee29b23b839fa96a2fce480ba77f421383a4809f5\": rpc error: code = Unknown desc = size for layer \"d18dc0b8cb048a281d310fcf1a32c41dfb11586f6a8fcf9192704d449dbcd33c\" is unknown, failing getSize()"
          }
        },
        "lastState": {},
        "ready": false,
        "restartCount": 0,
        "image": "registry.svc.ci.openshift.org/ci-op-t9zj0l6i/stable@sha256:b645cb8528a3fc8c94462ddee29b23b839fa96a2fce480ba77f421383a4809f5",
        "imageID": "",
        "started": false
      }

Version-Release number of selected component (if applicable):

cri-o-1.19.0-30.dev.rhaos4.6.git0a84af5.el8.x86_64
podman-1.9.3-1.rhaos4.6.el8.x86_64

How reproducible:

Happens in CI. We don't have a deterministic reproducer right now.
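For context on the message: the wording suggests image inspection sums the sizes of the image's layers and aborts as soon as any layer has no recorded size. The Go sketch below only illustrates that assumed pattern; the types and names are hypothetical, not the actual CRI-O/containers-storage code.

    package main

    import "fmt"

    // layer is a hypothetical, simplified layer record; a negative Size means
    // the size has not been recorded yet.
    type layer struct {
            ID   string
            Size int64
    }

    // imageSize sums per-layer sizes and fails when any layer's size is unknown,
    // mirroring the "size for layer ... is unknown, failing getSize()" error above.
    func imageSize(layers []layer) (int64, error) {
            var total int64
            for _, l := range layers {
                    if l.Size < 0 {
                            return 0, fmt.Errorf("size for layer %q is unknown, failing getSize()", l.ID)
                    }
                    total += l.Size
            }
            return total, nil
    }

    func main() {
            layers := []layer{
                    {ID: "aaaa", Size: 1024},
                    {ID: "d18dc0b8cb04", Size: -1}, // size not yet recorded
            }
            if _, err := imageSize(layers); err != nil {
                    fmt.Println("ImageInspectError:", err)
            }
    }

Under that reading, the image itself pulled fine; the inspection fails only because one layer's size is missing from the local metadata at the time of the query.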
Related: https://github.com/containers/storage/issues/569

It seems that this happens while a layer is being created. Can't say more without a reproducer :(
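To make the "layer is being created" hypothesis concrete, here is a minimal, self-contained Go sketch of the suspected window (an assumption about the mechanism, not the real storage code): the writer makes a layer visible first and records its size only once the commit finishes, so a reader that inspects in between sees an unknown size.

    package main

    import (
            "fmt"
            "sync"
            "time"
    )

    // store is a toy stand-in for the layer store; sizes[id] == -1 means
    // "layer exists but its size is not recorded yet".
    type store struct {
            mu    sync.Mutex
            sizes map[string]int64
    }

    func (s *store) createLayer(id string) {
            // Step 1: the layer becomes visible immediately ...
            s.mu.Lock()
            s.sizes[id] = -1
            s.mu.Unlock()

            // ... but its size is only recorded after the (slow) commit finishes.
            time.Sleep(50 * time.Millisecond)

            s.mu.Lock()
            s.sizes[id] = 4096
            s.mu.Unlock()
    }

    func (s *store) inspect(id string) error {
            s.mu.Lock()
            defer s.mu.Unlock()
            size, ok := s.sizes[id]
            if !ok {
                    return fmt.Errorf("layer not known")
            }
            if size < 0 {
                    return fmt.Errorf("size for layer %q is unknown", id)
            }
            return nil
    }

    func main() {
            s := &store{sizes: map[string]int64{}}
            go s.createLayer("abc123")

            time.Sleep(10 * time.Millisecond) // inspect while the commit is still in flight
            fmt.Println("during commit:", s.inspect("abc123"))

            time.Sleep(100 * time.Millisecond) // inspect again after the commit completed
            fmt.Println("after commit: ", s.inspect("abc123"))
    }

If the real failure follows this shape, a later inspection of the same image should succeed, which would match the transient nature of the CI failure.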
As https://github.com/kubernetes/kubernetes/pull/94151 is now closed, I suggest we close this one as well.
I am having the same issue on a 4.6.19 cluster running on ppc64le.

Events:
  Type     Reason         Age                    From     Message
  ----     ------         ----                   ----     -------
  Warning  InspectFailed  9m24s (x2637 over 9h)  kubelet  Failed to inspect image "quay.io/openshift-psap/dcgmproftester-ppc64le:11.2.2-ubi8": rpc error: code = Unknown desc = layer not known
  Warning  Failed         4m14s (x2661 over 9h)  kubelet  Error: ImageInspectError

$ oc describe nodes
NAME   STATUS  ROLES   AGE  VERSION          INTERNAL-IP   EXTERNAL-IP  OS-IMAGE                              KERNEL-VERSION                 CONTAINER-RUNTIME
X.com  Ready   worker  26d  v1.19.0+8d12420  xxxxxxxxxxxx  <none>       Red ....46.82.202102190839-0 (Ootpa)  4.18.0-193.41.1.el8_2.ppc64le  cri-o://1.19.1-7.rhaos4.6.git6377f68.el8
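When this hits a live node, it can help to check whether the layer the image refers to is present in the local layer metadata at all. Below is a hedged Go sketch along those lines, assuming the default overlay storage root under /var/lib/containers/storage (the path and file layout are assumptions about the node's configuration, and the field names in layers.json can vary between containers/storage versions, so it decodes generically).

    package main

    import (
            "encoding/json"
            "fmt"
            "os"
    )

    func main() {
            if len(os.Args) != 2 {
                    fmt.Fprintln(os.Stderr, "usage: checklayer <layer-id>")
                    os.Exit(1)
            }
            wanted := os.Args[1]

            // Assumed default location of the overlay driver's layer metadata.
            data, err := os.ReadFile("/var/lib/containers/storage/overlay-layers/layers.json")
            if err != nil {
                    fmt.Fprintln(os.Stderr, "cannot read layer metadata:", err)
                    os.Exit(1)
            }

            // Decode generically so we don't depend on exact field names.
            var layers []map[string]interface{}
            if err := json.Unmarshal(data, &layers); err != nil {
                    fmt.Fprintln(os.Stderr, "cannot parse layer metadata:", err)
                    os.Exit(1)
            }

            for _, l := range layers {
                    if id, _ := l["id"].(string); id == wanted {
                            fmt.Printf("layer %s found, record: %v\n", wanted, l)
                            return
                    }
            }
            fmt.Printf("layer %s not known to the local store\n", wanted)
    }

A missing record would line up with the "layer not known" message in the events above; a record without a size would line up with the getSize() failure from comment #1.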
Zvonko, can we get the CRI-O logs from nodes hitting this?
Sascha, can you take a look?
Yes, we're currently reproducing. Will get you the logs ASAP.
(In reply to Zvonko Kosic from comment #9)
> Yes, we're currently reproducing. Will get you the logs ASAP.

Hey, may I ask you to provide an update on this one? If there is no reproducer, we may close this issue and reopen it if it occurs again.
I'm closing this one as not reproducible for now.