1474987 – Kubelet fails to check if diskspace is available

Keywords:
Status:	CLOSED DEFERRED
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	cri-o
Sub Component:
Version:	rawhide
Hardware:	x86_64
OS:	Linux
Priority:	unspecified
Severity:	high
Target Milestone:	---
Assignee:	Mrunal Patel
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2017-07-25 18:58 UTC by Sebastian Jug
Modified:	2017-07-26 14:17 UTC
CC List:	5 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-07-26 14:17:33 UTC
Type:	Bug
Embargoed:
Dependent Products:

Attachments
journalctl output (204.52 KB, text/x-vhdl) 2017-07-25 18:58 UTC, Sebastian Jug

Description Sebastian Jug 2017-07-25 18:58:00 UTC

Created attachment 1304409
journalctl output

Description of problem:
I am starting preliminary perf-scale testing on CRI-O. I have a 4-node cluster with 1 master and 3 nodes, all using CRI-O on RHEL (not Fedora).

The test I am running is the nodeVertical test, which creates a large number of pods (https://github.com/openshift/svt/blob/master/openshift_scalability/nodeVertical.sh).

After ~118 pods are created, no further pods can be scheduled.

The relevant error seems to be:
Jul 25 14:11:36 atomic-openshift-node[56515]: E0725 14:11:36.306490   56515 kubelet.go:1666] Failed to check if disk space is available for the runtime: failed to get fs info for "runtime": ImagesFsInfo: unknown runtime: remote

Version-Release number of selected component (if applicable):
cri-o-1.0.0.alpha.0-1.git91977d3.el7
containernetworking-plugins-0.5.2-2.el7

How reproducible:
Always


Steps to Reproduce:
1. Have an OpenShift cluster using CRI-O.
2. Create a large number of pods (node vertical test).
3. After ~118 pods, no further pods are schedulable.


Actual results:
Pods cannot be scheduled


Expected results:
Pods continue to be created and scheduled successfully


Additional info:
CRI-O kubelet arguments match those found here: https://raw.githubusercontent.com/ashcrow/openshift-ansible/571de035357e6bd219ce62e66dc786fbdedc9b9c/roles/openshift_node/templates/node.yaml.v1.j2


journal log attached
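
When triaging a journal like the one attached, it can help to see which source locations emit the most error-level messages before deciding which one is relevant. The following is a minimal sketch, not part of the original report. It assumes the kubelet lines follow the klog-style header seen in the error above (severity letter plus MMDD, time, PID, file:line, then the message). The names parse_klog and count_error_sources are hypothetical helpers introduced only for illustration.

```python
import re
from collections import Counter

# Assumed klog-style header, as seen in the quoted error line:
#   E0725 14:11:36.306490   56515 kubelet.go:1666] message...
KLOG_RE = re.compile(
    r"(?P<severity>[IWEF])(?P<date>\d{4}) "
    r"(?P<time>\d{2}:\d{2}:\d{2}\.\d+)\s+"
    r"(?P<pid>\d+) "
    r"(?P<source>[^\s:]+:\d+)\] "
    r"(?P<message>.*)"
)


def parse_klog(line):
    """Return the header fields of a klog-style line as a dict, or None."""
    match = KLOG_RE.search(line)
    return match.groupdict() if match else None


def count_error_sources(lines):
    """Count error-level (E) messages per source file:line."""
    counts = Counter()
    for line in lines:
        record = parse_klog(line)
        if record and record["severity"] == "E":
            counts[record["source"]] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage: journalctl -u <unit> | python this_script.py
    for source, n in count_error_sources(sys.stdin).most_common(10):
        print(n, source)
```

A frequent error is not necessarily the blocking one, so the counts are only a starting point for reading the log.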

Comment 1 Antonio Murdaca 2017-07-25 20:02:38 UTC

That log message isn't the cause here; I suspect something else is going on. I'll have a look at the logs soon.

Comment 4 Sebastian Jug 2017-07-26 14:17:33 UTC

This was an OCP issue; it is now working as expected with 3.6.171.

There are plenty of errors in the logs, including the ones posted in this issue. However, they do not seem to be blockers, as Antonio stated.

Closing.