Bug 1395183 - Unable to create pods
Summary: Unable to create pods
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 3.4.0
Hardware: x86_64
OS: Linux
Priority: medium
Severity: medium
Target Milestone: ---
Assignee: Dan Williams
QA Contact: Meng Bo
URL:
Whiteboard: aos-scalability-34
Depends On:
Blocks:
 
Reported: 2016-11-15 11:09 UTC by Jiří Mencák
Modified: 2017-03-08 18:43 UTC
CC List: 9 users

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-01-18 12:55:05 UTC
Target Upstream Version:
Embargoed:


Attachments
node_log (490.38 KB, text/plain)
2016-12-08 09:36 UTC, Meng Bo


Links
System ID Private Priority Status Summary Last Updated
Origin (Github) 11964 0 None None None 2016-11-18 13:35:26 UTC
Red Hat Product Errata RHBA-2017:0066 0 normal SHIPPED_LIVE Red Hat OpenShift Container Platform 3.4 RPM Release Advisory 2017-01-18 17:23:26 UTC

Description Jiří Mencák 2016-11-15 11:09:13 UTC
Description of problem:
Unable to start simple hello-openshift pods.

Version-Release number of selected component (if applicable):
$ oc version
oc v3.4.0.23+24b1a58
kubernetes v1.4.0+776c994
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://ip-172-31-42-69.us-west-2.compute.internal:8443
openshift v3.4.0.23+24b1a58
kubernetes v1.4.0+776c994

How reproducible:
This just happened during testing at a larger scale, so I haven't tried to reproduce it again.  I'll describe the steps I took below.

Steps to Reproduce:
1. Created 1600 hello-openshift pod-service-routes (https://github.com/jmencak/projects/tree/master/haproxy/apps/hello-openshift) across 16 different projects.
2. Tested the router (HAProxy) with varying amounts of load (not sure if this is necessary to reproduce) until HAProxy's reloads started failing.
3. Deleted all 16 projects (verified that all projects were deleted).
4. EC2 VM shutdown.
5. EC2 VM start
6. Tried to create a simple hello-openshift pod:
 oc create -f /root/origin/examples/hello-openshift/hello-pod.json

Actual results:
$ oc get pods
NAME                      READY     STATUS              RESTARTS   AGE
hello-openshift           0/1       ContainerCreating   0          18m

$ oc get ev # selected hopefully relevant parts
4s        4s        1         hello-openshift   Pod                 Warning   FailedSync   {kubelet ip-172-31-42-77.us-west-2.compute.internal}   Error syncing pod, skipping: failed to "SetupNetwork" for "hello-openshift_hello-openshift-1" with SetupNetworkError: "Failed to setup network for pod \"hello-openshift_hello-openshift-1(6e56f898-ab20-11e6-86dd-02612c97b1c7)\" using network plugins \"cni\": CNI request failed with status 400: 'failed to run IPAM for 672a26e711cd11022b0f843d2e898c079d43cae1d270e07e586c2731033ebe0e: failed to run CNI IPAM ADD: no IP addresses available in network: openshift-sdn\n'; Skipping pod"

Expected results (from a different, live cluster):
$ oc get pods
NAME              READY     STATUS    RESTARTS   AGE
hello-openshift   1/1       Running   0          2s

Additional info:
Restarting atomic-openshift-master.service and atomic-openshift-node.service, and even restarting the whole cluster, did not help.

These are the only pods on the failed 10-node cluster:
root@ip-172-31-42-69: ~/projects/haproxy/apps/hello-openshift # oc get pods --all-namespaces
NAMESPACE           NAME                       READY     STATUS              RESTARTS   AGE
default             docker-registry-2-ynzcn    1/1       Running             6          5d
default             registry-console-1-nevem   1/1       Running             5          5d
default             router-1-rcson             1/1       Running             26         5d
hello-openshift-1   hello-openshift            0/1       ContainerCreating   0          22m

Comment 2 Dan Williams 2016-11-15 19:13:11 UTC
Can we get node logs from the node that ran out of addresses?

Comment 4 Dan Williams 2016-11-16 17:06:09 UTC
(In reply to jmencak from comment #0)
> Steps to Reproduce:
> 1. Created 1600 hello-openshift pod-service-routes
> (https://github.com/jmencak/projects/tree/master/haproxy/apps/hello-
> openshift) across 16 different projects.
> 2. Tested the router (HAProxy) with varying amounts of load (not sure if this
> is necessary to reproduce) until HAProxy's reloads started failing.
> 3. Deleted all 16 projects (verified that all projects were deleted).

How long did you wait between steps 3 and 4?

> 4. EC2 VM shutdown.
> 5. EC2 VM start

Also, on a node that has problems, can you report the contents of the /var/lib/cni/networks/openshift-sdn/ directory?  No need to tar it up or anything, just an 'ls' would be good.
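
For reference, with the CNI host-local IPAM backend that directory normally holds one file per reserved IP, named after the address, with the owning container's ID inside. A rough illustration (file names and contents below are made up, not taken from this cluster):

# Illustrative only -- one reservation file per allocated IP
$ ls /var/lib/cni/networks/openshift-sdn/
10.128.0.3  10.128.0.4  10.128.0.5
$ cat /var/lib/cni/networks/openshift-sdn/10.128.0.3
<container ID that owns the reservation>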

Next, before you shut the VM down, can you do a quick 'journalctl -b -u openshift-node > /tmp/node.log', grab the node log, and get it to me somehow?

Comment 5 Jiří Mencák 2016-11-16 18:31:13 UTC
Not sure about the time between 3) and 4), but probably not very long because of EC2 costs.

http://ekuric.usersys.redhat.com/jmencak/BZ1395183/openshift-sdn-2016116.txt

root@ip-172-31-42-75: ~ # journalctl -b -u openshift-node
-- No entries --
root@ip-172-31-42-75: ~ #

Comment 6 Jiří Mencák 2016-11-17 10:17:05 UTC
Broke yet another EC2 cluster.

$ oc version
oc v3.4.0.26+f7e109e
kubernetes v1.4.0+776c994
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://ip-172-31-2-138.us-west-2.compute.internal:8443
openshift v3.4.0.26+f7e109e
kubernetes v1.4.0+776c994

This time I noticed that I ran out of disk space before hitting this issue.  No reboots were involved.

$ oc get ev
23m       23m       1         hello-openshift-4-bxnea   Pod                 Warning   FailedSync   {kubelet ip-172-31-2-147.us-west-2.compute.internal}   Error syncing pod, skipping: failed to "SetupNetwork" for "hello-openshift-4-bxnea_hello-openshift-128" with SetupNetworkError: "Failed to setup network for pod \"hello-openshift-4-bxnea_hello-openshift-128(4ceb366f-acab-11e6-8c3b-0253c164b30f)\" using network plugins \"cni\": CNI request failed with status 400: 'failed to run IPAM for 8aa9145f6aeeee60ea0e7274e6d3d37e41bcee6d5c6b932a0a83d957baefac56: failed to run CNI IPAM ADD: no IP addresses available in network: openshift-sdn\n'; Skipping pod"

Not ruling out that the same thing (running out of disk space) happened in the previous case as well.

Comment 7 Ben Bennett 2016-11-17 15:41:52 UTC
(In reply to jmencak from comment #5)
> root@ip-172-31-42-75: ~ # journalctl -b -u openshift-node

Can you do: journalctl -b -u atomic-openshift-node

Comment 8 Dan Williams 2016-11-18 00:48:26 UTC
Interim fix for openshift-sdn here: https://github.com/openshift/origin/pull/11964

Real fix for upstream kubernetes: https://github.com/kubernetes/kubernetes/pull/37036

Comment 11 Jiří Mencák 2016-11-18 15:20:23 UTC
root@ip-172-31-42-75: ~ # journalctl -b -u atomic-openshift-node > /tmp/atomic-openshift-node-20161118.log

Please find the requested log here:
http://ekuric.usersys.redhat.com/jmencak/BZ1395183/atomic-openshift-node-20161118.log

Comment 12 Ben Bennett 2016-11-21 17:53:38 UTC
Cherry-pick to 1.4: https://github.com/openshift/origin/pull/11983

Comment 13 Jiří Mencák 2016-11-23 11:04:29 UTC
FWIW, the same issue on another cluster.

$ oc version
oc v3.4.0.28+dfe3a66
kubernetes v1.4.0+776c994
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://ip-172-31-13-129.us-west-2.compute.internal:8443
openshift v3.4.0.28+dfe3a66
kubernetes v1.4.0+776c994

This time without running out of disk space or any reboots.

25m       25m       1         nginx-1-ocsdv   Pod                 Warning   FailedSync   {kubelet ip-172-31-13-134.us-west-2.compute.internal}   Error syncing pod, skipping: failed to "SetupNetwork" for "nginx-1-ocsdv_nginx3" with SetupNetworkError: "Failed to setup network for pod \"nginx-1-ocsdv_nginx3(b79b4358-b168-11e6-bd8a-02153dcdaf69)\" using network plugins \"cni\": CNI request failed with status 400: 'failed to run IPAM for a7f88d071467875f2b70a0ee943c42162142c0c6f9b07e236ae6092d1350f20c: failed to run CNI IPAM ADD: no IP addresses available in network: openshift-sdn\n'; Skipping pod"

Comment 15 Ben Bennett 2016-11-23 13:19:20 UTC
It finally made it through the merge queue and landed in the 1.4 branch on 2016-11-22 at 7:09 PM EST.  I don't know when that will get into OSE 3.4 or what build it will be in.

Comment 16 Troy Dawson 2016-11-23 21:39:02 UTC
This has been merged into OCP and is in OCP v3.4.0.29 or newer.

Comment 20 Meng Bo 2016-12-08 09:36:18 UTC
Created attachment 1229405 [details]
node_log

Tested this with the following steps on OCP 3.4.0.32:
1. Set up an env with 1 master and 1 node (set the host subnet length to 8)
2. Create one pod on the env
3. Delete the pod above
4. Generate the IP files under /var/lib/cni/networks/openshift-sdn manually
$ for i in {1..254} ; do echo ef0652fdc8ed9e1239ece47f4e38bd1ffb55303c7abd663d2376f29b59fc7f66 > 10.128.0.$i ; done
5. Try to create another pod
6. Check the node log

Result:
5. Pod cannot start due to 
  4m    4m      1       {kubelet ose-node1.bmeng.local}         Warning FailedSync      Error syncing pod, skipping: failed to "SetupNetwork" for "caddy-docker_bmengp1" with SetupNetworkError: "Failed to setup network for pod \"caddy-docker_bmengp1(32292159-bd21-11e6-940f-525400dd3698)\" using network plugins \"cni\": CNI request failed with status 400: 'failed to run IPAM for 6a938e84d63fb13ae295c8431b070e1949bdeb7641f9631b38857ff7cfe1c144: failed to run CNI IPAM ADD: no IP addresses available in network: openshift-sdn\n'; Skipping pod"
6. node log attached.

Comment 21 Meng Bo 2016-12-09 02:20:50 UTC
Sorry, I forgot to change the status in the above comment.

Comment 22 Dan Williams 2016-12-09 16:00:44 UTC
This error would be expected once, and then the next time kubelet tries to recreate the pod, it should succeed.  The IPAM garbage collection runs when IPAM fails, so after that warning I think if you wait a few seconds for the kubelet backoff retry, it should succeed on the second attempt.
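
(Illustrative commands, not from the original comment -- one way to watch for that retry, using the pod and namespace names from comment #20:)

$ oc get pod caddy-docker -n bmengp1 -w      # should go from ContainerCreating to Running on the retry
$ oc get events -n bmengp1 | grep -i IPAM    # the IPAM failure should only show up for the first attempt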

Comment 23 Dan Williams 2016-12-09 16:01:26 UTC
After the first failure I'd expect all the stale IPs in /var/lib/cni/networks/openshift-sdn/ to be cleaned up though.

Comment 24 Dan Williams 2016-12-10 00:21:17 UTC
The failure turned out to be that when echoing random container IDs into the files in /var/lib/cni/networks/openshift-sdn, echo puts a newline at the end. The host-local backend does exact matching of the file contents against the container ID, so no matches were found in the IP reservation files when the containers were garbage collected, and thus no files were removed.
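
A quick way to see the difference with a placeholder value (not a real container ID from this bug):

# echo appends \n, so the stored value never exactly matches the bare container ID
$ echo deadbeef | od -c | head -1
0000000   d   e   a   d   b   e   e   f  \n
$ printf deadbeef | od -c | head -1
0000000   d   e   a   d   b   e   e   f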

Arguably container IDs shouldn't contain newlines or whitespace (at the very least, no newlines), and while this is a pretty rare edge case, I've submitted this PR for CNI:

https://github.com/containernetworking/cni/pull/341

Comment 25 Meng Bo 2016-12-10 05:00:36 UTC
Thanks. That works.

Used `printf` instead of `echo` and repeated the steps in comment #20.
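
I.e., step 4 became something like this (a sketch; printf writes the ID without the trailing newline):

$ for i in {1..254} ; do printf ef0652fdc8ed9e1239ece47f4e38bd1ffb55303c7abd663d2376f29b59fc7f66 > 10.128.0.$i ; done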

The manually generated IP files were deleted when the new pod was created.

Marking the bug as verified.

Comment 27 errata-xmlrpc 2017-01-18 12:55:05 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:0066

