Bug 1451466 - cni failed on status hook - Unexpected command output nsenter: cannot open : No such file or directory

Keywords:
Status:	CLOSED DUPLICATE of bug 1434950
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Networking
Sub Component:
Version:	3.6.0
Hardware:	x86_64
OS:	Linux
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Dan Williams
QA Contact:	Meng Bo
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2017-05-16 17:41 UTC by Mike Fiedler
Modified:	2017-06-09 20:34 UTC
CC List:	5 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-06-09 20:34:40 UTC
Target Upstream Version:
Embargoed:

Description Mike Fiedler 2017-05-16 17:41:39 UTC

Description of problem:

Seen when running the same type of test as described in bug 1451110 (https://bugzilla.redhat.com/show_bug.cgi?id=1451110).

However, this message appears frequently even when pods are not having problems. While creating 125 pods on a node, this message occurred 901 times:

May 16 13:29:00 svt-n-2-67 atomic-openshift-node: W0516 13:29:00.278604   76000 docker_sandbox.go:263] NetworkPlugin cni failed on the status hook for pod "deploymentconfig2v6-1-796rm_svt-2-5": Unexpected command output nsenter: cannot open : No such file or directory
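
For anyone wanting to reproduce the count, here is a minimal sketch of one way to tally these warnings per pod from the node log. It is not the method used to get the 901 figure above. The regular expression is derived only from the log line quoted in this report, and the function name count_status_hook_warnings is hypothetical.

```python
import re
from collections import Counter

# Pattern taken from the warning quoted in this report; the pod name is
# whatever appears between the double quotes after "for pod".
STATUS_HOOK_RE = re.compile(
    r'NetworkPlugin cni failed on the status hook for pod "([^"]+)"'
)


def count_status_hook_warnings(lines):
    """Return (total warning count, Counter of warnings keyed by pod name)."""
    per_pod = Counter()
    for line in lines:
        match = STATUS_HOOK_RE.search(line)
        if match:
            per_pod[match.group(1)] += 1
    return sum(per_pod.values()), per_pod
```

The function accepts any iterable of log lines, for example an open file object for a saved copy of the node log. Calling per_pod.most_common(10) on the result shows whether the warnings cluster on a few pods or are spread evenly.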


Is this log spam or a real problem?


Version-Release number of selected component (if applicable): 3.6.74


How reproducible:  Always when creating a large number of pods


Steps to Reproduce:
1. Run cluster-loader (https://github.com/openshift/svt/tree/master/openshift_scalability) with the configuration below on a cluster with 2 schedulable nodes and max-pods set to at least 125.



Actual results:

Many occurrences of:

May 16 13:29:00 svt-n-2-67 atomic-openshift-node: W0516 13:29:00.278604   76000 docker_sandbox.go:263] NetworkPlugin cni failed on the status hook for pod "deploymentconfig2v6-1-796rm_svt-2-5": Unexpected command output nsenter: cannot open : No such file or directory


Expected results:


Additional info:

projects:
  - num: 10
    basename: svt-2-
    tuning: default
    templates:
      -
        num: 1
        file: ./content/build-config-template.json
      -
        num: 1
        file: ./content/build-template.json
      -
        num: 1
        file: ./content/image-stream-template.json
      -
        num: 5
        file: ./content/deployment-config-1rep-pause-template.json
        parameters:
          -
            ENV_VALUE: "asodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij12"
      -
        num: 10
        file: ./content/deployment-config-2rep-pause-template.json
        parameters:
          -
            ENV_VALUE: "asodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij0emc2oed2ed2ed2e2easodfn209e8j0eij12"
      -
        num: 2
        file: ./content/ssh-secret-template.json
      -
        num: 1
        file: ./content/route-template.json
      # rcs and services are implemented in deployments.
tuningsets:
  - name: default
    templates:
      stepping:
        stepsize: 3
        pause: 3 s
      rate_limit:
        delay: 250 ms
quotas:
  - name: default
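
As a rough cross-check of the "125 pods on a node" figure, the configuration above can be turned into a pod count. This is only a sketch under stated assumptions: the replica counts (1 and 2) are inferred from the template file names ("1rep", "2rep") rather than from the template contents, pods are assumed to spread evenly across the 2 schedulable nodes, and build pods are ignored.

```python
# Values read from the cluster-loader configuration above.
PROJECTS = 10
SCHEDULABLE_NODES = 2

# (number of deployment configs per project, replicas per config).
# ASSUMPTION: replica counts are inferred from the template file names
# "deployment-config-1rep-..." and "deployment-config-2rep-...".
DEPLOYMENT_CONFIGS = [(5, 1), (10, 2)]


def expected_pods_per_node(projects, nodes, deployment_configs):
    """Estimate pods per node, assuming an even spread across nodes."""
    pods_per_project = sum(count * replicas for count, replicas in deployment_configs)
    return projects * pods_per_project / nodes


print(expected_pods_per_node(PROJECTS, SCHEDULABLE_NODES, DEPLOYMENT_CONFIGS))
```

Under these assumptions each project yields 5*1 + 10*2 = 25 pods, so 10 projects give 250 pods, or 125 per node, which is consistent with the max-pods setting in the reproduction steps.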

Comment 2 Dan Williams 2017-05-30 20:27:16 UTC

Is there any chance we can reproduce this with nodes running --loglevel=5?  The default loglevel of 2 just doesn't give much network-related info.  I can see some veths being created, but nothing interesting after that, due to the logging.

Comment 4 Ben Bennett 2017-06-02 18:28:41 UTC

Logging error only; we'll try to get this next sprint rather than perturbing the code now.

Comment 5 Dan Williams 2017-06-09 20:34:40 UTC

Duping to bug 1434950 as they have the same upstream fix and are basically the same problem.

*** This bug has been marked as a duplicate of bug 1434950 ***
