Bug 1280169 - Can not restart node service on atomic host
Summary: Can not restart node service on atomic host
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 3.1.0
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: medium
Target Milestone: ---
Target Release: ---
Assignee: Scott Dodson
QA Contact: Ma xiaoqiang
URL:
Whiteboard:
Depends On: 1286936
Blocks:
Reported: 2015-11-11 05:16 UTC by Ma xiaoqiang
Modified: 2016-01-27 19:42 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-01-27 19:42:42 UTC
Target Upstream Version:
Embargoed:


Links
System ID: Red Hat Product Errata RHBA-2016:0075
Private: 0
Priority: normal
Status: SHIPPED_LIVE
Summary: Red Hat OpenShift Enterprise atomic-openshift-utils bug fix update
Last Updated: 2016-01-28 00:42:22 UTC

Description Ma xiaoqiang 2015-11-11 05:16:42 UTC
Description of problem:
The node service cannot be restarted on an Atomic host.

Version-Release number of selected component (if applicable):
 https://github.com/sdodson/openshift-ansible/tree/containers

How reproducible:
Always


Steps to Reproduce:

1. Install the environment on an Atomic host.
2. Restart the node service:
# systemctl restart atomic-openshift-node



Actual results:
The node service fails to restart:

Nov 11 12:14:50 openshift-153.lab.eng.nay.redhat.com docker[11622]: Error response from daemon: Cannot destroy container atomic-openshift-node: Could not ki...stroyed
Nov 11 12:14:50 openshift-153.lab.eng.nay.redhat.com docker[11622]: Error: failed to remove containers: [atomic-openshift-node]
Nov 11 12:14:50 openshift-153.lab.eng.nay.redhat.com docker[11629]: Error response from daemon: Conflict. The name "atomic-openshift-node" is already in use...t name.



Expected results:
The node service restarts successfully.

Additional info:
# docker stop atomic-openshift-node
Error response from daemon: Cannot stop container atomic-openshift-node: [2] Container does not exist: container destroyed
Error: failed to stop containers: [atomic-openshift-node]

Comment 5 Scott Dodson 2015-11-30 19:11:59 UTC
Ok, done with the work now.

https://github.com/openshift/origin/pull/6022

rcm-img-docker01.build.eng.bos.redhat.com:5001/openshift3/node 50db2d604ea6

Comment 6 Ma xiaoqiang 2015-12-04 03:22:34 UTC
Checked on puddle [3.1/2015-12-02.3]

# openshift version
openshift v3.1.0.4-3-ga6353c7
kubernetes v1.1.0-origin-1107-g4c8e6f4
etcd 2.1.2

# service atomic-openshift-node restart
Redirecting to /bin/systemctl restart  atomic-openshift-node.service
Job for atomic-openshift-node.service failed because the control process exited with error code. See "systemctl status atomic-openshift-node.service" and "journalctl -xe" for details.


# systemctl status atomic-openshift-node.service
● atomic-openshift-node.service
   Loaded: loaded (/etc/systemd/system/atomic-openshift-node.service; enabled; vendor preset: disabled)
   Active: activating (start-post) (Result: exit-code) since Fri 2015-12-04 11:19:15 CST; 7s ago
  Process: 12298 ExecStop=/usr/bin/docker stop atomic-openshift-node (code=exited, status=1/FAILURE)
  Process: 12313 ExecStart=/usr/bin/docker run --name atomic-openshift-node --rm --privileged --net=host --pid=host -v /:/rootfs:ro -e CONFIG_FILE=${CONFIG_FILE} -e OPTIONS=${OPTIONS} -e HOST=/rootfs -e HOST_ETC=/host-etc -v /var/lib/origin:/var/lib/origin -v /etc/origin/node:/etc/origin/node -v /etc/localtime:/etc/localtime:ro -v /etc/machine-id:/etc/machine-id:ro -v /run:/run -v /sys:/sys:ro -v /usr/bin/docker:/usr/bin/docker:ro -v /var/lib/docker:/var/lib/docker -v /lib/modules:/lib/modules -v /etc/origin/openvswitch:/etc/openvswitch -v /etc/origin/sdn:/etc/openshift-sdn -v /etc/systemd/system:/host-etc/systemd/system openshift3/node (code=exited, status=1/FAILURE)
  Process: 12305 ExecStartPre=/usr/bin/docker rm -f atomic-openshift-node (code=exited, status=1/FAILURE)
 Main PID: 12313 (code=exited, status=1/FAILURE);         : 12314 (sleep)
   Memory: 88.0K
   CGroup: /system.slice/atomic-openshift-node.service
           └─control
             └─12314 /usr/bin/sleep 10

Dec 04 11:19:15 openshift-154.lab.eng.nay.redhat.com systemd[1]: Starting atomic-openshift-node.service...
Dec 04 11:19:17 openshift-154.lab.eng.nay.redhat.com docker[12305]: Error response from daemon: Cannot destroy container atomic-openshift-node: Could not kill running container, cannot remove - [2...er destroyed
Dec 04 11:19:17 openshift-154.lab.eng.nay.redhat.com docker[12305]: Error: failed to remove containers: [atomic-openshift-node]
Dec 04 11:19:17 openshift-154.lab.eng.nay.redhat.com docker[12313]: Error response from daemon: Conflict. The name "atomic-openshift-node" is already in use by container f000050552de. You have to ...e that name.
Dec 04 11:19:17 openshift-154.lab.eng.nay.redhat.com systemd[1]: atomic-openshift-node.service: main process exited, code=exited, status=1/FAILURE
Hint: Some lines were ellipsized, use -l to show in full.

The node service still cannot be restarted.
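
The conflict error in the log above indicates that a container entry still holds the name atomic-openshift-node. A minimal sketch for listing such entries, assuming the docker CLI is on the PATH; the helper name list_named_containers and the DOCKER override variable are hypothetical and introduced only for illustration:

```shell
#!/bin/sh
# Hypothetical override so the docker binary can be swapped out; defaults to "docker".
DOCKER="${DOCKER:-docker}"

# List all containers (running or not) whose name contains the given string.
# "docker ps -a" includes stopped containers; the name filter matches substrings.
list_named_containers() {
    "$DOCKER" ps -a --filter "name=$1"
}
```

Calling `list_named_containers atomic-openshift-node` on the affected host should show whether a leftover container is still registered under that name.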

Comment 7 Scott Dodson 2015-12-08 20:11:35 UTC
This is fixed by ensuring you've updated the node image. I've rebuilt the node image for some other changes, so please make sure you're running this image:


rcm-img-docker01.build.eng.bos.redhat.com:5001/openshift3/node 8923200e3c44

If you're still having trouble, I need to know the image ID you're using.
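
For reference, one way to collect the image ID requested above is sketched below. It assumes the docker CLI is on the PATH and that the image is tagged locally as openshift3/node, as in the ExecStart line of the unit file; the helper name node_image_ids and the DOCKER override variable are hypothetical:

```shell
#!/bin/sh
# Hypothetical override so the docker binary can be swapped out; defaults to "docker".
DOCKER="${DOCKER:-docker}"

# Print the local images for a repository, including the IMAGE ID column.
# The default repository name is an assumption taken from the unit file;
# pass a registry-prefixed name if the image was pulled under one.
node_image_ids() {
    "$DOCKER" images "${1:-openshift3/node}"
}
```

Running `node_image_ids` with no argument lists the local openshift3/node images; the IMAGE ID column can then be compared with the ID quoted in this comment.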

Comment 8 Ma xiaoqiang 2015-12-09 08:03:25 UTC
Checked against 8923200e3c44; the node service restarts successfully.

Comment 10 errata-xmlrpc 2016-01-27 19:42:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2016:0075

