Bug 1300471 - Openshift node Installer fails connecting to master ; but master is running
Summary: Openshift node Installer fails connecting to master ; but master is running
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: OKD
Classification: Red Hat
Component: Installer
Version: 3.x
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: ---
: ---
Assignee: Jason DeTiberus
QA Contact: Ma xiaoqiang
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-01-20 23:45 UTC by Jay Vyas
Modified: 2018-12-02 15:17 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of: 1000581
Environment:
Last Closed: 2016-02-23 17:00:43 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Jay Vyas 2016-01-20 23:45:58 UTC
Description: 

- After running 

sh <(curl -s https://install.openshift.com/ose/)  

on a 2 node, 1 master cluster in EC2, we can see that the ansible task for node startup fails.

- Looking deeper, we can extract the error:

[root@openshift ec2-user]# systemctl status openshift-node.service -o verbose

- And the result is: 

MESSAGE=E0120 18:30:21.606933    5868 reflector.go:209] pkg/kubelet/kubelet.go:182: Failed to watch *api.Service: Get https://ip-172-18-14-218.ec2.internal:8443/api/v1/watch/services?resourceVersion=1912: dial tcp 172.18.14.218:8443: connection refused

To do some sanity checks i confirmed: that this IP is reachable FROM the node... 

[root@openshift ec2-user]#  wget https://ip-172-18-14-218.ec2.internal:8443/api/v1/watch/services?resourceVersion=1912 --no-check-certificate
--2016-01-20 18:44:25--  https://ip-172-18-14-218.ec2.internal:8443/api/v1/watch/services?resourceVersion=1912
Resolving ip-172-18-14-218.ec2.internal (ip-172-18-14-218.ec2.internal)... 172.18.14.218
Connecting to ip-172-18-14-218.ec2.internal (ip-172-18-14-218.ec2.internal)|172.18.14.218|:8443... connected.
WARNING: cannot verify ip-172-18-14-218.ec2.internal's certificate, issued by ‘/CN=openshift-signer@1453327451’:
  Self-signed certificate encountered.
HTTP request sent, awaiting response... 403 Forbidden
2016-01-20 18:44:25 ERROR 403: Forbidden.


So I assume that its related to certificates not being percolated properly to the node that is trying to connect to the apiserver on the master.

DESIRED BEHAVIOUR

Maybe there could be some checks before starting the Nodes in the installer, that the nodes can use certs to properly connect to the master.

Comment 2 Brenton Leanhardt 2016-02-23 17:00:43 UTC
If this is still a problem please let us know.  If this was an origin v3 install you shouldn't be using install.openshift.com/ose now and instead follow https://docs.openshift.org/latest/install_config/install/index.html


Note You need to log in before you can comment on or make changes to this bug.