Bug 1596439 - [starter-ca-central-1] node cannot recover when volume-config.yaml corrupted
Summary: [starter-ca-central-1] node cannot recover when volume-config.yaml corrupted
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Node
Version: 3.10.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Target Release: 3.11.0
Assignee: Seth Jennings
QA Contact: weiwei jiang
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2018-06-29 01:11 UTC by Justin Pierce
Modified: 2018-10-11 07:21 UTC (History)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-10-11 07:20:43 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Product Errata | RHBA-2018:2652 | 0 | None | None | None | 2018-10-11 07:21:19 UTC

Description Justin Pierce 2018-06-29 01:11:51 UTC
Description of problem:
It is not clear when this file was corrupted on the node in question, but once it was, the node was unable to bootstrap itself any longer. When analyzed, the volume-config.yaml file was empty and journal entries for atomic-openshift-node reported the fatal error:

Jun 29 00:55:52 ip-172-31-24-47.ca-central-1.compute.internal atomic-openshift-node[7997]: F0629 00:55:52.167998    7997 server.go:218] Local quota setup failed: expected kind "VolumeConfig" and apiVersion "kubelet.config.openshift.io/v1" for volume config file
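
For triage, a line like the one above can be picked out of the journal mechanically. The following is a minimal sketch, assuming the standard klog/glog header layout (severity letter, mmdd, time, thread id, file:line) that the message above follows. The function names are hypothetical and are not part of any OpenShift tooling.

```python
import re

# klog/glog header: severity letter, mmdd, hh:mm:ss.uuuuuu, thread id,
# source file and line, then "] " and the message text.
KLOG_RE = re.compile(
    r"(?P<severity>[IWEF])(?P<mmdd>\d{4}) "
    r"(?P<time>\d{2}:\d{2}:\d{2}\.\d{6})\s+"
    r"(?P<thread_id>\d+) (?P<source>[^\s:]+):(?P<line>\d+)\] "
    r"(?P<message>.*)"
)


def parse_klog(journal_line):
    """Return the klog fields found in a journal line, or None if absent."""
    match = KLOG_RE.search(journal_line)
    return match.groupdict() if match else None


def is_volume_config_failure(journal_line):
    """True when the line is a fatal 'Local quota setup failed' message."""
    fields = parse_klog(journal_line)
    return (
        fields is not None
        and fields["severity"] == "F"
        and fields["message"].startswith("Local quota setup failed")
    )
```

Feeding each line of the atomic-openshift-node journal through is_volume_config_failure would flag nodes that are stuck on this error.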


Version-Release number of selected component (if applicable):
v3.10.7


Expected result:
If an error or interruption of the sync process can leave this file empty, the node should not treat an empty file as a fatal error.
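
The expected behavior can be illustrated with a short sketch. It is written in Python for illustration only and is not taken from the origin code; the function names are hypothetical, and the line-based lookup of kind and apiVersion stands in for a real YAML parser. The expected kind and apiVersion values are the ones quoted in the error message above.

```python
from pathlib import Path

# Values quoted in the fatal error message from the node journal.
EXPECTED_KIND = "VolumeConfig"
EXPECTED_API_VERSION = "kubelet.config.openshift.io/v1"


def top_level_value(text, key):
    """Return the value of an unindented 'key: value' line, or None."""
    prefix = key + ":"
    for line in text.splitlines():
        if line.startswith(prefix):
            return line[len(prefix):].strip().strip("\"'")
    return None


def load_volume_config(path):
    """Return the config text, or None when the file is empty.

    Raises ValueError only when the file has content of the wrong type.
    """
    text = Path(path).read_text()
    if not text.strip():
        # Empty or whitespace-only file, for example after an interrupted
        # sync: treat it as "no volume config" instead of failing.
        return None
    kind = top_level_value(text, "kind")
    api_version = top_level_value(text, "apiVersion")
    if kind != EXPECTED_KIND or api_version != EXPECTED_API_VERSION:
        raise ValueError(
            'expected kind "%s" and apiVersion "%s" for volume config file'
            % (EXPECTED_KIND, EXPECTED_API_VERSION)
        )
    return text
```

With this shape, a caller can skip local quota setup when load_volume_config returns None and still fail loudly on a file that has content of the wrong kind.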

Comment 1 Seth Jennings 2018-06-29 14:17:56 UTC
The sync pod should keep this file up to date.  Is the configmap key also empty?

Comment 2 Seth Jennings 2018-06-29 14:28:49 UTC
Origin master PR:
https://github.com/openshift/origin/pull/20154

Comment 4 weiwei jiang 2018-08-30 06:45:21 UTC
Checked with 

# oc version 
oc v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://qe-wjiang-master-etcd-1:8443
openshift v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0


Atomic-openshift-node can start with an empty /etc/origin/node/volume-config.yaml now.

Comment 6 errata-xmlrpc 2018-10-11 07:20:43 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2652

