Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1596439

Summary: [starter-ca-central-1] node cannot recover when volume-config.yaml corrupted
Product: OpenShift Container Platform Reporter: Justin Pierce <jupierce>
Component: NodeAssignee: Seth Jennings <sjenning>
Status: CLOSED ERRATA QA Contact: weiwei jiang <wjiang>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.10.0CC: aos-bugs, dma, jokerman, mmccomas
Target Milestone: ---Keywords: TestCaseNeeded
Target Release: 3.11.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
undefined
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-10-11 07:20:43 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Justin Pierce 2018-06-29 01:11:51 UTC
Description of problem:
It is not clear when this file was corrupted on the node in question, but once it was, the node was unable to bootstrap itself any longer. When analyzed, the volume-config.yaml file was empty and journal entries for atomic-openshift-node reported the fatal error:

Jun 29 00:55:52 ip-172-31-24-47.ca-central-1.compute.internal atomic-openshift-node[7997]: F0629 00:55:52.167998    7997 server.go:218] Local quota setup failed: expected kind "VolumeConfig" and apiVersion "kubelet.config.openshift.io/v1" for volume config file


Version-Release number of selected component (if applicable):
v3.10.7


Expected result:
If an error/interruption of the sync process can leave this file empty, it should not be treated as an error.

Comment 1 Seth Jennings 2018-06-29 14:17:56 UTC
The sync pod should keep this file up to date.  Is the configmap key also empty?

Comment 2 Seth Jennings 2018-06-29 14:28:49 UTC
Origin master PR:
https://github.com/openshift/origin/pull/20154

Comment 4 weiwei jiang 2018-08-30 06:45:21 UTC
Checked with 

# oc version 
oc v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://qe-wjiang-master-etcd-1:8443
openshift v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0


Atomic-openshift-node can start with a empty /etc/origin/node/volume-config.yaml now.

Comment 6 errata-xmlrpc 2018-10-11 07:20:43 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2652