Bug 1596439 - [starter-ca-central-1] node cannot recover when volume-config.yaml corrupted
Status: CLOSED ERRATA
Product: OpenShift Container Platform
Classification: Red Hat
Component: Pod
Version: 3.10.0
Hardware: Unspecified Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Target Release: 3.11.0
Assigned To: Seth Jennings
QA Contact: weiwei jiang
Keywords: TestCaseNeeded
Reported: 2018-06-28 21:11 EDT by Justin Pierce
Modified: 2018-10-11 03:21 EDT

Doc Type: No Doc Update
Last Closed: 2018-10-11 03:20:43 EDT
Type: Bug

External Trackers
Red Hat Product Errata RHBA-2018:2652 (last updated 2018-10-11 03:21 EDT)

Description Justin Pierce 2018-06-28 21:11:51 EDT
Description of problem:
It is not clear when this file was corrupted on the node in question, but once it was, the node was unable to bootstrap itself any longer. When analyzed, the volume-config.yaml file was empty and journal entries for atomic-openshift-node reported the fatal error:

Jun 29 00:55:52 ip-172-31-24-47.ca-central-1.compute.internal atomic-openshift-node[7997]: F0629 00:55:52.167998    7997 server.go:218] Local quota setup failed: expected kind "VolumeConfig" and apiVersion "kubelet.config.openshift.io/v1" for volume config file


Version-Release number of selected component (if applicable):
v3.10.7


Expected result:
If an error or interruption of the sync process can leave this file empty, the node should not treat the empty file as a fatal error.
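
Editorial illustration, not part of the original report: the expected behavior can be sketched as "treat a missing or blank volume-config.yaml as if no volume config were supplied, and only validate kind and apiVersion when the file has content". This is a minimal Python sketch under that assumption. It is not the change made in the linked pull request, and the function names `read_volume_config` and `describe` are hypothetical. The path is the one mentioned in comment 4.

```python
from typing import Optional

# Path as given in comment 4 of this bug.
VOLUME_CONFIG_PATH = "/etc/origin/node/volume-config.yaml"


def read_volume_config(path: str = VOLUME_CONFIG_PATH) -> Optional[str]:
    """Return the file's text, or None when the file is missing or blank.

    Hypothetical helper: a blank file (zero bytes or whitespace only) is
    reported the same way as a missing one, so the caller can skip
    validation instead of failing on it.
    """
    try:
        with open(path, "r", encoding="utf-8") as handle:
            text = handle.read()
    except FileNotFoundError:
        return None
    if not text.strip():
        return None
    return text


def describe(path: str = VOLUME_CONFIG_PATH) -> str:
    """Summarize what a tolerant caller would do with the file at `path`."""
    if read_volume_config(path) is None:
        return "no volume config: skip local quota setup"
    return "volume config present: validate kind and apiVersion"
```

Running `describe()` on a node gives a quick way to tell whether the file is in the blank state described above before looking further into the journal.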
Comment 1 Seth Jennings 2018-06-29 10:17:56 EDT
The sync pod should keep this file up to date.  Is the configmap key also empty?
Comment 2 Seth Jennings 2018-06-29 10:28:49 EDT
Origin master PR:
https://github.com/openshift/origin/pull/20154
Comment 4 weiwei jiang 2018-08-30 02:45:21 EDT
Checked with:

# oc version 
oc v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0
features: Basic-Auth GSSAPI Kerberos SPNEGO

Server https://qe-wjiang-master-etcd-1:8443
openshift v3.11.0-0.25.0
kubernetes v1.11.0+d4cacc0


atomic-openshift-node can now start with an empty /etc/origin/node/volume-config.yaml.
Comment 6 errata-xmlrpc 2018-10-11 03:20:43 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2652
