Bug 1460041

Summary: Creating a scheduled job with schedule: "*/0 * * * *" causes various components to hang
Product: OpenShift Container Platform
Component: Node
Version: 3.4.1
Hardware: Unspecified
OS: Unspecified
Status: CLOSED CURRENTRELEASE
Severity: high
Priority: medium
Target Milestone: ---
Target Release: ---
Keywords: Unconfirmed
Reporter: Eric Jones <erjones>
Assignee: Maciej Szulik <maszulik>
QA Contact: DeShuai Ma <dma>
Docs Contact:
CC: aos-bugs, eparis, erjones, jokerman, mmccomas
Last Closed: 2017-06-27 23:22:27 UTC
Type: Bug

Description Eric Jones 2017-06-08 22:03:36 UTC
Description of problem:
A customer found that creating a scheduled job appears to break their OpenShift cluster as a whole.

They first noticed that pods could not pull images from the docker registry. They also reported that certain pods that rely on the internal network could not use it. We also saw some oc commands hang and others freeze the terminal entirely.

We were able to restore everything to a functional state by restarting the atomic-openshift-master-{api,controllers} services.

We therefore attempted to increase the log levels on the atomic-openshift-master-{api,controllers} services and collected logs, which I will attach to this bug shortly.
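
For reference, the schedule in the summary uses a step of 0 in the minute field. In cron syntax `*/N` means "every Nth value", so a step of zero never advances and is not a meaningful schedule. The following is only a minimal sketch of a check that could flag such an expression before a job is created. It makes no claim about how OpenShift itself parses schedules, and `find_zero_steps` is a hypothetical helper name, not part of any OpenShift or Kubernetes tooling.

```python
def find_zero_steps(schedule):
    """Return the names of cron fields that contain a step value of zero.

    Assumes the standard five-field cron layout; this is an illustrative
    check, not the parser used by the cluster.
    """
    names = ["minute", "hour", "day-of-month", "month", "day-of-week"]
    fields = schedule.split()
    if len(fields) != len(names):
        raise ValueError("expected 5 fields, got %d" % len(fields))

    bad = []
    for name, field in zip(names, fields):
        # A field may be a comma-separated list such as "0,*/15".
        for part in field.split(","):
            _base, sep, step = part.partition("/")
            # Flag the field when a step is present and evaluates to zero.
            if sep and step.isdigit() and int(step) == 0:
                bad.append(name)
                break
    return bad


print(find_zero_steps("*/0 * * * *"))  # ['minute']
print(find_zero_steps("*/5 * * * *"))  # []
```

A non-empty result for the customer's schedule would identify the minute field as the one to correct.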

Version-Release number of selected component (if applicable):
openshift 3.4.1.2