Bug 1318497

Summary: Some pods stay in Terminating state after deleting on OpenShift v3
Product: OpenShift Container Platform Reporter: Kenjiro Nakayama <knakayam>
Component: NodeAssignee: Andy Goldstein <agoldste>
Status: CLOSED NOTABUG QA Contact: DeShuai Ma <dma>
Severity: medium Docs Contact:
Priority: medium    
Version: 3.1.0CC: aos-bugs, erich, jokerman, mmccomas
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 1318680 1318681 (view as bug list) Environment:
Last Closed: 2016-03-17 13:41:21 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1318680, 1318681    

Description Kenjiro Nakayama 2016-03-17 05:20:18 UTC
Description of problem:

- some pods get stuck in "Terminating" status. It's even sometimes displayed to the user as "Pending" in the web ui but when looking at it via oc get pods it says "Terminating". An oc describe pods also says "Pending".

Version-Release number of selected component (if applicable):

- v3.1.1.6

How reproducible:

- Unfortunately, the "Terminating" status happens randomly, when deleting the pod.


Actual results:

- Pods remains as "Terminating" status.

Expected results:

- Pods should not remain as "Terminating" status.

Additional info:

- There are some similar issue.
 - https://bugzilla.redhat.com/show_bug.cgi?id=1271198
 - https://bugzilla.redhat.com/show_bug.cgi?id=1274598

- Workaround is that delete the pod with --grace-period=0.

Comment 2 Kenjiro Nakayama 2016-03-17 12:10:56 UTC
Please note that I think there are several things to fix this issue or improve for the trouble shooting.

1. pod should not state as Terminating

  - This is ideal fix.
  - But according to the bz(https://bugzilla.redhat.com/show_bug.cgi?id=1274598) Andy's comment, it might not a bug. Then, please consider below.

2. The pod's state is different from web UI and CLI. While pod shows "Terminating" by CLI, web UI has "Pending" status, according to the user's report.

  - This should be fixed.

3. There are no clue of Terminating state for users.

  - There are no clue in the "oc get event" what happened during this Terminating process. This should be improved.

Comment 3 Andy Goldstein 2016-03-17 12:27:09 UTC
If possible, please file individual bugs for each issue. Thanks!