1600052 – [3.7] Deleted objects from webconsole marked with "foregroundDeletion" are never Garbage Collected

Bug 1600052 - [3.7] Deleted objects from webconsole marked with "foregroundDeletion" are never Garbage Collected

Summary: [3.7] Deleted objects from webconsole marked with "foregroundDeletion" are ne...

Keywords:
Status:	CLOSED NOTABUG
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Master
Sub Component:
Version:	3.7.1
Hardware:	Unspecified
OS:	Unspecified
Priority:	urgent
Severity:	urgent
Target Milestone:	---
Target Release:	3.7.z
Assignee:	Michal Fojtik
QA Contact:	Xingxing Xia
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2018-07-11 10:14 UTC by Alejandro Coma
Modified:	2021-12-10 16:36 UTC (History)
CC List:	16 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2018-10-18 18:48:04 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	openshift openshift-docs issues 10015	0	None	closed	include storage migration requirements in manual upgrade doc	2021-02-04 15:17:13 UTC

Description Alejandro Coma 2018-07-11 10:14:53 UTC

Description of problem:
In an OCP cluster in 3.7.54-1 recently upgraded from 3.5, starting with migration to 3.6.173, deleted objects via web console are marked with:

  finalizers:
  - foregroundDeletion

But are never being garbage collected.

Might be related to https://bugzilla.redhat.com/show_bug.cgi?id=1559987 (CLOSED ERRATA)

Version-Release number of selected component (if applicable):
oadm v3.7.54
kubernetes v1.7.6+a08f5eeb62
openshift v3.7.54
kubernetes v1.7.6+a08f5eeb62

How reproducible:
Any time a resource is deleted from the Web Console, from the CLI works perfectly.

Steps to Reproduce:
1.
2.
3.

Actual results:
Objects remain undeleted.

Expected results:
Objects should be deleted.

Additional info:

Comment 51 Jordan Liggitt 2018-07-27 13:33:22 UTC

The garbage collector maintains a graph of all objects and their ownerReferences

Before processing any deletions that involve inter-object relationships, it must have a complete graph of all resources.

Deletion with foregroundDeletion means "delete objects whose ownerReferences point to this object, then delete this object". The worker responsible for doing that is not run until caches are filled for all object types and the graph is complete.

Persistent failure to list/watch HPA objects prevented that graph from ever being ready.

Comment 53 Jordan Liggitt 2018-08-14 15:47:35 UTC

Closing, issue was due to incorrect upgrade procedure. https://github.com/openshift/openshift-docs/issues/10015 is open to improve the documentation for manual upgrade.

Comment 73 Aaron Ship 2018-10-04 07:40:55 UTC

Need update on the progress of this Bug

Comment 84 David Eads 2018-10-09 12:10:09 UTC

Yes.  As @liggitt pointed out above in the chain, foreground GC requires building a complete graph of all objects before removing the target object.  Because an owner reference can come from any object to any object, all resources must be list/watchable and must be successfully list/watched before any foreground deletion is processed.  Otherwise a reference could be missed and foreground deletion would fail.

Comment 105 Maciej Szulik 2018-10-18 18:48:04 UTC

Closing based on previous comment. To summarize the problem was that customer did not run migration during upgrades which resulted in old objects (in version not any more recognized by the server) to linger in etcd and blocking proper GC cycles. Solution is to downgrade server to previous version and run migration.

Note You need to log in before you can comment on or make changes to this bug.