1379316 – [public_networking_291] Panic error in master log when adding/deleting project repeatedly

Bug 1379316 - [public_networking_291] Panic error in master log when adding/deleting project repeatedly

Summary: [public_networking_291] Panic error in master log when adding/deleting projec...

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	OKD
Classification:	Red Hat
Component:	Pod
Sub Component:
Version:	3.x
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Derek Carr
QA Contact:	zhaozhanqi
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2016-09-26 11:07 UTC by Meng Bo
Modified:	2016-12-09 21:52 UTC (History)
CC List:	8 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2016-12-09 21:52:47 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
master_log_for_panic (351.32 KB, text/plain) 2016-09-26 11:07 UTC, Meng Bo	no flags	Details
View All

Description Meng Bo 2016-09-26 11:07:47 UTC

Created attachment 1204782 [details]
master_log_for_panic

Description of problem:
There is panic error appears in the master log during I was doing the adding/deleting operations on the project.

Version-Release number of selected component (if applicable):
openshift v1.4.0-alpha.0+b3ec794
kubernetes v1.4.0-beta.3+d19513f
etcd 3.0.9
git commit: b3ec794bf74052d928b2927e93a32dba1738fd54

How reproducible:
unknown

Steps to Reproduce:
1. Setup multi-node env
2. Create multiple projects via user
$ for i in {1..100}; do oc new-project userp$i ; done
3. Delete all the projects
$ oc delete project -all
4. Run another adding/deleting project operation 
$ while true; do oc new-project userxx ; oc delete project userxx ; done
5. Watching the master log.

Actual results:
There is panic error appears in the master log.

Expected results:
Should not panic.

Additional info:
Master log and core dump file attached.

Comment 2 Dan Williams 2016-09-28 18:21:13 UTC

This looks more like a bug in upstream Kubernetes than OpenShift.  My best guess is *namespace.DeletionTimestamp is causing the panic becuase the returned namespace object from retryOnConflictError() has been modified...

Filed upstream: https://github.com/kubernetes/kubernetes/issues/33676

Should we close this as UPSTREAM resolution?

Comment 3 Meng Bo 2016-09-29 02:12:29 UTC

Thanks, that is ok for me.

Comment 5 Derek Carr 2016-10-07 03:07:22 UTC

I am curious if this was an HA setup?

Comment 6 Derek Carr 2016-10-07 03:45:36 UTC

Upstream PR:
https://github.com/kubernetes/kubernetes/pull/34298

Comment 7 Meng Bo 2016-10-08 02:34:23 UTC

(In reply to Derek Carr from comment #5)
> I am curious if this was an HA setup?

For my testing, it is not HA env.

Comment 8 Derek Carr 2016-10-27 21:11:20 UTC

Origin PR:
https://github.com/openshift/origin/pull/11632

Comment 9 Derek Carr 2016-11-01 14:22:13 UTC

origin pr merged.

Comment 10 zhaozhanqi 2016-11-03 09:20:54 UTC

Tested this issue on 

# openshift version
openshift v3.4.0.19+346a31d
kubernetes v1.4.0+776c994
etcd 3.1.0-rc.0

using always add/delete namespance step. no panic logs found on master logs. 

verified this bug.

Note You need to log in before you can comment on or make changes to this bug.