Bug 2000451

Summary: 4.9: Fix multi-az zone scheduling e2e for 5 control plane replicas
Product: OpenShift Container Platform Reporter: Lili Cosic <lcosic>
Component: StorageAssignee: Jan Safranek <jsafrane>
Storage sub component: Kubernetes QA Contact: Wei Duan <wduan>
Status: CLOSED ERRATA Docs Contact:
Severity: medium    
Priority: unspecified CC: aos-bugs, aos-storage-staff, jsafrane, mnewby, wduan
Version: 4.9   
Target Milestone: ---   
Target Release: 4.9.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 2000450
: 2004009 (view as bug list) Environment:
Last Closed: 2021-11-29 10:53:41 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2004009    
Bug Blocks: 1989180    

Description Lili Cosic 2021-09-02 07:53:41 UTC
+++ This bug was initially created as a clone of Bug #2000450 +++

+++ This bug was initially created as a clone of Bug #1989180 +++

We run e2e tests on a cluster with masters in 5 different zones, while workers only in 3 zones. Test "should schedule pods in the same zones as statically provisioned PVs" schedules a pod on a master, which then won't run.

We have a "<drop>" PR to skip the test: https://github.com/openshift/kubernetes/pull/870. 
Replace this <drop> patch with a proper fix upstream.


Idea: Use GetReadySchedulableNodes to get list of all schedulable and untainted nodes when considering zones where to provision a volume.

--- Additional comment from Jan Safranek on 2021-08-02 16:04:25 UTC ---

Upstream PR: https://github.com/kubernetes/kubernetes/pull/104077

--- Additional comment from Maru Newby on 2021-08-04 15:12:08 UTC ---



--- Additional comment from Maru Newby on 2021-08-04 15:22:55 UTC ---

Adding proposed test skip pending the fix.

Comment 4 Jan Safranek 2021-11-22 18:34:32 UTC
The PR has merged a long time ago, moving to ON_QA.

Comment 5 Wei Duan 2021-11-23 05:04:08 UTC
Checked latest test CI, verified pass.

Comment 8 errata-xmlrpc 2021-11-29 10:53:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.9.9 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:4834