Bug 1989180

Summary: 4.8 Fix multi-az zone scheduling e2e for 5 control plane replicas
Product: OpenShift Container Platform Reporter: Jan Safranek <jsafrane>
Component: Storage Assignee: Jan Safranek <jsafrane>
Storage sub component: Kubernetes QA Contact: Wei Duan <wduan>
Status: CLOSED WONTFIX Docs Contact:
Severity: medium    
Priority: unspecified CC: aos-bugs, mnewby
Version: 4.7   
Target Milestone: ---   
Target Release: 4.8.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 2000450 (view as bug list) Environment:
Last Closed: 2021-11-22 18:36:08 UTC Type: Bug
Bug Depends On: 2000451, 2004009    
Bug Blocks:    

Description Jan Safranek 2021-08-02 15:41:15 UTC
We run e2e tests on a cluster with masters in 5 different zones, while workers are in only 3 zones. The test "should schedule pods in the same zones as statically provisioned PVs" schedules a pod on a master, where it then won't run.

We have a "<drop>" PR to skip the test: https://github.com/openshift/kubernetes/pull/870. 
Replace this <drop> patch with a proper fix upstream.


Idea: Use GetReadySchedulableNodes to get the list of all schedulable and untainted nodes when choosing the zones in which to provision a volume.

Comment 1 Jan Safranek 2021-08-02 16:04:25 UTC
Upstream PR: https://github.com/kubernetes/kubernetes/pull/104077

Comment 2 Maru Newby 2021-08-04 15:12:08 UTC
*** Bug 1988535 has been marked as a duplicate of this bug. ***

Comment 3 Maru Newby 2021-08-04 15:22:55 UTC
Adding proposed test skip pending the fix.

Comment 4 Jan Safranek 2021-11-22 18:36:08 UTC
The test is fixed in 4.10 and skipped in 4.8 and 4.9.

Comment 5 Jan Safranek 2021-11-22 18:39:35 UTC
Correction: the test will be fixed in 4.10 when the rebase lands, and it is skipped in 4.9.