Bug 1756467
| Summary: | Stage pod scheduling fails in clusters where pod affinity isn't working | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Scott Seago <sseago> |
| Component: | Migration Tooling | Assignee: | Scott Seago <sseago> |
| Status: | CLOSED ERRATA | QA Contact: | Xin jiang <xjiang> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.2.0 | CC: | dymurray, jmatthew, xjiang |
| Target Milestone: | --- | | |
| Target Release: | 4.3.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2020-02-06 20:20:44 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |

Description
Scott Seago
2019-09-27 17:17:59 UTC
The fix is to replace the podAffinity scheduling with explicitly setting NodeName.

Fixes are merged to master (and to the stable branch):
https://github.com/fusor/mig-controller/pull/327
https://github.com/fusor/mig-controller/pull/332

I expect that these will be pulled over to the release-1.0 branch tomorrow.

Verified with the steps below.
1. Prepare 2 clusters, one is ocp3.11, one is ocp4.3
2. Remove the section below from /etc/origin/master/scheduler.json in the ocp3.11 cluster, then restart the controllers with the command "master-restart controllers":
{
    "name": "InterPodAffinityPriority",
    "weight": 1
},
3. Create a new project named "test-affinity"
# oc new-project test-affinity
4. Deploy the StatefulSet application
# oc create -f https://raw.githubusercontent.com/openshift-qe/v3-testfiles/master/statefulset/stable-storage.yaml
# oc get pod
NAME                  READY   STATUS    RESTARTS   AGE
hello-statefulset-0   1/1     Running   0          32m
hello-statefulset-1   1/1     Running   0          31m
# oc get pvc
NAME                      STATUS   VOLUME                                     CAPACITY   ACCESS MODES   STORAGECLASS   AGE
www-hello-statefulset-0   Bound    pvc-52d77dca-3da8-11ea-affc-0e876d566add   1Gi        RWO            gp2            32m
www-hello-statefulset-1   Bound    pvc-6acf33a6-3da8-11ea-affc-0e876d566add   1Gi        RWO            gp2            31m
5. Create a migplan and execute it
6. The application is migrated to ocp4.3 and the pods are running well
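
For anyone repeating step 2, the edit to scheduler.json can be scripted rather than done by hand. The following is only a minimal sketch: it assumes the file follows the usual scheduler Policy layout with a top-level "priorities" list, and the sample policy and the helper name remove_priority are hypothetical, not taken from a real cluster or from the migration tooling. On a real master you would load and rewrite /etc/origin/master/scheduler.json and then restart the controllers as described in step 2.

```python
import json


def remove_priority(policy: dict, name: str) -> dict:
    """Return a copy of a scheduler policy without the named priority entry.

    Assumes the policy has a top-level "priorities" list of
    {"name": ..., "weight": ...} objects.
    """
    updated = dict(policy)
    updated["priorities"] = [
        p for p in policy.get("priorities", []) if p.get("name") != name
    ]
    return updated


# Hypothetical, trimmed-down policy; a real scheduler.json has more entries.
sample_policy = {
    "kind": "Policy",
    "apiVersion": "v1",
    "priorities": [
        {"name": "LeastRequestedPriority", "weight": 1},
        {"name": "InterPodAffinityPriority", "weight": 1},
    ],
}

patched = remove_priority(sample_policy, "InterPodAffinityPriority")
print(json.dumps(patched, indent=4))
```

The helper returns a new dict instead of editing in place, so the original policy can be kept as a backup and restored after the test.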
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHEA-2020:0440
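
For reference, the difference between the two scheduling approaches named in the description (podAffinity versus explicitly setting NodeName) can be illustrated with plain pod manifests. This is a rough sketch and not the mig-controller code from the linked pull requests: the function names, the "app" label, the container, and the image are all hypothetical. Only the manifest field names (spec.affinity.podAffinity, topologyKey, spec.nodeName) are standard Kubernetes pod fields.

```python
# Hypothetical placeholder container; the real stage pod spec is not shown in this report.
STAGE_CONTAINER = {"name": "sleep", "image": "registry.example.com/stage-image"}


def stage_pod_with_affinity(name: str, app_label: str) -> dict:
    """Stage pod that asks the scheduler to co-locate it with pods labelled app=<app_label>.

    This depends on the scheduler's inter-pod affinity support working.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "affinity": {
                "podAffinity": {
                    "requiredDuringSchedulingIgnoredDuringExecution": [
                        {
                            "labelSelector": {"matchLabels": {"app": app_label}},
                            "topologyKey": "kubernetes.io/hostname",
                        }
                    ]
                }
            },
            "containers": [STAGE_CONTAINER],
        },
    }


def stage_pod_with_node_name(name: str, source_pod: dict) -> dict:
    """Stage pod pinned to the node the source pod already runs on.

    Setting spec.nodeName places the pod directly, without relying on
    affinity rules being evaluated by the scheduler.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "nodeName": source_pod["spec"]["nodeName"],
            "containers": [STAGE_CONTAINER],
        },
    }


# Hypothetical source pod, as it might be read back from the cluster.
source = {"metadata": {"name": "hello-statefulset-0"}, "spec": {"nodeName": "node-a"}}
pinned = stage_pod_with_node_name("stage-hello-statefulset-0", source)
```

The second form trades scheduler flexibility for predictability: the stage pod lands on the same node as the source pod even when affinity handling is disabled, as in the scheduler.json change used for verification.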