Bug 1289819
Summary: | [platformmanagement_public_547] When the previous deployment failed because it was cancelled, the subsequent deployment always times out and has the wrong number of replicas | ||
---|---|---|---|
Product: | OKD | Reporter: | zhou ying <yinzhou> |
Component: | Deployments | Assignee: | Dan Mace <dmace> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | zhou ying <yinzhou> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 3.x | CC: | aos-bugs |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2016-05-12 17:15:19 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
zhou ying
2015-12-09 03:19:22 UTC
Zhou, good find. Here's an easier way to reproduce the root issue:

1. Create the deployment config with no hooks and replicas=5.
2. Deploy once successfully.
3. Deploy again, and cancel midway through scaling so that some of the new RC is scaled up and some of the old RC is scaled down.

The latest failed RC will eventually be scaled back down to zero (correct), but the active, last successfully completed RC (version 1) will remain at the replica count it had when the deployer was cancelled. The DC is then updated to match the replica count of RC version 1, and because the DC now has the incorrect replica count, new RCs will use that incorrect value from the DC.

Please keep in mind that after a failed deployment, it can take a while (up to a minute) for the cluster to reconcile and bring the failed deployment back to 0 and the old active deployment back up to the correct scale (assuming you didn't already start another deployment).

I've confirmed on the latest AMI that the issue has been fixed.

[root@ip-172-18-4-35 amd64]# openshift version
openshift v1.1-428-ged29520
kubernetes v1.1.0-origin-1107-g4c8e6f4
etcd 2.1.2
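As a rough illustration of the replica-count drift described in the reproduction steps above, the following toy model may help. It is only a sketch and not OpenShift code: the `RC` and `DC` classes, the function names, and the assumption that two pods had moved when the deployment was cancelled are all hypothetical, chosen to mirror the comment's description.

```python
from dataclasses import dataclass


@dataclass
class RC:
    """Hypothetical stand-in for a replication controller."""
    version: int
    replicas: int


@dataclass
class DC:
    """Hypothetical stand-in for a deployment config."""
    replicas: int


def cancel_midway(old: RC, new: RC, moved: int) -> None:
    # A deployment cancelled after `moved` pods were shifted
    # from the old RC to the new RC.
    new.replicas += moved
    old.replicas -= moved


def reconcile_buggy(dc: DC, old: RC, new: RC) -> None:
    # Behaviour reported in this bug: the failed RC goes back to zero,
    # but the old RC keeps its partial count and the DC is synced to it.
    new.replicas = 0
    dc.replicas = old.replicas


def reconcile_expected(dc: DC, old: RC, new: RC) -> None:
    # Expected behaviour: the failed RC goes back to zero and the old
    # active RC is scaled back up to the count the DC asks for.
    new.replicas = 0
    old.replicas = dc.replicas


def simulate(reconcile, desired: int = 5, moved: int = 2):
    # Returns (DC replicas, RC v1 replicas, RC v2 replicas) after
    # one cancelled deployment followed by reconciliation.
    dc = DC(replicas=desired)
    v1 = RC(version=1, replicas=desired)
    v2 = RC(version=2, replicas=0)
    cancel_midway(v1, v2, moved)
    reconcile(dc, v1, v2)
    return dc.replicas, v1.replicas, v2.replicas


if __name__ == "__main__":
    print("buggy:   ", simulate(reconcile_buggy))
    print("expected:", simulate(reconcile_expected))
```

In the buggy path the DC ends up at the partial count, so any later RC created from the DC would inherit the wrong number of replicas, which matches the symptom in the summary.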