Bug 1801095
Summary: | Failed to force the deployment to use local name lookup | ||||||
---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | zhou ying <yinzhou> | ||||
Component: | openshift-controller-manager | Assignee: | Maciej Szulik <maszulik> | ||||
Status: | CLOSED ERRATA | QA Contact: | zhou ying <yinzhou> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | high | ||||||
Version: | 4.4 | CC: | aivaraslaimikis, aos-bugs, gferrazs, maszulik, mfojtik, rmarasch, tnozicka | ||||
Target Milestone: | --- | Keywords: | Regression | ||||
Target Release: | 4.4.0 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | workloads | ||||||
Fixed In Version: | Doc Type: | No Doc Update | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2020-05-13 21:57:16 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | 1805155 | ||||||
Bug Blocks: | |||||||
Attachments: |
|
Description
zhou ying
2020-02-10 09:16:48 UTC
can you get the yaml for the deployment, replicaset and the pod? Created attachment 1662384 [details]
deployment and rs
The deploy will burst and old pod will be deleted , so it's hard for me to get the pod's yaml. Is it creating RSs in a loop? it looks to me this way by deployment.kubernetes.io/revision: 2879 and from the dump. also the RS has injected image that is already resolved to: image: openshift/deployment-example@sha256:c505b916f7e5143a356ff961f2c21aee40fbd2cd906c1e3feeb8d5e978da284b I thought that image plugin is suppose to resolve on Pod level... Wonder if it is broken on previous releases too The image resolve admission plugin works (opened https://github.com/openshift/origin/pull/24530 to prove it) but `oc set image-lookup deploy/app` sets the annotation only for the template not itself which is what triggers the deepequal hotloop. OC client sets the annotation only on the template for at least 3 years: https://github.com/openshift/oc/blame/master/pkg/cli/set/imagelookup.go#L236-L251 I wonder what else might have changed to make this stop working, I will check on 4.2 to see if it is working there. 1. We don't have the Deployment recreation loop on 4.2. 2. The annotation is created only on template when on 4.2(See comment-5) as it is in the 4.4. 3. It seems like there is no difference between the created Deployments from different revisions during the loop on 4.4(that is weird). 4. There is a constant increase on Deployment's `status.collisionCount` during the loop on 4.4. I have patched oc to also add the annotation on the deployment, together with the template. That does not solved. oc patch deploy/redis -p '{"spec":{"template":{"metadata":{"annotations":{"alpha.image.policy.openshift.io/resolve-names":"*"}}}}}' --type=merge or oc patch deploy/redis -p '{"metadata":{"annotations":{"alpha.image.policy.openshift.io/resolve-names":"*"}},"spec":{"template":{"metadata":{"annotations":{"alpha.image.policy.openshift.io/resolve-names":"*"}}}}}' --type=merge both cause the same problem. This is a problem with mutators in ImagePolicy admission, working on a fix atm. fyi, I have found out that it doesn't matter if the annotation is on the object it self or the template, if the object is being registered for resolve. The real issue here is that the admission was incorrectly skipping updates that were enabling the resolve. More details on the referenced PRs. Also CronJobs weren't registered which we fixed as well. Confirmed with latest payload:4.4.0-0.nightly-2020-02-24-105333 , can't reproduce the issue now. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:0581 |