Bug 1292021
Summary: | [devexp_public_640] Failed to "Cancel deployments in Openshift" via jenkins | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | OKD | Reporter: | wewang <wewang> | ||||||
Component: | Image | Assignee: | Gabe Montero <gmontero> | ||||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Wang Haoran <haowang> | ||||||
Severity: | medium | Docs Contact: | |||||||
Priority: | medium | ||||||||
Version: | 3.x | CC: | aos-bugs, bparees, mmccomas | ||||||
Target Milestone: | --- | ||||||||
Target Release: | --- | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | |||||||||
: | 1294940 1312826 (view as bug list) | Environment: | |||||||
Last Closed: | 2016-05-12 17:12:40 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | |||||||||
Bug Blocks: | 1294940, 1308390, 1312826 | ||||||||
Attachments: |
|
Description
wewang
2015-12-16 09:02:28 UTC
Created attachment 1106336 [details]
Cancel deploy config attachment
Created attachment 1106337 [details]
cancel deploy output log
Yep, an external customer reported this just the other day. I have reproduced using the steps in the description with version 1.0.3 of the plugin that is currently installed in the jenkins images. I then updated my running copy of the jenkins image with the currenty 1.0.4-snapshot, ran through the same steps, and see the deployment cancellation happen successfully. I'll work with wewang in the trello card https://trello.com/c/6gai8wLM/640-8-ci-jenkins-openshift-v3-plugin to sort out the logistics of delivering this fix. v1.0.4 with the fixes for this defect is available for both the centos and rhel versions of the jenkins image. moving to ON_QA for @wenwang 's verification. test env has something wrong ,if it's ok , will verify the bug ,thanks The Jenkins RHEL image seems to have been reverted somehow (assuming I was not hallucinating during my verification that the RHEL image had v1.0.4 back on Dec 18), as what I pulled from ci.dev.openshift now shows v1.0.3 of the plugin being installed. It needs to be v1.0.4 to have the fix for this bug. I've emailed the particulars in this process (Troy Dawson, Scott Dawson, Ben Parees). Most likely though the RHEL image on ci.dev.openshift won't get properly updated until after the holiday. OK, with assists from bparess, sdodson, and tdawson, we've got the jenkins rhel image now updated on ci.dev.openshift.redhat.com:5000 updated with v104 of the plugin. Moving back to QA to attempt to verify the fix. Based on some code analysis on where "DeadlineExceeded" gets set, a pod ending up in "DeadlineExceeded" status appears to be an independent event to the OpenShift cancelling of deployments. In fact, k8s manages to setting of that status whereas the OpenShift deployment controller manages the "cancelling". And it certainly appears to be independent of the DeploymentCancelledAnnotation the jenkins plugin sets on the deployment, same as the oc cli, and not something either can prevent. I would then contend that with the absence of the Java stack trace, which was the true problem captured by this defect, we should mark this defect verified. If you see this consistently, I would expect the same to occur with the `oc` command's version of cancelling the deployment. Assuming that's true, we should open up a new issue with the Deployment team to confirm if "DeadlineExceeded" is occurring erroneously or not. @Gabe Montero, you are right, thx for your detail comments, I confirmed with coworkers and I tried to cancel deployment with oc cli, the status is "DeadlineExceeded", so I will verify the bug: [root@dhcp-128-91 test]# /home/2016/test/oc deploy frontend --latest Started deployment #3 [root@dhcp-128-91 test]# /home/2016/test/oc deploy frontend --cancel Cancelled deployment #3 [root@dhcp-128-91 test]# oc get pods NAME READY STATUS RESTARTS AGE frontend-1-build 0/1 Completed 0 26m frontend-2-deploy 0/1 DeadlineExceeded 0 3m frontend-3-deploy 0/1 DeadlineExceeded 0 50s frontend-3-gvler 1/1 Running 0 47s jenkins-1-cir23 1/1 Running 0 27m @Wen Wang - sound great - thanks!! |