Created attachment 1332610 [details] HOSA pods info Description of problem: it failed to undeploy HOSA only, hawkular-openshift-agent pods always in ContainerCreating status, described pods, error info: MountVolume.SetUp failed for volume "hawkular-openshift-agent-token-f60xz" : secrets "hawkular-openshift-agent-token-f60xz" not found Checked,secret,sa were deleted, but configmap, daemonset,clusterrole were not delete # oc get secret | grep hawkular-openshift-agent # oc get template | grep agent No resources found. # oc get configmap | grep hawkular-openshift-agent hawkular-openshift-agent-configuration 2 18m # oc get daemonset | grep hawkular-openshift-agent hawkular-openshift-agent 2 2 0 2 0 <none> 18m # oc get clusterrole | grep hawkular-openshift-agent hawkular-openshift-agent # oc get sa | grep hawkular-openshift-agent # oc get secret | grep hawkular-openshift-agent # oc get template | grep hawkular-openshift-agent No resources found. Version-Release number of the following components: # rpm -qa | grep openshift-ansible openshift-ansible-docs-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-filter-plugins-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-lookup-plugins-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-roles-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-callback-plugins-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch openshift-ansible-playbooks-3.7.0-0.134.0.git.0.6f43fc3.el7.noarch HOSA image: metrics-hawkular-openshift-agent:v3.7.0-0.135.0.0 How reproducible: Always Steps to Reproduce: 1. Deploy HOSA along with Metrics 2. After all the pods get ready, undeploy HOSA only 3. Actual results: Failed to undeploy HOSA Expected results: Should be undepoly HOSA successfully Additional info: #Inventory file [OSEv3:children] masters etcd [masters] ${MASTER} openshift_public_hostname=${MASTER} [etcd] ${ETCD} openshift_public_hostname=${ETCD} [OSEv3:vars] ansible_ssh_user=root ansible_ssh_private_key_file="~/libra.pem" deployment_type=openshift-enterprise openshift_docker_additional_registries=brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888 # Undeploy HOSA openshift_metrics_install_hawkular_agent=false openshift_metrics_hawkular_hostname=hawkular-metrics.apps.0930-0ar.qe.rhcloud.com openshift_metrics_image_prefix=brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/ openshift_metrics_image_version=v3.7
Created attachment 1332611 [details] Undeploy HOSA ansible log
Tested, HOSA can be uninstalled successfully now, although the hawkular-openshift-agent confimap is kept after uninstallation. oc get configmap | grep hawkular-openshift-agent hawkular-openshift-agent-configuration 2 19m Env: metrics-hawkular-openshift-agent/images/v3.9.0-0.24.0.0 # rpm -qa | grep openshift-ansible openshift-ansible-playbooks-3.9.0-0.23.0.git.0.d53d7ed.el7.noarch openshift-ansible-roles-3.9.0-0.23.0.git.0.d53d7ed.el7.noarch openshift-ansible-3.9.0-0.23.0.git.0.d53d7ed.el7.noarch openshift-ansible-docs-3.9.0-0.23.0.git.0.d53d7ed.el7.noarch
The current workaround for OCP 3.7 would be just to remove ds (oc delete ds hawkular-openshift-agent)?
oh, for OCP 3.7 You can use the following commands to delete HOSA oc project default oc delete configmap hawkular-openshift-agent-configuration oc delete daemonset hawkular-openshift-agent oc delete clusterrole hawkular-openshift-agent oc delete sa hawkular-openshift-agent
Please change it to ON_QA, it is fixed in 3.9, see Comment 4
Because it is not completely fixed for 3.9 and because the changes cover both 3.7 and 3.9, I am going to change the target release of this to 3.9. I will also clone this BZ to create a 3.7 BZ.
PR submitted - https://github.com/openshift/openshift-ansible/pull/7276
Tested with openshift-ansible-3.9.0-0.53.0, HOSA can be undeployed successfully now. # rpm -qa | grep openshift-ansible openshift-ansible-roles-3.9.0-0.53.0.git.0.f8f01ef.el7.noarch openshift-ansible-playbooks-3.9.0-0.53.0.git.0.f8f01ef.el7.noarch openshift-ansible-3.9.0-0.53.0.git.0.f8f01ef.el7.noarch openshift-ansible-docs-3.9.0-0.53.0.git.0.f8f01ef.el7.noarch
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:0489