Bug 1732599

Summary: Tempest tests are not getting triggered for RHOSP13 jobs in redhat DCI
Product: Red Hat OpenStack
Reporter: shaju <shajuvk>
Component: dci
Assignee: François Charlier <fcharlie>
Status: CLOSED NOTABUG
QA Contact: Martin Kopec <mkopec>
Severity: unspecified
Docs Contact:
Priority: unspecified
Version: 13.0 (Queens)
CC: apevec, fcharlie, lhh, shajuvk, slinaber, udesale
Target Milestone: ---
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2020-02-11 19:03:25 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Attachments:
Description: UI screenshot of DCI jobs result page
Flags: none

Description shaju 2019-07-23 20:38:07 UTC
Created attachment 1592960
UI screenshot of DCI jobs result page

Description of problem:

Tempest tests are not getting triggered for RHOSP13 jobs in redhat DCI (screenshot attached)

Reference bug: https://bugzilla.redhat.com/show_bug.cgi?id=1722147 (the issue described above was observed while verifying the patch for bug 1722147)

Version-Release number of selected component (if applicable):


How reproducible:
Always


Steps to Reproduce:
Open https://www.distributed-ci.io/jobs
Select "filter by topic" and choose "RHOSP13" as value.

Actual results:
No Tempest tests are executed

Expected results:
Tempest tests should have been executed

Additional info:
Screenshot attached

Comment 1 François Charlier 2019-07-24 08:51:05 UTC
Hello Shaju,

After reviewing your issue, and because the fix for bug #1722147 doesn't work well with DCI's OpenStack agent,
could you try the following:

- add a line with "tempest_track_resources: false" in your /etc/dci-openstack-agent/settings.yml (or the settings file you're using).

This will disable all the tempest cleanup tasks (tempest won't inventory the initial state of the resources and won't clean up after its run).
In the DCI context this should not be an issue, because the environment is new before running tempest and is not kept for long afterwards.
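
For illustration, the suggested change amounts to a single top-level key in the settings file. This is a minimal sketch: only the `tempest_track_resources` key comes from this comment, and the rest of the file is site-specific and not shown.

```yaml
# /etc/dci-openstack-agent/settings.yml (or whichever settings file the agent reads)
# Existing keys stay unchanged; only this line is added.
tempest_track_resources: false
```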

Please tell us if it works and if it is acceptable for you.

Comment 2 shaju 2019-07-25 18:09:19 UTC
Hi,

I have made this change and restarted dci-openstack-agent.timer. I will update you after the next job is completed.

Thanks,

Comment 3 shaju 2019-07-26 17:32:25 UTC
Hi,
After trying the change suggested above, I still see RHOSP13 jobs failing:

Something went wrong during the running. Review the log at: https://www.distributed-ci.io/jobs/eba1ca01-14df-4c23-959a-54a4226d048b/jobStates
[{'_ansible_no_log': False,
  'changed': False,
  'msg': u'Something went wrong during the running. Review the log at: https://www.distributed-ci.io/jobs/eba1ca01-14df-4c23-959a-54a4226d048b/jobStates'}]

Comment 4 François Charlier 2019-07-26 23:37:00 UTC
If you look at the step where the error happened [1], you'll see that there's an issue with the deployment because introspection fails:

[1] https://www.distributed-ci.io/jobs/eba1ca01-14df-4c23-959a-54a4226d048b/jobStates#52c427c6-41ab-4972-bb56-76d7b951cccd:file0

Here is the relevant excerpt, properly formatted:

    Retrying 4 nodes that failed introspection. Attempt 3 of 3
    Introspection of node 341c63e2-93d9-4d39-b4ad-5d9f0ca69b6c timed out.
    Introspection of node 61367517-5b24-4e68-b004-458507c4a4e4 timed out.
    Introspection of node 3a06408b-7479-453d-ba76-018ebd4d4d9c timed out.
    Introspection of node fa263143-1534-47fd-aaa3-01e1142cafb6 timed out.
    Retry limit reached with 4 nodes still failing introspection

These are the same errors that were occurring from June 20th to July 2nd on your remoteCI.

To be clear, this has nothing to do with the "tempest_track_resources: false" flag, whose effect applies only when Tempest runs.

Comment 5 shaju 2019-08-05 21:29:34 UTC
It looks like the "Deploy undercloud" task was skipped after we tried the changes made to rconfiguration.

The condition rconfiguration == 'RHEL7-CDN' was not met.

Logs:
====
 
Aug 02 18:55:03 rhel-cache.englab.juniper.net ansible-playbook[13131]: TASK [Deploy undercloud] *******************************************************
Aug 02 18:55:03 rhel-cache.englab.juniper.net ansible-playbook[13131]: task path: /etc/dci-openstack-agent/hooks/running.yml:8
Aug 02 18:55:03 rhel-cache.englab.juniper.net ansible-playbook[13131]: skipping: [localhost] => {"changed": false, "skip_reason": "Conditional result was False"}
Aug 02 18:55:03 rhel-cache.englab.juniper.net ansible-playbook[13131]: TASK [Deploy overcloud] ********************************************************
Aug 02 18:55:03 rhel-cache.englab.juniper.net ansible-playbook[13131]: task path: /etc/dci-openstack-agent/hooks/running.yml:15
 
 
Code:
===
- name: Deploy undercloud
  shell: "ansible-playbook -i inventory playbooks/prepare_undercloud.yml"
  args:
    chdir: /var/lib/dci-openstack-agent/ansible
  when: rconfiguration == 'RHEL7-CDN'  # <== this condition was not met
 
- name: Deploy undercloud
  shell: "ansible-playbook -i inventory playbooks/prepare_undercloud.yml -e director_extra_yum_repos=ftp://partners.redhat.com/1c5d859a/juniper-4dbbfc30a829359271b73ab5b6ad965a/RHEL-7.6-Snapshot-3/RHEL-7-server.repo"
  args:
    chdir: /var/lib/dci-openstack-agent/ansible
  when: rconfiguration == 'RHEL7-partner-beta'
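
One way to see why a `when:` condition like the ones above evaluated to false is to print the variable just before the tasks run. This is only a sketch: it assumes a temporary task can be added to /etc/dci-openstack-agent/hooks/running.yml ahead of the "Deploy undercloud" tasks, and the task name is made up for illustration.

```yaml
# Temporary diagnostic task (hypothetical name), placed before the
# "Deploy undercloud" tasks; remove it once the value is confirmed.
- name: Show the value of rconfiguration
  debug:
    # default() keeps the task from failing if the variable is not defined at all
    msg: "rconfiguration is '{{ rconfiguration | default('UNDEFINED') }}'"
```

The comparison in `when:` is an exact, case-sensitive string match, so if the printed value is not exactly 'RHEL7-CDN' the first task is skipped, as in the log above.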

Comment 6 shaju 2020-02-11 19:02:26 UTC
This works now. Closing the bug.

Comment 7 Red Hat Bugzilla 2023-09-14 05:32:15 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days