Bug 1974353

Summary: Getting error while using "--run-validations" during overcloud deployment
Product: Red Hat OpenStack Reporter: Aman Gupta <amagupta>
Component: openstack-tripleo-common Assignee: mathieu bultel <mbultel>
Status: CLOSED CURRENTRELEASE QA Contact: David Rosenfeld <drosenfe>
Severity: medium Docs Contact:
Priority: high    
Version: 16.1 (Train) CC: gchamoul, jjoyce, jpodivin, jschluet, jslagle, mbultel, mburns, slinaber, tvignaud
Target Milestone: z9 Keywords: Triaged
Target Release: 16.1 (Train on RHEL 8.2)   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: openstack-tripleo-common-11.4.1-1.20220926013655.75bd92a.el8ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 2001917 (view as bug list) Environment:
Last Closed: 2024-09-18 19:44:21 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2056602    
Bug Blocks: 2001917    
Attachments:
Description Flags
Templates that were used none

Description Aman Gupta 2021-06-21 13:04:32 UTC
Created attachment 1792650
Templates that were used

Description of problem:
The overcloud deployment fails with an error when "--run-validations" is used.
If we remove the --run-validations option, the deployment is successful.

Version-Release number of selected component (if applicable):
RHOSP 16.1

How reproducible:
Reproduced on multiple test labs

Steps to Reproduce:
1.
2.
3.


The undercloud was installed successfully, but when we run the overcloud deploy command, it fails again with the following error:

(undercloud) [stack@director16 ~]$ openstack overcloud deploy --templates /home/stack/templates/ceph_rendered_templates \
> -r /home/stack/templates/ceph_rendered_templates/roles_data.yaml \
> -e /usr/share/openstack-tripleo-heat-templates/environments/ceph-ansible/ceph-ansible.yaml \
> -e /home/stack/templates/ceph_rendered_templates/environments/network-environment.yaml \
> -e /home/stack/templates/ceph_rendered_templates/environments/network-isolation.yaml \
> -e /home/stack/templates/node-info.yaml \
> -e /home/stack/containers-prepare-parameter.yaml \
> -e /home/stack/inject-trust-anchor-hiera.yaml \
> -e /home/stack/templates/ceph-config-custom.yaml \
> -e /home/stack/templates/timezone.yaml \
> --run-validations
Removing the current plan files
Uploading new plan files
Temporary Swift GET/PUT URL parameters have successfully been updated.
Plan updated.
Processing templates in the directory /tmp/tripleoclient-l8a0sc83/tripleo-heat-templates
WARNING: Following parameter(s) are defined but not currently used in the deployment plan. These parameters may be valid but not in use due to the service or deployment configuration. NetConfigDataLookup
Deploying templates in the directory /tmp/tripleoclient-l8a0sc83/tripleo-heat-templates
Initializing overcloud plan deployment
{'result': 'Failure caused by error in tasks: find_validations\n\n  find_validations [task_ex_id=e9a414a7-4441-4a09-9ed9-008125fbbcea] -> {\'msg\': "Error loading validations from Swift: Container GET failed: https://10.96.11.18:13808/v1/AUTH_118ea8570c184bd2bb5f9c1d17d3fd88/tripleo-validations?format=json 404 Not Found  [first 60 chars of response] b\'<html><h1>Not Found</h1><p>The resource could not be found.<\'"}\n    [action_ex_id=5fb51f96-2fb3-48c9-b848-a0446987ceaa, idx=0]: {\'msg\': "Error loading validations from Swift: Container GET failed: https://10.96.11.18:13808/v1/AUTH_118ea8570c184bd2bb5f9c1d17d3fd88/tripleo-validations?format=json 404 Not Found  [first 60 chars of response] b\'<html><h1>Not Found</h1><p>The resource could not be found.<\'"}\n'}
Exception occured while running the command
Traceback (most recent call last):
  File "/usr/lib/python3.6/site-packages/tripleoclient/command.py", line 32, in run
    super(Command, self).run(parsed_args)
  File "/usr/lib/python3.6/site-packages/osc_lib/command/command.py", line 41, in run
    return super(Command, self).run(parsed_args)
  File "/usr/lib/python3.6/site-packages/cliff/command.py", line 185, in run
    return_code = self.take_action(parsed_args) or 0
  File "/usr/lib/python3.6/site-packages/tripleoclient/v1/overcloud_deploy.py", line 1037, in take_action
    self._deploy_tripleo_heat_templates_tmpdir(stack, parsed_args)
  File "/usr/lib/python3.6/site-packages/tripleoclient/v1/overcloud_deploy.py", line 411, in _deploy_tripleo_heat_templates_tmpdir
    new_tht_root, tht_root)
  File "/usr/lib/python3.6/site-packages/tripleoclient/v1/overcloud_deploy.py", line 528, in _deploy_tripleo_heat_templates
    deployment_options=deployment_options)
  File "/usr/lib/python3.6/site-packages/tripleoclient/v1/overcloud_deploy.py", line 547, in _try_overcloud_deploy_with_compat_yaml
    deployment_options=deployment_options)
  File "/usr/lib/python3.6/site-packages/tripleoclient/v1/overcloud_deploy.py", line 284, in _heat_deploy
    deployment_options=deployment_options)
  File "/usr/lib/python3.6/site-packages/tripleoclient/workflows/deployment.py", line 87, in deploy_and_wait
    deploy(log, clients, **workflow_input)
  File "/usr/lib/python3.6/site-packages/tripleoclient/workflows/deployment.py", line 69, in deploy
    % (payload['status'], wf_name))
ValueError: Unexpected status FAILED for tripleo.deployment.v1.deploy_plan
Unexpected status FAILED for tripleo.deployment.v1.deploy_plan


Actual results:

The deployment fails with an error when "--run-validations" is used.

Expected results:

The overcloud deployment succeeds once "--run-validations" is removed. The customer wants to know the exact reason why it fails when the "--run-validations" option is used.

Additional info:
1. Attaching templates that are being used.
2. Also attaching sos-report from the director node.

Comment 1 Aman Gupta 2021-06-21 13:08:27 UTC
The sos-report for the director node is available in the support-shell under case number 02937461:

sosreport-director16-02937461-2021-05-26-ciqpqyk.tar.xz

Comment 7 mathieu bultel 2021-09-02 07:55:07 UTC
Hi,

I have pushed a fix for this bug.
The main issue is that the validations Swift container was removed earlier in the undercloud post-install tasks, so the --run-validations option has been broken since then.

My patch aims to fix that, but I'm wondering whether we can get rid of Mistral entirely for running validations in stable/train and just call the VF directly, as was done for the undercloud-preflight checks.
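
For triage, the missing container can be confirmed directly from the workflow error text in the description. The following is a minimal sketch, not part of the fix: it assumes the message keeps the "Container GET failed: <url> <status>" shape shown above, and the function name parse_swift_container_failure is hypothetical. It uses only the Python standard library.

```python
import re
from urllib.parse import urlparse

# Assumed message shape, taken from the error in this report:
#   "... Container GET failed: <url> <status> <reason> ..."
_SWIFT_FAILURE = re.compile(
    r"Container GET failed: (?P<url>\S+) (?P<status>\d{3})"
)


def parse_swift_container_failure(message):
    """Return (container_name, http_status), or None if the text does not match.

    Hypothetical helper for illustration only; it is not part of
    tripleoclient or tripleo-common.
    """
    match = _SWIFT_FAILURE.search(message)
    if match is None:
        return None
    # A Swift container URL path looks like /v1/AUTH_<project>/<container>.
    parts = urlparse(match.group("url")).path.strip("/").split("/")
    container = parts[2] if len(parts) >= 3 else None
    return container, int(match.group("status"))


if __name__ == "__main__":
    sample = (
        "Error loading validations from Swift: Container GET failed: "
        "https://10.96.11.18:13808/v1/AUTH_118ea8570c184bd2bb5f9c1d17d3fd88/"
        "tripleo-validations?format=json 404 Not Found  "
        "[first 60 chars of response]"
    )
    print(parse_swift_container_failure(sample))
```

For the error in this report, the helper yields the container name tripleo-validations with status 404, which matches the explanation that the container no longer exists on the undercloud.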

Comment 18 Red Hat Bugzilla 2025-01-30 04:25:08 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days