Bug 1403941 - Introspection fails with 'mistral engine error' on Ocata if an 'ironic node-update' is done before introspection runs
Summary: Introspection fails with 'mistral engine error' on Ocata if an 'ironic node-u...
Keywords:
Status: CLOSED EOL
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-common
Version: 11.0 (Ocata)
Hardware: Unspecified
OS: Unspecified
medium
high
Target Milestone: ---
: ---
Assignee: Dougal Matthews
QA Contact: Dan Yasny
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-12-12 16:40 UTC by Ronelle Landy
Modified: 2018-06-22 12:29 UTC (History)
10 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-06-22 12:29:32 UTC
Target Upstream Version:


Attachments (Terms of Use)
sosreport (12.03 MB, application/x-xz)
2017-12-05 17:07 UTC, Ziv Greenberg
no flags Details
introspection error log (72.38 KB, text/plain)
2017-12-05 17:08 UTC, Ziv Greenberg
no flags Details
ironic's logs (78.64 KB, application/x-gzip)
2017-12-05 17:11 UTC, Ziv Greenberg
no flags Details

Description Ronelle Landy 2016-12-12 16:40:03 UTC
Description of problem:

Reproducible on only one of the CI hardware environments.

This environment requires disk hints and therefore an 'ironic node-update' needs to be run before introspection starts. Introspection fails with:

(from: /var/log/mistral/engine.log)

*****************************************

2016-12-12 12:50:47.806 18707 ERROR mistral.engine.task_handler [-] Failed to run task [error=Invalid input [name=baremetal_introspection.introspect, class=NoneType, unexpected=[u'uuid']], wf=WorkflowExecution {'state_info': None, 'accepted': False, 'output': {}, 'name': u'tripleo.baremetal.v1.introspect', 'workflow_name': u'tripleo.baremetal.v1.introspect', 'state': 'RUNNING', 'created_at': '2016-12-12 12:50:47', 'runtime_context': {'index': 0}, 'updated_at': '2016-12-12 12:50:47', 'spec': {u'input': [u'node_uuids', {u'queue_name': u'tripleo'}], u'tasks': {u'send_message': {u'retry': u'count=5 delay=1', u'name': u'send_message', u'version': '2.0', u'action': u'zaqar.queue_post', u'input': {u'queue_name': u'<% $.queue_name %>', u'messages': {u'body': {u'type': u'tripleo.baremetal.v1.introspect', u'payload': {u'status': u"<% $.get('status', 'SUCCESS') %>", u'message': u"<% $.get('message', '') %>", u'execution': u'<% execution() %>', u'introspected_nodes': u"<% $.get('introspected_nodes', []) %>"}}}}, u'type': 'direct'}, u'set_status_failed_start_introspection': {u'version': '2.0', u'type': 'direct', u'name': u'set_status_failed_start_introspection', u'publish': {u'status': u'FAILED', u'message': u'<% task(start_introspection).result %>', u'introspected_nodes': []}, u'on-success': u'send_message'}, u'wait_for_introspection_to_finish': {u'name': u'wait_for_introspection_to_finish', u'on-success': u'send_message', u'publish': {u'status': u'SUCCESS', u'message': u'Successfully introspected nodes.', u'introspected_nodes': u'<% task(wait_for_introspection_to_finish).result %>'}, u'version': '2.0', u'action': u'baremetal_introspection.wait_for_finish', u'input': {u'uuids': u'<% $.node_uuids %>'}, u'type': 'direct'}, u'start_introspection': {u'with-items': u'uuid in <% $.node_uuids %>', u'name': u'start_introspection', u'on-error': u'set_status_failed_start_introspection', u'on-success': u'wait_for_introspection_to_finish', u'version': '2.0', u'action': u'baremetal_introspection.introspect uuid=<% $.uuid %>', u'type': 'direct'}}, u'description': u'Take a list of nodes and move them through introspection.', u'version': u'2.0', u'name': u'introspect'}, 'workflow_id': u'c8dcb78b-b727-4b8c-ae8b-4adce508695c', 'params': {'index': 0, 'task_execution_id': 'ba7849fa-9fee-4b2a-b51d-c319c234b402'}, 'context': {'openstack': {u'is_target': False, u'project_name': u'admin', u'user_id': u'aed139d0a2dd4603ac1811dd9e1ac588', u'region_name': None, u'roles': [u'admin'], u'auth_uri': u'http://10.8.147.1:5000/v3', u'auth_cacert': None, u'auth_token': u'gAAAAABYTp0lTPzIGrToxqnaTtJR0zJ3ZAiRbH-EvyOOWNMHbtORpRZeplKOihc4DP8v_KW2Z1VfVeU46bpvlRZyHP30bra-pxb2wS80zDjB6cOUgu-3jHh-jaU8OCkfPChqNuZSDPGWLDPusL14wcr1ReptDRoqCpIhdwhFWqkixVpXyB6e_84', u'expires_at': u'2016-12-12T16:50:45.000000Z', u'is_trust_scoped': False, u'service_catalog': {u'is_domain': False, u'methods': [u'password'], u'roles': [{u'id': u'93ac8d5cf04245d3acf90b0818a52eb6', u'name': u'admin'}], u'auth_token': u'gAAAAABYTp0lTPzIGrToxqnaTtJR0zJ3ZAiRbH-EvyOOWNMHbtORpRZeplKOihc4DP8v_KW2Z1VfVeU46bpvlRZyHP30bra-pxb2wS80zDjB6cOUgu-3jHh-jaU8OCkfPChqNuZSDPGWLDPusL14wcr1ReptDRoqCpIhdwhFWqkixVpXyB6e_84', u'expires_at': u'2016-12-12T16:50:45.000000Z', u'project': {u'domain': {u'id': u'default', u'name': u'Default'}, u'id': u'089d583354d74f86815e0b535f97ddeb', u'name': u'admin'}, u'catalog': [{u'endpoints': [{u'url': u'http://10.8.147.1:5050', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'54bc453d3ef643d99b01779608fb312f'}, {u'url': u'http://10.8.147.1:5050', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'62c65def6e4d461990d68e2df7fa1cab'}, {u'url': u'http://10.8.147.1:5050', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'fa6413b1ed3b4caaa3365a5aa2dbbd09'}], u'type': u'baremetal-introspection', u'id': u'0763c34e3e4144899019dd1ca236febf', u'name': u'ironic-inspector'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8080/v1/AUTH_089d583354d74f86815e0b535f97ddeb', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'33555740738f4d089a75ddb3aaa59fb4'}, {u'url': u'http://10.8.147.1:8080/v1/AUTH_089d583354d74f86815e0b535f97ddeb', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'5bccd075ec6a4f399d77fa5f30dcc2f7'}, {u'url': u'http://10.8.147.1:8080', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'9bb1067de7ff47ff8cfcfafdcabf5efe'}], u'type': u'object-store', u'id': u'1673c7744f334fb08cd0131f8e830f70', u'name': u'swift'}, {u'endpoints': [{u'url': u'http://10.8.147.1:9696', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'6cbf467fb5c545fa8526f8be53ef6ccd'}, {u'url': u'http://10.8.147.1:9696', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'8e308179ecbb4532a96ec712208f706b'}, {u'url': u'http://10.8.147.1:9696', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'9b73d8f592d6424380a9422c958196be'}], u'type': u'network', u'id': u'2172c2272b984665906004b557469472', u'name': u'neutron'}, {u'endpoints': [{u'url': u'http://10.8.147.1:6385', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'6df34b7f1eec467bb14b3944c37059a6'}, {u'url': u'http://10.8.147.1:6385', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'c7f02ebb380c4186bde1d008c39513dc'}, {u'url': u'http://10.8.147.1:6385', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'ff3033285f3647b586100ae0db6f605f'}], u'type': u'baremetal', u'id': u'26d65f0d77674b06a637eb6a53bfb7b0', u'name': u'ironic'}, {u'endpoints': [{u'url': u'http://10.8.147.1:35357/v2.0', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'2e050847902a44c99151f4a4f71aab34'}, {u'url': u'http://10.8.147.1:5000/v2.0', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'5e9bb0f0964748ffa43c58cb38d45590'}, {u'url': u'http://10.8.147.1:5000/v2.0', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'a7dec80e790c4ce0b2c3bd417c6c9fc6'}], u'type': u'identity', u'id': u'272b28f1684244c29593c9b63353a501', u'name': u'keystone'}, {u'endpoints': [{u'url': u'ws://10.8.147.1:9000', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'6b588bcef9c842dfa8b3ace7431ce8ff'}, {u'url': u'ws://10.8.147.1:9000', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'a9fc05ebd6874f84b3abe6207eef4f38'}, {u'url': u'ws://10.8.147.1:9000', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'c230dafd206b4ca286c4df79518b0141'}], u'type': u'messaging-websocket', u'id': u'33dd7330c74042b38e5ee3caa6a6fae1', u'name': u'zaqar-websocket'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8888', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'af9f6453f53c44d9a237baa3ee2870fd'}, {u'url': u'http://10.8.147.1:8888', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'b468ef18904248c484fd17ddf7e12e11'}, {u'url': u'http://10.8.147.1:8888', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'df771240c9b742689cc55eb005dc772a'}], u'type': u'messaging', u'id': u'4ab09ba9a3ba4f4dbe80d437a09fe3c7', u'name': u'zaqar'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8989/v2', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'1bcad173369f47d6a258637bd6ecfd9a'}, {u'url': u'http://10.8.147.1:8989/v2', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'453f965aa888481aaf3d3949b28c7e35'}, {u'url': u'http://10.8.147.1:8989/v2', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'eca9aaa384e64246845212b00a4bb3b4'}], u'type': u'workflowv2', u'id': u'63a803930c2c4203b8f088644a6a8d67', u'name': u'mistral'}, {u'endpoints': [{u'url': u'http://10.8.147.1:9292', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'5cf9634030ac4a1084b9c94f24356986'}, {u'url': u'http://10.8.147.1:9292', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'725bd2f217084e98b6a2810c4c04eab9'}, {u'url': u'http://10.8.147.1:9292', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'7f66ff1b1ab142caa1b27532d9888c00'}], u'type': u'image', u'id': u'826a229aa9e74ed1a7de521f90fb0d4b', u'name': u'glance'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8777', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'4659dc3c2a4d4a1792ad5118182a2e13'}, {u'url': u'http://10.8.147.1:8777', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'7e675da024fe4c4ab635dbd1535504c3'}, {u'url': u'http://10.8.147.1:8777', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'caf0218bd21948e985390caae52dc9df'}], u'type': u'metering', u'id': u'93b3af96a1604e9ea2c32e3016e41d24', u'name': u'ceilometer'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8042', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'184dd7cd9da94353b512746a0d11e528'}, {u'url': u'http://10.8.147.1:8042', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'b0c47d4e0f3c4646b36f7a07aad5bc92'}, {u'url': u'http://10.8.147.1:8042', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'e53db6a1b21b4765b1eb66364a699d26'}], u'type': u'alarming', u'id': u'a39fe5df7e1647088a8ec3f9a71cfcec', u'name': u'aodh'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8774/v2.1', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'08653ccb172049ad89ca1926307a0be8'}, {u'url': u'http://10.8.147.1:8774/v2.1', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'207e5abb177a45368abe20a784c27c20'}, {u'url': u'http://10.8.147.1:8774/v2.1', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'3f8748b060744293aad1a2f60d9e5724'}], u'type': u'compute', u'id': u'a67e06c4adde4abfb5ef7f43429bc337', u'name': u'nova'}, {u'endpoints': [{u'url': u'http://10.8.147.1:8004/v1/089d583354d74f86815e0b535f97ddeb', u'interface': u'admin', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'32eeb6b86a7948c49c0eba7447b0436c'}, {u'url': u'http://10.8.147.1:8004/v1/089d583354d74f86815e0b535f97ddeb', u'interface': u'internal', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'4bd0413b6b3e46b0b9450edcb5163889'}, {u'url': u'http://10.8.147.1:8004/v1/089d583354d74f86815e0b535f97ddeb', u'interface': u'public', u'region': u'regionOne', u'region_id': u'regionOne', u'id': u'6ca77385fd1d44128a3e66303b7450ab'}], u'type': u'orchestration', u'id': u'bf112b0a1d4742ef8c53e8303a4d9c59', u'name': u'heat'}], u'version': u'v3', u'user': {u'domain': {u'id': u'default', u'name': u'Default'}, u'id': u'aed139d0a2dd4603ac1811dd9e1ac588', u'name': u'admin', u'password_expires_at': None}, u'audit_ids': [u'PcF0fzdaR8aKqk6hhWqWNg'], u'issued_at': u'2016-12-12T12:50:45.000000Z'}, u'project_id': u'089d583354d74f86815e0b535f97ddeb', u'user_name': u'admin'}, '__execution': {'id': '78de6b67-3ffc-468b-a317-41c69eefde1d'}}, 'input': {u'queue_name': u'dc9e4d0b-7d77-4e77-8f34-cbde2126544d', u'node_uuids': [u'8dc1bedf-b1ca-4738-8721-a222b5458780', u'0d8beb91-7bb3-4051-ae41-8be439be60aa', u'c3ab5301-348a-4f25-8134-44badbb96e58', u'bb836b8f-ec8e-4ebd-b5cd-a2a7dee96451']}, 'scope': 'private', 'project_id': u'089d583354d74f86815e0b535f97ddeb', 'task_execution_id': 'ba7849fa-9fee-4b2a-b51d-c319c234b402', 'id': '78de6b67-3ffc-468b-a317-41c69eefde1d', 'description': 'sub-workflow execution'}, task=start_introspection]:
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/mistral/engine/task_handler.py", line 58, in run_task
    task.run()
  File "/usr/lib/python2.7/site-packages/osprofiler/profiler.py", line 154, in wrapper
    return f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 270, in run
    self._run_new()
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 293, in _run_new
    self._schedule_actions()
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 465, in _schedule_actions
    action.validate_input(input_dict)
  File "/usr/lib/python2.7/site-packages/mistral/engine/actions.py", line 279, in validate_input
    e_utils.validate_input(self.action_def, input_dict)
  File "/usr/lib/python2.7/site-packages/mistral/engine/utils.py", line 58, in validate_input
    msg % tuple(msg_props)
InputException: Invalid input [name=baremetal_introspection.introspect, class=NoneType, unexpected=[u'uuid']]

**********************************

Version-Release number of selected component (if applicable):

[stack@undercloud ~]$ rpm -qa | grep ironic-inspector
openstack-ironic-inspector-4.2.1-0.20161207210344.c106d91.el7.centos.noarch
python-ironic-inspector-client-1.9.0-0.20161116144913.6364bc9.el7.centos.noarch

[stack@undercloud ~]$ rpm -qa | grep mistral
openstack-mistral-executor-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch
puppet-mistral-10.0.0-0.20161209230917.17d6fcf.el7.centos.noarch
python2-mistralclient-2.1.2-0.20161207154141.1dcfc2c.el7.centos.noarch
openstack-mistral-common-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch
openstack-mistral-api-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch
python-mistral-tests-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch
openstack-mistral-engine-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch
python-openstack-mistral-4.0.0-0.20161209115108.317e4f3.el7.centos.noarch


How reproducible:
100% reproducible on specific hardware 

Steps to Reproduce:
1. On a system that requires disk hints or any other node-update, install the undercloud
2. Register the nodes with ironic and then update the nodes
3. Run bulk introspection

Actual results:

Introspection fails with mistral error above

Expected results:
Introspection completes


Additional info:

***** WORKAROUND *****: 

(from dmatthews)

Execute:
>> sudo mistral-db-manage populate

after the node update and before introspection and introspection completes.

Comment 1 Dmitry Tantsur 2016-12-13 09:51:53 UTC
Are you sure that node-update is really related? I can think of no way it could affect Mistral workflows...

Comment 2 Ronelle Landy 2016-12-19 14:57:13 UTC
Dmitry, I don't have any direct proof that node-update is related. 
CI only reproduces the error on one environment (and that environment happens to use disk hints and therefore requires update to run) but it could be some other feature of this environment that triggers the failure.

If you have any other debugging steps to suggest, I can run through those to narrow down the problem.

Comment 3 Dmitry Tantsur 2017-02-03 10:10:27 UTC
Sorry, this issue fall out of my radar. Is it still a problem?

Comment 4 Ronelle Landy 2017-02-13 13:40:16 UTC
I have the suggested workaround:

>> sudo mistral-db-manage populate

in place in the CI code. I would have to take the workaround out to retest.
Hard to tell if this is just a quirk of one particular hardware setup or a more generic issue that shows up.

Comment 5 Bob Fournier 2017-09-28 17:49:08 UTC
As we don't have a way to duplicate this or make any progress on it, and it doesn't appear to be happening over the last year I'd like to close this.  Please reopen if the problem occurs again. Thanks.

Comment 6 Ziv Greenberg 2017-12-05 17:06:16 UTC
Hi,

I'm reopening the following bug as I've encountered the same issue in my environment as well.

My deployment as following:

OSP11 with puddle 2017-12-05.1
Hybrid environment includes virtual nodes and baremetal nodes (virt undercloud, virt 3 controllers, and 1 bm compute).
Deployed with an infrared tool.
FYI, I've applied the proposed workaround in c#4, it worked for me as well.

I'm attaching the sosreport, ironic's logs, and a full introspection execution error log.

Thanks.
Ziv

Comment 7 Ziv Greenberg 2017-12-05 17:07:17 UTC
Created attachment 1363285 [details]
sosreport

Comment 8 Ziv Greenberg 2017-12-05 17:08:42 UTC
Created attachment 1363286 [details]
introspection error log

Comment 9 Ziv Greenberg 2017-12-05 17:11:20 UTC
Created attachment 1363287 [details]
ironic's logs

Comment 10 Dougal Matthews 2017-12-05 17:23:04 UTC
> Invalid input [name=baremetal_introspection.introspect, class=NoneType, unexpected=[u\'uuid\']

Taken from the command output. That suggests that the Mistral action failed to correctly load. This can be worked around usually by running...

> sudo mistral-db-manage populate

After that you may need to restart Mistral too.

Comment 11 Ziv Greenberg 2017-12-05 17:53:06 UTC
OK, that's what I did, I've applied the W/A.
But what is the root cause of the issue, please?

Comment 12 Bob Fournier 2017-12-05 19:37:33 UTC
Ziv - thanks for the logs. 

What I see in the ironic inspector log are those "Could not find node xxx in cache" errors:
ironic/ironic-inspector.log:2017-12-05 10:52:14.115 26036 ERROR ironic_inspector.utils [-] Could not find node c92928f2-dc60-414d-8de3-0b5959970743 in cache
ironic/ironic-inspector.log:2017-12-05 10:52:14.115 26036 DEBUG ironic_inspector.main [-] Returning error to client: Could not find node c92928f2-dc60-414d-8de3-0b5959970743 in cache error_response /usr/lib/python2.7/site-packages/ironic_inspector/main.py:79
ironic/ironic-inspector.log:2017-12-05 10:52:14.123 26036 ERROR ironic_inspector.utils [-] Could not find node cd1dad5c-80f3-4dae-8d97-981afccf6754 in cache
ironic/ironic-inspector.log:2017-12-05 10:52:14.123 26036 DEBUG ironic_inspector.main [-] Returning error to client: Could not find node cd1dad5c-80f3-4dae-8d97-981afccf6754 in cache error_response /usr/lib/python2.7/site-packages/ironic_inspector/main.py:79
ironic/ironic-inspector.log:2017-12-05 10:52:14.129 26036 ERROR ironic_inspector.utils [-] Could not find node 1b4ec862-7244-4d6f-a33e-9eced28ac5b2 in cache
ironic/ironic-inspector.log:2017-12-05 10:52:14.129 26036 DEBUG ironic_inspector.main [-] Returning error to client: Could not find node 1b4ec862-7244-4d6f-a33e-9eced28ac5b2 in cache error_response /usr/lib/python2.7/site-packages/ironic_inspector/main.py:79

This looks similar to https://bugzilla.redhat.com/show_bug.cgi?id=1519232, it has the same "Could not find node xxx in cache" errors.  That one was closed because of a  misconfiguration within the env".  Is it the same issue here?

Comment 13 Ziv Greenberg 2017-12-06 03:46:24 UTC
The reason I closed it, is because in our full BM CI the introspection did pass eventually with no particular issue.

For now, seems like the problem coexists in the hybrid environments only.

Comment 14 Bob Fournier 2017-12-06 17:12:05 UTC
Thanks Ziv. 

I think Mistral is out of sync and using older Ironic node IDs in the introspection bulk command, which is why Ironic generates all of these "Could not find node xxx in cache".  The node IDs in the error message don't match the current Ironic node IDs listed in capture.

[stack@undercloud-0 ~]$ openstack baremetal node list
+--------------------------------------+------------------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name             | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+------------------+---------------+-------------+--------------------+-------------+
| 78ed5643-3efd-4ced-9a8b-595dabc9f169 | computeovsdpdk-0 | None          | power off   | manageable         | False       |
| b3878de0-6504-4f39-ae17-668a25b045c0 | controller-0     | None          | power off   | manageable         | False       |
| b9899335-9c03-4832-bb81-7b333633ecc8 | controller-1     | None          | power off   | manageable         | False       |
| 33ba313d-7f19-46ba-8e06-bc88ce3cbae4 | controller-2     | None          | power off   | manageable         | False       |
+--------------------------------------+------------------+---------------+-------------+--------------------+-------------+

The logs and workaround point to this being a workflows issue, not an Ironic one. I'm moving this to the Workflows DFG to help answer the question in comment 11.

Comment 15 Scott Lewis 2018-06-22 12:29:32 UTC
OSP11 is now retired, see details at https://access.redhat.com/errata/product/191/ver=11/rhel---7/x86_64/RHBA-2018:1828


Note You need to log in before you can comment on or make changes to this bug.