Bug 2178501 - 16.2 - #5- Disconnecting from the wrong host - Fix issues with nova-manage volume_attachment subcommand
Summary: 16.2 - #5- Disconnecting from the wrong host - Fix issues with nova-manage vo...
Keywords:
Status: ON_DEV
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-nova
Version: 16.2 (Train)
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: z2
: 17.1
Assignee: Amit Uniyal
QA Contact: OSP DFG:Compute
URL:
Whiteboard:
: 2178507 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-03-15 06:14 UTC by Amit Uniyal
Modified: 2023-08-11 15:39 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
In RHOSP 17.1 GA there is a known issue where the volume refresh fails. The cause of this is because the compute host is not defined correctly in the connector file. There is currently no workaround.
Clone Of:
Environment:
Last Closed:
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 877446 0 None NEW Disconnecting volume from the compute host 2023-05-31 15:16:23 UTC
Red Hat Issue Tracker OSP-23091 0 None None None 2023-03-15 06:16:21 UTC

Description Amit Uniyal 2023-03-15 06:14:31 UTC
Work item: disconnecting the volume from the correct host. From the email thread:

>> 5- Disconnecting from the wrong host
>>
>> There were cases where the instance said to live in compute#1 but the
>> connection_info in the BDM record was for compute#2, and when the script
>> called `remote_volume_connection` then nova would call os-brick on
>> compute#1 (the wrong node) and try to detach it.
>>
>> In some case os-brick would mistakenly think that the volume was
>> attached (because the target and lun matched an existing volume on the
>> host) and would try to disconnect, resulting in errors on the compute
>> logs.
>>
>> It wasn't a problem (besides creating some confusion and noise) because
>> the removal of the multipath failed since it was in use by an instance.
>>
>> I believe it may be necessary to change the code here:
>>
>>                  compute_rpcapi.remove_volume_connection(
>>                      cctxt, instance, volume_id, instance.host)
>>
>> To use the "host" from the connector properties in the
>> bdb.connection_info if it is present.
> 
> ya that also sound like a clear bug

This bug was initially created as a copy of Bug #2161733

I am copying this bug because: 



Description of problem:

Gorka had to make heavy use of the `nova-manage volume_attachment` commands in resolving an escalation for Ericsson, and he had some feedback for us. We'd like to implement that feedback.

Version-Release number of selected component (if applicable):

From master all the way down to 16.2.

How reproducible:

N/A

Steps to Reproduce:

N/A

Actual results:

N/A

Expected results:

N/A

Additional info:

The thread where Gorka explains his feedback is at [1]. I'll try to break it down into specific fixes/work items in subsequent comments in this BZ.

[1] https://lists.corp.redhat.com/archives/rhos-compute/2022-December/000883.html

Comment 1 Artom Lifshitz 2023-06-01 15:52:28 UTC
Optimistically targeting at z6 as a UX improvement that'd be nice to have.

Comment 11 Artom Lifshitz 2023-08-11 15:39:08 UTC
*** Bug 2178507 has been marked as a duplicate of this bug. ***


Note You need to log in before you can comment on or make changes to this bug.