Bug 882671

Summary: PRD32 - if a host doesn't see any storage domains, it should move to none-operational - even if it is the last host up
Product: Red Hat Enterprise Virtualization Manager Reporter: Barak <bazulay>
Component: ovirt-engineAssignee: mkublin <mkublin>
Status: CLOSED NOTABUG QA Contact: vvyazmin <vvyazmin>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.1.0CC: acathrow, bazulay, dyasny, hateya, iheim, jturner, lpeer, Rhev-m-bugs, sgordon, sgrinber, yeylon, ykaul
Target Milestone: ---   
Target Release: 3.3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: infra
Fixed In Version: sf3 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-05-27 11:35:17 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 912054    
Bug Blocks: 915537    
Attachments:
Description Flags
## Logs vdsm, rhevm
none
## Logs vdsm, rhevm (iSCSI) none

Description Barak 2012-12-02 15:24:22 UTC
This is a part of the solution to resolve a series of issues related to "last host in up" and as a result of discussions done about Bug 869309.

The idea is that when the last host in up does not see any SDs it should move to none-operational.

There is a different bug to enable using a none-operational host as fence proxy when the reason is not network (Bug 876235)

Comment 5 Barak 2012-12-06 13:55:51 UTC
This was actually fixed on upstream by commit 
http://gerrit.ovirt.org/#/c/9566/

The above commit solves also Bug 844350

Hence moving to POST, I don't close as duplicate as I would like the 2 issues to be tested.

Comment 6 mkublin 2012-12-19 11:51:24 UTC
Not exactly understand requirements.
What I did:
1. If host can not connect to pool it will not moved to status UP, event it is a only host in pool.
2. If we performed a failover and we deactivated a last storage domain , the pool should be moved to Maintenance. When pool in maintenance status no failover process should run. These is a intention in code at least for last 2 years (code is written in such way, can be seen in DeactivateStorageDomainCommand). If it is not working in a such way it is a bug.

Comment 7 Stephen Gordon 2013-01-03 21:46:46 UTC
Setting docs_scoped- here, as per comment # 6 it seems like what is actually happening here is we are fixing a regression that was introduced when we added the autorecovery paths. I don't think there is any documentation impact here.

Comment 8 vvyazmin@redhat.com 2013-02-06 14:30:06 UTC
Failed on RHEVM 3.2 - SF05 environment (FC and iSCSI):

RHEVM: rhevm-3.2.0-6.el6ev.noarch
VDSM: vdsm-4.10.2-5.0.el6ev.x86_64
LIBVIRT: libvirt-0.10.2-17.el6.x86_64
QEMU & KVM: qemu-kvm-rhev-0.12.1.2-2.348.el6.x86_64
SANLOCK: sanlock-2.6-2.el6.x86_64


Steps to Reproduce:
1. Create FC or iSCSI DC environment with two hosts
2. Create a multiple SD's
3. From first host, block connection to all SD's
4. From second host, block connection to SD's, except one SD
5. From second host, block connection to last SD

Second host stay in status UP.

Comment 9 vvyazmin@redhat.com 2013-02-06 14:30:50 UTC
Created attachment 693959 [details]
## Logs vdsm, rhevm

Comment 10 vvyazmin@redhat.com 2013-02-06 14:31:29 UTC
Created attachment 693960 [details]
## Logs vdsm, rhevm (iSCSI)

Comment 11 mkublin 2013-02-06 15:05:09 UTC
Like I wrote, I did not understand requirements and what I did:

1. If host can not connect to pool it will not moved to status UP, event it is a only host in pool.
2. If we performed a failover and we deactivated a last storage domain , the pool should be moved to Maintenance. When pool in maintenance status no failover process should run. These is a intention in code at least for last 2 years (code is written in such way, can be seen in DeactivateStorageDomainCommand). If it is not working in a such way it is a bug.

I did not solve a case described here and I had not any intention to do these

Comment 12 mkublin 2013-02-17 13:50:32 UTC
Actually only when all domains are Inactive, I can move host to NonOperationals

Comment 13 Ayal Baron 2013-05-02 20:24:53 UTC
Barak, isn't this bug irrelevant in light of the changes Liron made in the lifecycle?

Comment 14 Barak 2013-05-27 11:35:17 UTC
I agree with comment #13, this bug is not relevant any more,
As we chose a different path to solve the various issues with the host life-cycle.

Hence moving this bug to CLOSE NOTABUG