Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1170240

Summary: [ppc] failed to move spm between ppc and x86_64
Product: Red Hat Enterprise Virtualization Manager Reporter: Petr Beňas <pbenas>
Component: ovirt-engineAssignee: Nir Soffer <nsoffer>
Status: CLOSED WONTFIX QA Contact: Aharon Canan <acanan>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 3.4.4CC: amureini, danken, ecohen, gklein, iheim, lpeer, lsurette, michal.skrivanek, pstehlik, rbalakri, Rhev-m-bugs, scohen, sherold, teigland, yeylon
Target Milestone: ---   
Target Release: 3.4.5   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: storage
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-12-11 14:43:23 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1159821    
Bug Blocks: 1122979    
Attachments:
Description Flags
engine log
none
x86_64 vdsm log
none
ppc vdsm log
none
retest engine log
none
retest ppc log
none
retest x86_64 log none

Description Petr Beňas 2014-12-03 14:49:32 UTC
Description of problem:


Version-Release number of selected component (if applicable):
 3.4.4-2.2.el6ev

How reproducible:
100%

Steps to Reproduce:
1. ppc spm in one cluster, x86_64 in other within same DC
2. move spm to the x64 host
3.

Actual results:
ppc is still spm

Expected results:


Additional info:

Comment 1 Petr Beňas 2014-12-03 14:50:25 UTC
Created attachment 964150 [details]
engine log

Comment 2 Petr Beňas 2014-12-03 14:51:28 UTC
Created attachment 964151 [details]
x86_64 vdsm log

Comment 3 Petr Beňas 2014-12-03 14:58:18 UTC
Created attachment 964167 [details]
ppc vdsm log

Comment 6 Allon Mureinik 2014-12-03 16:26:21 UTC
The underlying issue is probably bug 1159821.
What version of sanlock are you using on each host?

Comment 7 Petr Beňas 2014-12-03 16:40:10 UTC
sanlock-3.2.1-1.1.pkvm2_1_1.1.ppc64 and sanlock-2.8-1.el6.x86_64

Comment 8 Allon Mureinik 2014-12-04 15:19:17 UTC
Can you retest with sanlock-3.2.2-2.fc20 (
http://koji.fedoraproject.org/koji/buildinfo?buildID=590689)

Comment 9 Petr Beňas 2014-12-05 12:19:05 UTC
(In reply to Allon Mureinik from comment #8)
> Can you retest with sanlock-3.2.2-2.fc20 (
> http://koji.fedoraproject.org/koji/buildinfo?buildID=590689)

Hi Alon,

I'd like to, but unfortunately can't test fedora package for you. Problem is not fedora, problem is fedora can't be used as a rhel host, only in ovirt. We don't have capacity to test ovirt, we test D/S only. Can you provide rhel rpm? el6/el7, doesn't matter....

Comment 10 Allon Mureinik 2014-12-06 09:28:08 UTC
(In reply to Petr Beňas from comment #9)
> (In reply to Allon Mureinik from comment #8)
> > Can you retest with sanlock-3.2.2-2.fc20 (
> > http://koji.fedoraproject.org/koji/buildinfo?buildID=590689)
> 
> Hi Alon,
> 
> I'd like to, but unfortunately can't test fedora package for you. Problem is
> not fedora, problem is fedora can't be used as a rhel host, only in ovirt.
> We don't have capacity to test ovirt, we test D/S only. Can you provide rhel
> rpm? el6/el7, doesn't matter....

Oops, good point.

David, do we have a RHEL build for 3.2.2?
Or are the numbers different and some other build that includes the mixed-endian support?

Comment 11 David Teigland 2014-12-08 15:02:02 UTC
I can create a RHEL build once sanlock bug 1159821 is approved,
"Sanlock lockspace cannot be shared by host with different endianness (ppc/x86)".

Comment 12 Allon Mureinik 2014-12-08 16:15:54 UTC
Sean/Scott - can you assist with this please?

Comment 13 Allon Mureinik 2014-12-08 16:17:41 UTC
(In reply to David Teigland from comment #11)
> I can create a RHEL build once sanlock bug 1159821 is approved,

Can you clarify what you mean? The BZ now has the rhel-7.1.0+ flag.
Is something else needed?

Comment 14 David Teigland 2014-12-08 16:34:14 UTC
Until moments ago, bz 1159821 did not have the necessary flags to build sanlock with the endianness fix.  With the approval done, I've now built sanlock-3.2.2-2 that includes the fix:
https://brewweb.devel.redhat.com/taskinfo?taskID=8344598

Comment 15 Allon Mureinik 2014-12-08 17:00:04 UTC
(In reply to David Teigland from comment #14)
> Until moments ago, bz 1159821 did not have the necessary flags to build
> sanlock with the endianness fix.  With the approval done, I've now built
> sanlock-3.2.2-2 that includes the fix:
> https://brewweb.devel.redhat.com/taskinfo?taskID=8344598
David - awesome, thanks!
What else can we do to push this towards CLOSED_ERRATA?

Comment 16 Allon Mureinik 2014-12-08 17:01:21 UTC
(In reply to Petr Beňas from comment #9)
> (In reply to Allon Mureinik from comment #8)
> > Can you retest with sanlock-3.2.2-2.fc20 (
> > http://koji.fedoraproject.org/koji/buildinfo?buildID=590689)
> 
> Hi Alon,
> 
> I'd like to, but unfortunately can't test fedora package for you. Problem is
> not fedora, problem is fedora can't be used as a rhel host, only in ovirt.
> We don't have capacity to test ovirt, we test D/S only. Can you provide rhel
> rpm? el6/el7, doesn't matter....

Petr, there now is an EL7 build for both x86 and PPC.
Can you please retest?

https://brewweb.devel.redhat.com/buildinfo?buildID=402377

Comment 17 David Teigland 2014-12-08 17:06:49 UTC
sanlock-3.2.2-2.el7 is included in the RHEL7.1 errata, and I think that is closed when 7.1 is released.

Comment 18 Allon Mureinik 2014-12-08 18:39:59 UTC
(In reply to David Teigland from comment #17)
> sanlock-3.2.2-2.el7 is included in the RHEL7.1 errata, and I think that is
> closed when 7.1 is released.
Makes sense... Any chance to get a 7.0.z build?
(I can't even propose this flag, but that's what needed, I'll find the PM that can)

Comment 19 David Teigland 2014-12-08 18:56:02 UTC
RHEL7.0 is based on sanlock-3.1.0 which does not even contain the basic big endian support.

Comment 20 Allon Mureinik 2014-12-08 19:05:26 UTC
(In reply to David Teigland from comment #19)
> RHEL7.0 is based on sanlock-3.1.0 which does not even contain the basic big
> endian support.

So a no-go on that one. Ack.
Thanks!

[also, returning needinfo flag on Petr for the test requested in comment 16]

Comment 21 Petr Beňas 2014-12-09 09:19:06 UTC
Created attachment 966152 [details]
retest engine log

After upgrading sanlock on the x86_64 host, the bug was still present. After upgrading sanlock on the ppc, both of the hosts failed to become SPM. Attaching the logs.

Comment 22 Petr Beňas 2014-12-09 09:20:29 UTC
Created attachment 966153 [details]
retest ppc log

Comment 23 Petr Beňas 2014-12-09 09:23:45 UTC
Created attachment 966157 [details]
retest x86_64 log

Comment 24 Michal Skrivanek 2014-12-11 11:01:54 UTC
can someone please clarify what we are trying to do here?

sanlock 3.2.2 is required on ppc side, sanlock version on x86 should not matter.
if the DC is created by PPC host it *must* be created using sanlock 3.2.2. Preexisting DCs nees to be removed.

Comment 25 Allon Mureinik 2014-12-11 11:09:27 UTC
(In reply to Petr Beňas from comment #21)
> Created attachment 966152 [details]
> retest engine log
> 
> After upgrading sanlock on the x86_64 host, the bug was still present. After
> upgrading sanlock on the ppc, both of the hosts failed to become SPM.
> Attaching the logs.

This adds credibility to Michal's idea in comment 24.
Petr - can we double verify that you CREATED the domains/pool with sanlock 3.2.2?

Comment 26 Petr Beňas 2014-12-11 14:11:27 UTC
Michal is right, I've used the old storage domains. When I recreated them with sanlock-3.2.2-2.el7.ppc64, everything worked as expected.

Comment 27 Allon Mureinik 2014-12-11 14:39:13 UTC
(In reply to Petr Beňas from comment #26)
> Michal is right, I've used the old storage domains. When I recreated them
> with sanlock-3.2.2-2.el7.ppc64, everything worked as expected.
Michal, Petr - thanks.

Michal - afaik, we do not use spec dependencies in PPC, so I think this BZ can simply be closed.
Or is there any other action item?

Comment 28 Michal Skrivanek 2014-12-11 14:43:23 UTC
good, so it works as expected. Please reopen if you find otherwise...I hope not:)