Bug 1151835
| Summary: | [PPC] Vdsm fails to reconstruct master domain,sanlock causes host's to reboot instead of vdsmd restart | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Ori Gofen <ogofen> | ||||
| Component: | vdsm | Assignee: | Nir Soffer <nsoffer> | ||||
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Ori Gofen <ogofen> | ||||
| Severity: | urgent | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 3.5.0 | CC: | acanan, amureini, bazulay, ecohen, gklein, glazarov, hannsj_uhl, iheim, lpeer, lsurette, michal.skrivanek, nsoffer, ogofen, scohen, tnisan, yeylon | ||||
| Target Milestone: | --- | ||||||
| Target Release: | 3.4.3 | ||||||
| Hardware: | ppc64 | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | storage | ||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2014-11-10 13:09:09 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | Storage | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Bug Depends On: | 1142454, 1152594, 1156017 | ||||||
| Bug Blocks: | 1122979, 1148013 | ||||||
| Attachments: |
|
||||||
|
Description
Ori Gofen
2014-10-12 13:13:15 UTC
(In reply to Ori from comment #0) It is not clear what is "wrong PPC boot configuration", and how is this related to vdsm. If the root cause of this bug is the same as in bug 1141658, then this is a duplicate. If this bug describe another issue (wrong boot configuration?), then it is not a vdsm bug. Also I don't see how reconstruct master is related to bug 1141658. That bug is about blocking access to storage, which leads to reboot of the machine, because sanlock cannot terminate or kill vdsm (because selinux policy is incorrect). To make it more clear, please attach these logs: - /var/log/sanlock.log - /var/log/audit/audit.log* Also missing: - vdsm version - steps to reproduce - reproducible: 100% - how may times did you reproduced this? Nir, This bug is related to BZ #1141658, but this pertains to PPC (RHEV for IBM PPC), we require separation between the bug given the different platforms and multiple validations. Regarding the "wrong PPC boot configuration", the reboot caused by the selinux blocking the SAN locking mechanism ends up with the OS not booting up (wrong petitboot configuration). This indeed is not vdsm related but is just my observation. Per comment https://bugzilla.redhat.com/show_bug.cgi?id=1141658#c4, the reconstruct master is related to this bug. This is 100% reproducible as stated in the bug, I reproduced this 4 times. I will attach the logs requested with the 5th try. Thanks, Ori. (In reply to Gilad Lazarovich from comment #2) > Per comment https://bugzilla.redhat.com/show_bug.cgi?id=1141658#c4, the > reconstruct master is related to this bug. The issue is not related to in any way to reconstruct master. The issue (bug 1141658) is: 1. blocking access to storage 2. sanlock try to stop the spm and fail (selinux policy bug) 3. sanlock reboot the machine (expected behavior) Reconstruct master did not happen because the machine was rebooted. > This is 100% reproducible as stated in the bug, I reproduced this 4 times. > I will attach the logs requested with the 5th try. Do not forget the steps to reproduce. Currently we can do nothing with this bug. Sorry Nir, PPC OS is taking it's first QA steps which means that at any given time, some Engineers are running tests on the system, I cannot make any changes to the SElinux,in addition, from the web https://brewweb.devel.redhat.com/taskinfo?taskID=8095125 it's not clear whether the packages supports PPC, I don't want to break any running tests. (In reply to Ori from comment #7) > Sorry Nir, PPC OS is taking it's first QA steps which means that at any > given time, some Engineers are running tests on the system, I cannot make > any changes to the SElinux,in addition, from the web > https://brewweb.devel.redhat.com/taskinfo?taskID=8095125 it's not clear > whether the packages supports PPC, I don't want to break any running tests. Then this bug will have to wait until you have a release including this selinux policy. (In reply to Ori from comment #7) > Sorry Nir, PPC OS is taking it's first QA steps which means that at any > given time, some Engineers are running tests on the system, I cannot make > any changes to the SElinux,in addition, from the web > https://brewweb.devel.redhat.com/taskinfo?taskID=8095125 it's not clear > whether the packages supports PPC, I don't want to break any running tests. It's a noarch package - It supports PPC. Please schedule yourself a window on the PPC machine and test with this package. will do it next time I'll have those hosts. this is already in 3.4.3, ON_QA for retest… BZ #1156017 have been marked as blocker due to lack of storage resources on host, I need 2 domains with different ip's to verify the correct behavior, right now the creation of any nfs Storage domain is impossible. verified on powerKVM latest ver. the whole operation takes longer than usual though: normal time is about 15 minutes where reconstruct on PPC setup took about 22 minutes. released Oct 31 |