Bug 1565083
| Summary: | VDSM - "Sanlock resource read failure" exception during getSpmStatus | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | Yosi Ben Shimon <ybenshim> | ||||
| Component: | BLL.Storage | Assignee: | Nir Soffer <nsoffer> | ||||
| Status: | CLOSED NOTABUG | QA Contact: | Elad <ebenahar> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | medium | ||||||
| Version: | 4.2.1.2 | CC: | bugs, lsvaty, nsoffer, ylavi | ||||
| Target Milestone: | ovirt-4.2.4 | Keywords: | Automation | ||||
| Target Release: | --- | Flags: | rule-engine:
ovirt-4.2+
ylavi: exception+ |
||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2018-05-28 16:57:35 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | Storage | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
Yosi Ben Shimon
2018-04-09 10:57:09 UTC
This means sanlock was not able to access storage. Most likely your storage was not accessible at this time either because you block access to storage, or because of some issue in your environment. I don't see any bug the system. Please reopen when you have data suggesting that this is a bug in the system, and when you can reproduce this. Hi Nir, Can you consider adding more relevant message to logs instead of the traceback? We would avoid bugs like this, if the right ERROR is displayed that storage is unaccesible, so Admin can check it. Can you consider reopenning for clarifying the ERROR? (In reply to Lukas Svaty from comment #2) > Can you consider adding more relevant message to logs instead of the > traceback? I don't think so. The traceback is required to understand the sanlock failure and we cannot hide it. We would not be able to debug the system if we start to hide tracebacks in logs. > We would avoid bugs like this, if the right ERROR is displayed that > storage is unaccesible, so Admin can check it. Storage QE should be familiar with vdsm errors and know what should be checked when they happen. Maybe we need better documentation about common errors. > Can you consider reopenning for clarifying the ERROR? The error is very clear: SanlockException: (-202, 'Sanlock resource read failure', 'IO timeout') I agree that we can add possible recovery info for some errors. For example for error -202, it is likely that there an issue on storage or network, so we can recommend to check storage and network health. But we learned few weeks ago that this can happen when importing images from glance (bug 1832967), because of incorrect import code. So recommending to check storage and network is can be misleading in such case, it is pretty hard to provide good message based on single error. Let's open RFE for adding recovery info for storage errors. We can work on this for 4.5. |