Bug 511113
| Summary: | qdisk does not autoboot/self-fence system if write errors take longer than interval*tko | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Eduardo Damato <edamato> |
| Component: | cman | Assignee: | Christine Caulfield <ccaulfie> |
| Status: | CLOSED ERRATA | QA Contact: | Cluster QE <mspqa-list> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | 5.4 | CC: | cluster-maint, cward, edamato, jkortus, tao |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | cman-2.0.115-15.el5.src.rpm | Doc Type: | Bug Fix |
| Doc Text: | Story Points: | --- | |
| Clone Of: | 510611 | Environment: | |
| Last Closed: | 2010-03-30 08:40:30 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 510611 | ||
| Bug Blocks: | |||
| Attachments: | |||
Description
Eduardo Damato
2009-07-13 18:16:35 UTC
Created attachment 351506 [details]
patch1 - implements io_timeout - adapted from RHEL4 patch
Created attachment 351507 [details]
patch2 - implements io_timeout turning max_error_cycles off
Created attachment 351508 [details]
patch3 - io_timeout implements independent timeout counters for read and write operations
This patch fixes the situation where patch2 turns off max_error_cycles, which disables the detection of read() errors and leaves the system rebooting only on write() errors.
This patch adds a timer for the last successful read and reboots the system if the last successful read was more than interval*tko ago.
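The check described for patch3 can be sketched roughly as follows. This is a minimal illustration, not the code from the attached patches: the `io_watch` struct and its helper functions are hypothetical names, and time is passed in as a plain `time_t` in seconds so the logic is easy to follow. It assumes `interval` is in seconds and that "more than interval*tko ago" means a strict comparison.

```c
#include <assert.h>
#include <stdbool.h>
#include <time.h>

/* Hypothetical state; names are illustrative, not taken from the qdiskd patches. */
struct io_watch {
    time_t last_read_ok;   /* time of the last successful read() */
    time_t last_write_ok;  /* time of the last successful write() */
    int interval;          /* seconds between qdisk cycles */
    int tko;               /* number of cycles tolerated before giving up */
};

static void io_watch_init(struct io_watch *w, int interval, int tko, time_t now)
{
    assert(interval > 0 && tko > 0);
    w->interval = interval;
    w->tko = tko;
    w->last_read_ok = now;
    w->last_write_ok = now;
}

/* Call these after each successful I/O to reset the matching timer. */
static void io_watch_read_ok(struct io_watch *w, time_t now)
{
    w->last_read_ok = now;
}

static void io_watch_write_ok(struct io_watch *w, time_t now)
{
    w->last_write_ok = now;
}

/* True when either direction has gone more than interval*tko seconds
 * without a success; the caller would then reboot (self-fence) the node. */
static bool io_watch_expired(const struct io_watch *w, time_t now)
{
    time_t limit = (time_t)w->interval * w->tko;

    return (now - w->last_read_ok) > limit ||
           (now - w->last_write_ok) > limit;
}
```

Keeping the two timestamps separate is the point of the sketch: a node whose writes still succeed but whose reads have stalled is still detected, which a write-only timer would miss.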
Master branch:
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=6e91a44cfb2d6baa1a639a2f6e6023bf82ab3cb7
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=51049be41e3c3f198f7b39173bddb2d31786bc5b
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=a9ef89ce68381955d35288eb329d249e37b31618
Created attachment 366860 [details]
Reformatted patch for patch #1
Created attachment 366861 [details]
Reformatted patch for patch #2
Created attachment 366862 [details]
Reformatted patch for patch #3
Created attachment 366864 [details]
Ancillary fix from Fabio in STABLE3
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=fe9a89972834d0459c312bede9e4a32df52e445a
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=8742ae97a69c8cc282faf39d8c1e7bfda441e5b2
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=fe46f6b6e9ed9a40c37fa60966fafc1cf07e36d2
http://git.fedorahosted.org/git/?p=cluster.git;a=commit;h=4ad333b7009ada0af3c1a2a5ad8f9815fb67b582
Reassigning to default component owner for build.
~~ Attention Customers and Partners - RHEL 5.5 Beta is now available on RHN ~~
RHEL 5.5 Beta has been released! There should be a fix present in this release that addresses your request. Please test and report back results here, by March 3rd 2010 (2010-03-03) or sooner.
Upon successful verification of this request, post your results and update the Verified field in Bugzilla with the appropriate value.
If you encounter any issues while testing, please describe them and set this bug into NEED_INFO. If you encounter new defects or have additional patch(es) to request for inclusion, please clone this bug for each request and escalate through your support representative.
An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.
http://rhn.redhat.com/errata/RHBA-2010-0266.html
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days.