Bug 438434
| Summary: | clvmd fails to start on 58th node | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Nate Straz <nstraz> | ||||
| Component: | lvm2-cluster | Assignee: | Christine Caulfield <ccaulfie> | ||||
| Status: | CLOSED ERRATA | QA Contact: | GFS Bugs <gfs-bugs> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | medium | ||||||
| Version: | 5.2 | CC: | edamato, mbroz | ||||
| Target Milestone: | rc | Keywords: | TestBlocker | ||||
| Target Release: | --- | ||||||
| Hardware: | All | ||||||
| OS: | Linux | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | RHBA-2008-0379 | Doc Type: | Bug Fix | ||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2008-05-21 14:26:41 UTC | Type: | --- | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
|
Description
Nate Straz
2008-03-20 20:54:44 UTC
Created attachment 298748 [details]
clvmd core file
Upon further inspection, it appears that all of the clvmd processes on all nodes have exitted and left behind their dlm lockspaces. Restarting clvmd on each node produces a core file. Adding TestBlocker flag since I really need clvmd in order to mount something since the four disks come up ordered differently on some nodes. This is one of those "AAaargh!" bugs, the basic fix is : - int *new_updown = realloc(node_updown, new_size); + int *new_updown = realloc(node_updown, sizeof(int) * new_size); The checkin below also contains a fix to setting the initial size of the array. Checking in daemons/clvmd/clvmd-cman.c; /cvs/lvm2/LVM2/daemons/clvmd/clvmd-cman.c,v <-- clvmd-cman.c new revision: 1.21; previous revision: 1.20 done I am still hitting realloc problems and clvmd not staying up on all nodes with high node counts. It turns out the initial allocation can be wrong too. This patch fixes: Checking in daemons/clvmd/clvmd-cman.c; /cvs/lvm2/LVM2/daemons/clvmd/clvmd-cman.c,v <-- clvmd-cman.c new revision: 1.22; previous revision: 1.21 done An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2008-0379.html |