Bug 203916
Summary: | groupd daemon segfault and mount hang | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Robert Peterson <rpeterso> | ||||||
Component: | cman | Assignee: | Robert Peterson <rpeterso> | ||||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Cluster QE <mspqa-list> | ||||||
Severity: | medium | Docs Contact: | |||||||
Priority: | medium | ||||||||
Version: | 5.0 | CC: | cluster-maint, teigland | ||||||
Target Milestone: | --- | Keywords: | Reopened | ||||||
Target Release: | --- | ||||||||
Hardware: | All | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | 5.0.0 | Doc Type: | Bug Fix | ||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2006-10-11 14:04:24 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Robert Peterson
2006-08-24 14:03:15 UTC
Created attachment 134812 [details]
Proposed patch to fix the problem
Created attachment 134920 [details]
Better patch for mount hangs
The previous patch was still hanging because the pollfd array
that was allocated did not initialize its 'revents'. That caused
the system to try to execute revents that didn't exist, and somehow
that caused the hang. One side-effect was socket write errors in
the daemon.
Also, in the process of debugging this, I learned that the
gfs_controld daemon was also not dynamically growing its pollfd either.
That did have ramifications, but I don't know the full extent of that.
I do know that group_tool -v would not show you the proper list of
groups when gfs_controld did not dynamically grow its list. This
occurred when a node tried to allocate 5 or more gfs mount points.
When gfs_controld is allowed to dynamically grow its pollfd array,
the proper group list is displayed by group_tool.
This improved patch fixes both problems and I've tested it by
mounting ten GFS file systems without a problem.
I highly suspect these problems might have been causing some or
most of the problems encountered by the QE team with the tank
and such.
Our original mount hangs seem to be fixed. We'll file new bugs when we run into new hangs. Passing through verified to get metric correct. Moving all RHCS ver 5 bugs to RHEL 5 so we can remove RHCS v5 which never existed. |