Bug 429920 - RHEL5 cmirror tracker: clogd gets stuck in saCkptInitialize
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: openais
Hardware: All Linux
Priority: high, Severity: high
Target Milestone: rc
Assigned To: Steven Dake
QA Contact: Cluster QE
Keywords: TestBlocker
Depends On:
Blocks: 430797
Reported: 2008-01-23 14:52 EST by Corey Marthaler
Modified: 2016-04-26 11:39 EDT
Fixed In Version: RHBA-2008-0411
Doc Type: Bug Fix
Last Closed: 2008-05-21 10:31:19 EDT
Description Corey Marthaler 2008-01-23 14:52:55 EST
Description of problem:
Running the init script or starting clogd by hand never returns. It's stuck:
wait4(3960, 0x7fff91b1a38c, WNOHANG, NULL) = 0

Version-Release number of selected component (if applicable):

How reproducible:
Comment 1 Corey Marthaler 2008-01-23 18:46:59 EST
This is on the latest build as well. 

[root@hayes-02 ~]# service cmirror start
Loading clustered mirror log module:                       [  OK  ]
Starting clustered mirror log server:                      [  OK  ]
[root@hayes-02 ~]# service cmirror stop
Stopping clustered mirror log server:                      [  OK  ]
Unloading clustered mirror log module:                     [  OK  ]
[root@hayes-02 ~]# service cmirror start
Loading clustered mirror log module:                       [  OK  ]
Starting clustered mirror log server:      

Comment 2 Jonathan Earl Brassow 2008-01-24 12:33:40 EST
I've seen this too.  I think it happens because I'm not properly exiting the AIS
ckpt service.  The fix for that should be in the latest build now, though.

Either way, if the log server dies or gets killed with -9, we should be able to
restart it, and AIS should clean up.

What you've done above is the same as doing:
1) clogd
2) killall clogd
3) clogd

If this now works, then try:
1) clogd
2) killall -9 clogd
3) clogd
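
The two sequences above could be scripted so that a hang shows up as a
failure instead of a stuck terminal. This is only a rough sketch: the
helper names returns_within and restart_check and the 30-second limit are
made up for illustration, and it assumes clogd returns to the shell once it
has daemonized (as the steps imply) and that timeout(1) and killall(1) are
installed.

```shell
#!/bin/sh
# Sketch only: returns_within, restart_check and the 30-second limit are
# illustrative assumptions, not part of clogd or the cmirror init script.

# returns_within LIMIT CMD [ARGS...]
# Run CMD, but give up after LIMIT seconds. timeout(1) exits with 124
# when the limit is reached; otherwise CMD's own exit status is returned.
returns_within() {
    limit=$1
    shift
    timeout "$limit" "$@"
    rc=$?
    if [ "$rc" -eq 124 ]; then
        echo "hung: '$*' did not return within ${limit}s" >&2
    fi
    return "$rc"
}

# restart_check SIGNAL
# Start clogd, signal it by name, then start it again.
# SIGNAL is passed straight to killall, e.g. TERM or 9.
restart_check() {
    signal=$1
    returns_within 30 clogd || return 1
    killall "-$signal" clogd
    returns_within 30 clogd || return 1
    echo "restart after signal $signal ok"
}

# Usage (as root, on a cluster node):
#   restart_check TERM   # first sequence: plain killall
#   restart_check 9      # second sequence: killall -9
```

A non-zero return from restart_check together with the "hung" message
would correspond to the behaviour in comment 1, where the second start
never comes back.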
Comment 5 Corey Marthaler 2008-02-04 09:56:18 EST
This issue is marked ON_QA but isn't fixed in the latest build. Should the fix
be in the following?

Comment 6 Jonathan Earl Brassow 2008-02-04 12:14:59 EST
My guess is that this is an OpenAIS issue and won't be fixed by any of the above.

I have a cluster you can use to test this.
Comment 7 Steven Dake 2008-02-04 12:24:11 EST
Sigh. I believed whoever changed it to MODIFIED had fixed the problem.
Apparently not.

I'll talk to you, Jon, and reassign this bug from you to me.
Comment 8 Corey Marthaler 2008-02-26 15:10:18 EST
Fix verified in openais-0.80.3-12.el5.
Comment 10 errata-xmlrpc 2008-05-21 10:31:19 EDT
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

