Bug 980076
Summary: | After a crash, rhqctl won't notice that storage node is down | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Product: | [Other] RHQ Project | Reporter: | Michael Burman <yak> | ||||||||||
Component: | No Component | Assignee: | Heiko W. Rupp <hrupp> | ||||||||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | Mike Foley <mfoley> | ||||||||||
Severity: | unspecified | Docs Contact: | |||||||||||
Priority: | unspecified | ||||||||||||
Version: | 4.9 | CC: | ahovsepy, hrupp, jshaughn, vkhurana | ||||||||||
Target Milestone: | --- | ||||||||||||
Target Release: | RHQ 4.9 | ||||||||||||
Hardware: | Unspecified | ||||||||||||
OS: | Unspecified | ||||||||||||
Whiteboard: | |||||||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||||||
Doc Text: | Story Points: | --- | |||||||||||
Clone Of: | Environment: | ||||||||||||
Last Closed: | 2013-09-24 19:08:23 UTC | Type: | Bug | ||||||||||
Regression: | --- | Mount Type: | --- | ||||||||||
Documentation: | --- | CRM: | |||||||||||
Verified Versions: | Category: | --- | |||||||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||||||
Embargoed: | |||||||||||||
Bug Depends On: | |||||||||||||
Bug Blocks: | 951619 | ||||||||||||
Attachments: |
|
Description
Michael Burman
2013-07-01 11:18:34 UTC
Created attachment 767411 [details]
Checks if that pid is really active or not, and deletes the pid file if it isn't
master b3aa6d8a54 Thanks Michael! when i do kill -9 <process id of cassandra> and then i run ./rhqctl status it still says that RHQ Storage node is running. Not sure if this should be filed as a seperate bug or should this issue be addressed by this bug. Hi, Fixing that one wasn't in my mind when I created the patch, but here's a fix attached (which I can't right now test unfortunately, but I'll do that later). Created attachment 781565 [details]
Fix the status command with ghost pid files
based on comment #3, this does not look verified. After applying the newest (attachment 781565 [details]) patch:
michael@grace-mint ~/projects/rhq/dev-container/rhq-server/bin $ ./rhqctl status
00:28:53,533 INFO [org.jboss.modules] JBoss Modules version 1.2.0.CR1
RHQ Storage Node (pid 1462 ) IS running
RHQ Server (pid 1651 ) IS running
JBossAS Java VM child process (pid 1651 ) IS running
RHQ Agent (pid 1861 ) IS running
michael@grace-mint ~/projects/rhq/dev-container/rhq-server/bin $ kill -9 1462
michael@grace-mint ~/projects/rhq/dev-container/rhq-server/bin $ ./rhqctl status
00:29:06,301 INFO [org.jboss.modules] JBoss Modules version 1.2.0.CR1
RHQ Storage Node (no pid file) IS NOT running
RHQ Server (pid 1651 ) IS running
JBossAS Java VM child process (pid 1651 ) IS running
RHQ Agent (pid 1861 ) IS running
michael@grace-mint ~/projects/rhq/dev-container/rhq-server/bin $ echo $?
0
michael@grace-mint ~/projects/rhq/dev-container/rhq-server/bin $
So it works fine now with status command also.
2nd patch has been pushed to master as cdf64dfe0d3b3 Thanks again, Michael! verified. please get screenshot and log attached. Created attachment 793155 [details]
status.png
Created attachment 793161 [details]
status.log
Bulk closing of RHQ 4.9 verified items |