Bug 498062
Summary: | monitoring fails to restart after execution of probes | ||||||
---|---|---|---|---|---|---|---|
Product: | Red Hat Satellite 5 | Reporter: | wes hayutin <whayutin> | ||||
Component: | Monitoring | Assignee: | Milan Zázrivec <mzazrivec> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | wes hayutin <whayutin> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | low | ||||||
Version: | 530 | CC: | bperkins, msuchy | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | All | ||||||
OS: | Linux | ||||||
URL: | na | ||||||
Whiteboard: | |||||||
Fixed In Version: | sat530-unconfirmed | Doc Type: | Bug Fix | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2009-10-28 19:49:33 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 463877 | ||||||
Attachments: |
|
Description
wes hayutin
2009-04-28 17:33:02 UTC
Milan can you investigate it, please? actually this bug is worse than I thought.. going to change it a bit. recreate. 1. setup monitoring 2. execute probes , include some that utilize rhnmd 3. restart satellite 4. Monitoring and MonitoringScout service will *not* restart ' Failed 5 times to get data for this node. 2009-05-01 14:35:08 NPBootstrap: !! STDERR: 2009-05-01 14:35:08 NPBootstrap: !! EXIT: 256 2009-05-01 14:35:08 MonitoringScout: ----------- SputLite STATUS --------------- 2009-05-01 14:35:08 MonitoringScout: ----------- Dequeuer STATUS --------------- 2009-05-01 14:35:08 MonitoringScout: ----------- Dispatcher STATUS --------------- hrm.. even more interesting... A few minutes later I tried to restart the Monitoring and MonitoringScout service individually and it worked... maybe its a timing issue w/ another service? more debugging.. but I can take it off qa-blockers I think. yup.. seems to be some sort of timing issue w/ other services. after creating several probes restarting the entire satellite will break the restart of Monitoring and MonitoringScout.. However if you go back a few minutes later and manually restart Monitoring and Monitoring Scout it will work. Done. Starting rhn-satellite... Starting Jabber services [ OK ] Starting Oracle Net Listener ... [ OK ] Starting Oracle DB instance "rhnsat" ... [ OK ] Starting osa-dispatcher: [ OK ] Starting tomcat5: [ OK ] Starting httpd: [ OK ] Starting Monitoring ... Starting InstallSoftwareConfig ... [ OK ] Starting GenerateNotifConfig ... [ OK ] Starting NotifEscalator ... [ OK ] Starting NotifLauncher ... [ OK ] Starting Notifier ... [ OK ] Starting AckProcessor ... [ OK ] Starting TSDBLocalQueue ... [ OK ] [ OK ] Starting MonitoringScout ... [ FAIL ] Starting NPBootstrap ... ' Failed 5 times to get data for this node. 2009-05-01 14:46:33 NPBootstrap: !! STDERR: 2009-05-01 14:46:33 NPBootstrap: !! EXIT: 256 Starting SputLite ... [ OK ] Starting Dequeuer ... [ OK ] Starting Dispatcher ... [ OK ] [ OK ] Starting rhn-search... Starting cobbler daemon: [ OK ] Starting RHN Taskomatic... Done. [root@grandprix admin]# /etc/init.d/Monitoring restart Stopping Monitoring ... Stopping TSDBLocalQueue ... [ OK ] Stopping AckProcessor ... [ OK ] Stopping Notifier ... [ OK ] Stopping NotifLauncher ... [ OK ] Stopping NotifEscalator ... [ OK ] Stopping GenerateNotifConfig ... [ OK ] Stopping InstallSoftwareConfig ... [ OK ] [ OK ] Starting Monitoring ... Starting InstallSoftwareConfig ... [ OK ] Starting GenerateNotifConfig ... [ OK ] Starting NotifEscalator ... [ OK ] Starting NotifLauncher ... [ OK ] Starting Notifier ... [ OK ] Starting AckProcessor ... [ OK ] Starting TSDBLocalQueue ... [ OK ] [ OK ] [root@grandprix admin]# [root@grandprix admin]# /etc/init.d/MonitoringScout restart Stopping MonitoringScout ... Stopping Dispatcher ... [ OK ] Stopping Dequeuer ... [ OK ] Stopping SputLite ... [ OK ] Stopping NPBootstrap ... [ OK ] Stopping InstallSoftwareConfig ... [ OK ] [ OK ] Starting MonitoringScout ... Starting InstallSoftwareConfig ... [ OK ] Starting NPBootstrap ... [ OK ] Starting SputLite ... [ OK ] Starting Dequeuer ... [ OK ] Starting Dispatcher ... [ OK ] [ OK ] [root@grandprix admin]# Satellite-5.3.0-RHEL5-re20090507.1 on s390x, I was not able to reproduce the problem following the steps in comment #0 and comment #3. Everything always restarts smoothly, no error like the above. Do you still see the problem on the latest ISO? no.. I dont think this a problem anymore on 5/7.1 moving to on_qa to really test it out verified |