Red Hat Bugzilla – Bug 1243979
celery startup never returns if mongodb is down
Last modified: 2018-09-19 11:19:14 EDT
Description of problem:

Version-Release number of selected component (if applicable):
Satellite 6.1 Snap12

How reproducible:

Steps to Reproduce:
1. Ensure mongod cannot start properly. For instance, shutdown mongod then create a bogus /var/lib/mongod/mongod.lock pid file.
2. $ katello-service restart

Actual results:
$ katello-service restart
Stopping Qpid AMQP daemon: [ OK ]
Starting Qpid AMQP daemon: [ OK ]
Shutting down qdrouterd services: [ OK ]
Starting qdrouterd services: [ OK ]
celery multi v3.1.11 (Cipater)
> Stopping nodes...
> resource_manager@hostname: QUIT -> 9704
> Waiting for 1 node -> 9704.....
> resource_manager@hostname: OK
celery multi v3.1.11 (Cipater)
> Starting nodes...
> resource_manager@hostname: OK
celery init v10.0.
Using config script: /etc/default/pulp_resource_manager
Stopping mongod: [ OK ]
Starting mongod: [ OK ]
Waiting for mongod to become available: [FAILED]
Stopping elasticsearch: [ OK ]
Starting elasticsearch: [ OK ]
Stopping tomcat6: [ OK ]
Starting tomcat6: [ OK ]
Stopping foreman-proxy: [ OK ]
Starting foreman-proxy: [ OK ]
celery init v10.0.
Using configuration: /etc/default/pulp_workers, /etc/default/pulp_celerybeat
Restarting celery periodic task scheduler
Stopping pulp_celerybeat... OK
Starting pulp_celerybeat...
celery multi v3.1.11 (Cipater)
> Stopping nodes...
> reserved_resource_worker-0@hostname: QUIT -> 9936
> reserved_resource_worker-1@hostname: QUIT -> 9962
> Waiting for 2 nodes -> 9936, 9962......
> reserved_resource_worker-0@hostname: OK
> Waiting for 1 node -> 9962....
> reserved_resource_worker-1@hostname: OK
celery multi v3.1.11 (Cipater)
> Starting nodes...
> reserved_resource_worker-0@hostname: No handlers could be found for logger "pulp.server.db.connection"
*** gets stuck here indefinitely until user CTRL-c's ***

Expected results:
Even if a database connection cannot be established the startup should still return.

Additional info:
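The expected result above amounts to a bounded wait: try the database for a limited time, then give up and return a failure instead of blocking. This report does not describe how Pulp implemented its fix, so the following is only a minimal sketch of the general idea. The function name `wait_for_port`, the timeout values, and the use of a plain TCP probe are all assumptions made for illustration.

```python
import socket
import time


def wait_for_port(host, port, timeout=10.0, interval=0.5):
    """Return True once a TCP connection succeeds, False if the deadline passes.

    Hypothetical helper for illustration; not taken from Pulp or katello-service.
    """
    deadline = time.monotonic() + timeout
    while True:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            # Deadline reached: report failure rather than retrying forever.
            return False
        try:
            # Cap each attempt so one blocked connect cannot outlive the deadline.
            with socket.create_connection((host, port), timeout=min(interval, remaining)):
                return True
        except OSError:
            # Refused or timed out: pause briefly (never past the deadline), then retry.
            time.sleep(min(interval, max(0.0, deadline - time.monotonic())))
```

A caller could run `wait_for_port("localhost", 27017, timeout=30)` (27017 being MongoDB's default port) and exit with a non-zero status when it returns False, so the init script gets control back either way.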
Since this issue was entered in Red Hat Bugzilla, the release flag has been set to ? to ensure that it is properly evaluated for this release.
Was this on RHEL6? When you respond, please also add that info to the upstream bug report (linked above under "External Trackers").
The Pulp upstream bug status is at NEW. Updating the external tracker on this bug.
The Pulp upstream bug priority is at Normal. Updating the external tracker on this bug.
This was occurring on EL6.
The Pulp upstream bug status is at MODIFIED. Updating the external tracker on this bug.
Moving this to POST. Please verify once the 2.8 version of Pulp is incorporated into the downstream build.
The Pulp upstream bug status is at ON_QA. Updating the external tracker on this bug.
The Pulp upstream bug status is at CLOSED - CURRENTRELEASE. Updating the external tracker on this bug.
VERIFIED.

@Sat6.2.0-Snap10.1
katello-service-3.0.0-3.el6sat.noarch

# chown -R root:root /var/lib/mongodb
# katello-service restart
Stopping httpd: [ OK ]
celery multi v3.1.11 (Cipater)
> Stopping nodes...
> reserved_resource_worker-3@<FQDN>: QUIT -> 29260
> reserved_resource_worker-1@<FQDN>: QUIT -> 29220
> reserved_resource_worker-2@<FQDN>: QUIT -> 29239
> reserved_resource_worker-0@<FQDN>: QUIT -> 29205
> Waiting for 4 nodes -> 29260, 29220, 29239, 29205........
> reserved_resource_worker-3@<FQDN>: OK
> Waiting for 3 nodes -> 29220, 29239, 29205....
> reserved_resource_worker-1@<FQDN>: OK
> Waiting for 2 nodes -> 29239, 29205....
> reserved_resource_worker-2@<FQDN>: OK
> Waiting for 1 node -> 29205....
> reserved_resource_worker-0@<FQDN>: OK
celery init v10.0.
Using config script: /etc/default/pulp_workers
Stopping tomcat6: waiting for processes 28938 to exit [ OK ]
Stopping foreman-proxy: [ OK ]
celery init v10.0.
Using configuration: /etc/default/pulp_workers, /etc/default/pulp_celerybeat
Stopping pulp_celerybeat... NOT RUNNING
celery multi v3.1.11 (Cipater)
> Stopping nodes...
> resource_manager@<FQDN>: QUIT -> 29391
> Waiting for 1 node -> 29391.....
> resource_manager@<FQDN>: OK
celery init v10.0.
Using config script: /etc/default/pulp_resource_manager
[ OK ] Qpid AMQP daemon: [ OK ]
Shutting down qdrouterd services: [ OK ]
Stopping postgresql service: [ OK ]
Success!
Starting postgresql service: [ OK ]
Starting mongod: [ OK ]
Waiting for mongod to become available: [FAILED]
Starting qdrouterd services: [ OK ]
Starting Qpid AMQP daemon: [ OK ]
celery multi v3.1.11 (Cipater)
> Starting nodes...
> resource_manager@<FQDN>: OK
celery init v10.0.
Using config script: /etc/default/pulp_resource_manager
celery init v10.0.
Using configuration: /etc/default/pulp_workers, /etc/default/pulp_celerybeat
Starting pulp_celerybeat...
Starting foreman-proxy: [ OK ]
Starting tomcat6: [ OK ]
celery multi v3.1.11 (Cipater)
> Starting nodes...
> reserved_resource_worker-0@<FQDN>: OK
> reserved_resource_worker-1@<FQDN>: OK
> reserved_resource_worker-2@<FQDN>: OK
> reserved_resource_worker-3@<FQDN>: OK
celery init v10.0.
Using config script: /etc/default/pulp_workers
Starting foreman-tasks: [ OK ]
[Wed May 11 05:20:56 2016] [warn] module passenger_module is already loaded, skipping
Starting httpd: [ OK ]
Some services failed to start: mongod
# echo $?
1

>>> If mongo fails to start, katello-service no longer waits indefinitely for the failing service to start, and it informs the user about the failure.
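The manual check above (run the restart, confirm it returns, inspect `$?`) could be scripted so that a regression shows up as a timeout rather than a hung terminal. The sketch below is an assumption about how such a check might look, not part of the QA procedure recorded here; `run_with_deadline` and the timeout value are hypothetical.

```python
import subprocess
import sys


def run_with_deadline(argv, timeout):
    """Run argv and return its exit code, or None if it did not finish in time.

    Hypothetical helper for illustration only.
    """
    try:
        # subprocess.run kills the direct child and raises TimeoutExpired
        # when the timeout elapses; processes the child spawned are not reaped.
        return subprocess.run(argv, timeout=timeout).returncode
    except subprocess.TimeoutExpired:
        return None
```

For this bug, `run_with_deadline(["katello-service", "restart"], timeout=600)` returning 1 would match the verified behaviour shown above, while None would indicate that the command is hanging again. The 600-second limit is an arbitrary placeholder.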
pulp-server-2.8.1.3-1.el6sat.noarch ( /etc/rc.d/init.d/pulp_celerybeat )
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2016:1501