Bug 1608217
Summary: | goferd prints reconnect error whenever qdrouterd is restarted | ||
---|---|---|---|
Product: | Red Hat Satellite | Reporter: | Pavel Moravec <pmoravec> |
Component: | katello-agent | Assignee: | satellite6-bugs <satellite6-bugs> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Jan HutaĆ <jhutar> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 6.3.2 | CC: | chris.snell, egolov, hmore, janarula, jcrumple, jhutar, jortel, mmccune, pcreech, pmoravec, ramsingh |
Target Milestone: | Released | Keywords: | Triaged |
Target Release: | Unused | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Satellite 6.4.1 | Doc Type: | If docs needed, set a value |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2018-12-11 16:20:34 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Pavel Moravec
2018-07-25 06:45:49 UTC
goferd already waits the 10 seconds and the error: Jul 25 09:45:04 pmoravec-caps63 goferd: [ERROR][pulp.agent.d79cc595-3b70-4e8b-8b72-d8482f4b66e9] gofer.messaging.adapter.proton.reliability:53 - Connection amqps://pmoravec-sat63.gsslab.brq2.redhat.com:5647 disconnected: Condition('amqp:connection:framing-error', 'SSL Failure: Unknown error') is printed by /usr/lib/python2.7/site-packages/gofer/messaging/adapter/proton/reliability.py : def reliable(fn): def _fn(messenger, *args, **kwargs): repair = lambda: None while not Thread.aborted(): try: repair() return fn(messenger, *args, **kwargs) except LinkDetached, le: if le.condition != NOT_FOUND: log.error(utf8(le)) repair = messenger.repair sleep(DELAY) else: raise NotFound(*le.args) except ConnectionException, pe: log.error(utf8(pe)) ###### this line repair = messenger.repair sleep(DELAY) return _fn Both "log.error(utf8(le))" are called here just when connection has been successfully established but some link or connection error (other than missing pulp.agent.* queue, that is important) is hit. Since goferd will try to reconnect in 10s, these events shall be warning instead of error. Only missing queue event is worth to be of error verbosity as that means something fishy happening on Satellite qpidd. But that is logged either way via explicit "raise NotFound(*le.args)" and logged as error in gofer.messaging.consumer:74 . So I suggest both "log.error(utf8(le))" in proton/reliability.py to be changed to warning verbosity. Jeff, do you agree? Agreed. Requested change completed/merged in the upstream project. It will be tagged and released in Fedora (updates) and Copr shortly. https://github.com/jortel/gofer/pull/90 https://github.com/jortel/gofer/pull/92 Moving to POST as the 2 PRs referenced in comment 3 have been merged upstream. We shipped this in 6.4.1 with the update here: https://bugzilla.redhat.com/show_bug.cgi?id=1646736 I tested the updated builds and see the switch to WARNING vs ERROR. Closing as CURRENTRELEASE. |