Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Red Hat Satellite engineering is moving the tracking of its product development work on Satellite to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "Satellite project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs will be migrated starting at the end of May. If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "Satellite project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/SAT-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 1353458

Summary:	qdrouterd stops responding on any connection while waiting for hung DNS PTR query
Product:	Red Hat Satellite	Reporter:	Pavel Moravec <pmoravec>
Component:	katello-agent	Assignee:	Ted Ross <tross>
Status:	CLOSED ERRATA	QA Contact:	jcallaha
Severity:	medium	Docs Contact:
Priority:	medium
Version:	6.1.9	CC:	bbuckingham, bkearney, cwelton, egolov, ehelms, johan.bergstrom, mcressma, mlesko, mmithaiw, rnuccite, sjagtap, tross
Target Milestone:	Unspecified	Keywords:	Triaged
Target Release:	Unused
Hardware:	x86_64
OS:	Linux
Whiteboard:
Fixed In Version:	qpid-dispatch-0.4-17	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2016-11-10 08:13:23 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Pavel Moravec 2016-07-07 07:37:33 UTC

Description of problem:
Assume scenario:
- a goferd client connects to qdrouterd
- qdrouterd does reverse DNS lookup (PTR query) for client's IP address against a DNS server
- assume the DNS server is broken, such that it does not reply to the query or replies after say one minute
- qdrouterd while waiting to the response stops send any AMQP data to any other connection

Consequences:
- that causes inter-qdrouterd connection timeouts due to unresponded heartbeats
- any communication between Satellite and katello-agent is postponed/delayed, causing optionally task timeouts

Please backport https://issues.apache.org/jira/browse/DISPATCH-443 once the fix is available.

Version-Release number of selected component (if applicable):
qpid-dispatch-router-0.4-11.el7.x86_64

How reproducible:
100%

Steps to Reproduce:
0. Setup Satellite with Capsule, either with external DNS server
1. Break your DNS server such that it does not respond (on time) to some DNS PTR queries (i.e. remove some IP range from its managed rages)
2. Kick off goferd on a client such that DNS PTR query against its IP address is responded after a long time or never.
3. Observe no communication can flow through the qdrouterd where the goferd is connecting to - including inter-qdrouterd communication or new task (package install) request.

Actual results:
- Package installs to other clients will timeout (assuming the DNS query is still being "processed").
- inter-qdrouterd connection flapping (see https://access.redhat.com/solutions/2429011 for particular logs)

Expected results:
- other clients can communicate with the qdrouterd, they can accept and acknowledge tasks (to istall a package) etc.
- inter-qdrouterd connection is stable

Additional info:

Comment 2 Ted Ross 2016-07-07 20:01:04 UTC

A fix for this issue has been committed to the master branch upstream.

https://git-wip-us.apache.org/repos/asf?p=qpid-dispatch.git;a=patch;h=cf3c874

This is a low-risk update and is ready for back-port to the product builds if approved.

Comment 8 Pavel Moravec 2016-11-09 09:27:48 UTC

Hi Matej,
having Interconnect / qdrouterd knowledge, would you be able to reproduce (or even verify) this?

Comment 10 Bryan Kearney 2016-11-09 21:04:59 UTC

I am moving this to VERIFIED. We have not been able to reproduce the issue, and we have already deployed this code at certain customers with no negative imapct. Therefore, we are markign this as verified to deliver with 6.2.4. If you are still seeing this issue after 6.2.4 please feel free to re-open and provide additional information on how to reproduce.

Comment 12 errata-xmlrpc 2016-11-10 08:13:23 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2016:2699

Comment 13 Red Hat Bugzilla 2023-09-14 03:27:42 UTC

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days