1740375 – DNS operator reports unavailable reason as DNSUnavailable

Summary: DNS operator reports unavailable reason as DNSUnavailable

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Networking
Sub Component:
Version:	4.2.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	low
Severity:	low
Target Milestone:	---
Target Release:	4.3.0
Assignee:	Dan Mace
QA Contact:	Hongan Li
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2019-08-12 18:57 UTC by Ben Parees
Modified:	2022-08-04 22:39 UTC
CC List:	2 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2020-01-23 11:05:05 UTC
Target Upstream Version:
Embargoed:

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	openshift cluster-dns-operator pull 134	0	'None'	closed	Bug 1761506: status: prevent degraded status flapping on rollout	2020-05-21 14:44:27 UTC
Red Hat Product Errata	RHBA-2020:0062	0	None	None	None	2020-01-23 11:05:32 UTC

Description Ben Parees 2019-08-12 18:57:06 UTC

The reason should shed light on why DNS is unavailable.

Comment 2 Dan Mace 2019-08-14 14:02:00 UTC

The message should also shed light, and in this case the message is "No DNS DaemonSets available". The dashboard isn't showing the most useful detail for this and many other cases.

What reason string would you consider to be a fix?

Comment 3 Ben Parees 2019-08-14 14:07:23 UTC

Unfortunately, messages don't go back to telemeter.

NoDaemonSetsAvailable would be an improvement.  Or NoDNSPodsAvailable?  Those things would at least make it a bit clearer where the problem is.

If something even more specific can be reported, even better. Why are no daemonsets available? Were they created but not scheduled? Did they not get created? Are they invalid for some reason? Was the operator unable to create the daemonset? If pods were created and not scheduled, can we tell why?
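
To make the questions above concrete, here is a minimal sketch of how a more specific reason string could be derived from what an operator observes. It is not the DNS operator's actual code. The `DaemonSetStatus` type, the `unavailable_reason` function, and the reason strings other than NoDaemonSetsAvailable and NoDNSPodsAvailable are hypothetical names chosen for illustration, and the order of the checks is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DaemonSetStatus:
    """Hypothetical subset of the status an operator might observe."""
    desired_scheduled: int  # pods the DaemonSet controller wants to run
    current_scheduled: int  # pods actually placed on nodes
    available: int          # pods that are ready and available


def unavailable_reason(ds: Optional[DaemonSetStatus],
                       create_error: Optional[str] = None) -> Optional[str]:
    """Return a reason string for an unavailable DNS, or None if available.

    Checks run from the most fundamental failure to the least, so the
    first matching reason points at the earliest broken step.
    """
    if create_error is not None:
        # The operator tried to create the DaemonSet and failed.
        return "DaemonSetCreateFailed"  # hypothetical reason string
    if ds is None:
        # Nothing was found at all.
        return "NoDaemonSetsAvailable"
    if ds.desired_scheduled == 0:
        # The DaemonSet exists but no node matches its selector.
        return "NoNodesSelected"  # hypothetical reason string
    if ds.current_scheduled == 0:
        # Pods are wanted but none were scheduled.
        return "DNSPodsNotScheduled"  # hypothetical reason string
    if ds.available == 0:
        # Pods were scheduled but none is ready.
        return "NoDNSPodsAvailable"
    return None


if __name__ == "__main__":
    print(unavailable_reason(None))
    print(unavailable_reason(DaemonSetStatus(3, 3, 0)))
```

Because only the reason string reaches telemetry, each branch returns a short, stable identifier rather than a free-form message.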

Comment 4 Dan Mace 2019-08-14 14:29:21 UTC

I like NoDaemonSetsAvailable; it's probably the most accurate thing we can say right now. I also like the idea of doing a deeper "why" analysis and surfacing the result. There are lots of opportunities for improvement here and in the ingress operator (which has received most of these improvements lately).

For now, I'd like to resolve this bug with the improved reason, as I don't anticipate having time to do anything more sophisticated for the release.

Comment 5 Ben Parees 2019-08-14 14:30:48 UTC

> For now, I'd like to resolve this bug with the improved reason

I think that's a fine start. It gives us a good starting point for questions to ask about the cluster.

Comment 6 Dan Mace 2019-11-01 19:59:40 UTC

I think https://github.com/openshift/cluster-dns-operator/pull/134 was a decent fix for this issue. In addition to fixing the status flapping, we improved the status messaging in the ways discussed here.

Comment 7 Hongan Li 2019-11-05 09:49:27 UTC

Verified, as described in https://bugzilla.redhat.com/show_bug.cgi?id=1761506#c2

Comment 9 errata-xmlrpc 2020-01-23 11:05:05 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0062