Bug 1968415
Summary: | CoreDNSErrorsHigh alert generated without any issue in the cluster. | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Vedanti Jaypurkar <vjaypurk> |
Component: | Networking | Assignee: | aos-network-edge-staff <aos-network-edge-staff> |
Networking sub component: | DNS | QA Contact: | Melvin Joseph <mjoseph> |
Status: | CLOSED INSUFFICIENT_DATA | Docs Contact: | |
Severity: | medium | ||
Priority: | medium | CC: | alebedev, anpicker, cchen, cholman, fsoppels, hongli, imm, mmasters, ppitonak, skanniha, skharat, snalawad, sreber |
Version: | 4.7 | ||
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2022-10-27 17:32:12 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Comment 26
Miciah Dashiel Butler Masters
2022-01-06 18:48:20 UTC
These two upstream changes may be relevant in that they improve the logging and metrics for certain errors: > do not log NOERROR in log plugin when response is not available > the log plugin logs NOERROR rcode in case of no response is written, this PR instead changes this to log placeholder ( - ), which at least does not mislead the reader of logs https://github.com/coredns/coredns/pull/4725 > when no response is written, fallback to status of next plugin in prometheus plugin > when no response is written from up the chain of plugins, the default value of dnstest.Recorder for rcode (0) is used as rcode reported to the coredns_dns_responses_total metric, which is misleading and wrong. This PR changes the behaviour that when no response is written, the return status of the next plugin is used. https://github.com/coredns/coredns/pull/4727 We will ship a version of CoreDNS with these changes in OpenShift 4.10.0. There is an additional upstream change in CoreDNS that may be of interest to people following this BZ: "plugin/prometheus: write rcode properly to the metrics" <https://github.com/coredns/coredns/pull/5126>. The related issue, <https://github.com/coredns/coredns/issues/5125>, is as follows: > Hello, after bump to the latest 1.8.7 CoreDNS we noticed that CoreDNS prometheus metric related to DNS responses (`coredns_dns_responses_total`) shows wrong rcode (label `rcode`). Even when resolution ends in NXDOMAIN, the metric shows it as NOERROR. OpenShift 4.11.0 will include CoreDNS 1.9.2, which includes <https://github.com/coredns/coredns/pull/5126>. Aside from that, I notice that the cases linked to this BZ are all closed now. Please let me know if this BZ still requires attention. Closing per comment 30. Please re-open the BZ or file a new bug if further attention is needed on the issue. |