Bug 2039294
| Summary: | SDN controller metrics cannot be consumed correctly by prometheus | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Martin Kennelly <mkennell> |
| Component: | Networking | Assignee: | Martin Kennelly <mkennell> |
| Networking sub component: | openshift-sdn | QA Contact: | Weibin Liang <weliang> |
| Status: | CLOSED ERRATA | Docs Contact: | |
| Severity: | high | ||
| Priority: | high | CC: | weliang |
| Version: | 4.10 | ||
| Target Milestone: | --- | ||
| Target Release: | 4.10.0 | ||
| Hardware: | All | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | No Doc Update | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2022-03-12 04:40:34 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Martin Kennelly
2022-01-11 13:01:22 UTC
Tested and verified in 4.10.0-0.nightly-2022-01-21-074618
[weliang@weliang verification-tests]$ oc get pod -o wide -n openshift-sdn
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
sdn-26qp5 2/2 Running 0 38m 10.0.161.127 ip-10-0-161-127.us-east-2.compute.internal <none> <none>
sdn-controller-74tzj 1/1 Running 0 45m 10.0.135.231 ip-10-0-135-231.us-east-2.compute.internal <none> <none>
sdn-controller-m5r5s 1/1 Running 0 45m 10.0.179.115 ip-10-0-179-115.us-east-2.compute.internal <none> <none>
sdn-controller-xb4lj 1/1 Running 0 45m 10.0.204.42 ip-10-0-204-42.us-east-2.compute.internal <none> <none>
sdn-dr5kn 2/2 Running 0 45m 10.0.204.42 ip-10-0-204-42.us-east-2.compute.internal <none> <none>
sdn-hrpth 2/2 Running 0 45m 10.0.179.115 ip-10-0-179-115.us-east-2.compute.internal <none> <none>
sdn-n7b9g 2/2 Running 0 37m 10.0.215.226 ip-10-0-215-226.us-east-2.compute.internal <none> <none>
sdn-xpnsr 2/2 Running 0 38m 10.0.128.67 ip-10-0-128-67.us-east-2.compute.internal <none> <none>
sdn-z7pdw 2/2 Running 0 45m 10.0.135.231 ip-10-0-135-231.us-east-2.compute.internal <none> <none>
[weliang@weliang verification-tests]$ oc -n openshift-sdn get cm openshift-network-controller -o yaml | grep holderIdentity
control-plane.alpha.kubernetes.io/leader: '{"holderIdentity":"ip-10-0-179-115","leaseDurationSeconds":137,"acquireTime":"2022-01-21T15:23:11Z","renewTime":"2022-01-21T16:08:50Z","leaderTransitions":0}'
[weliang@weliang verification-tests]$ oc exec -n openshift-sdn sdn-hrpth -- curl localhost:29100/metrics | grep -i egress_fire
Defaulted container "sdn" out of: sdn, kube-rbac-proxy
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 1017 100 1017 0 0 993k 0 --:--:-- --:--:-- --:--:-- 993k
# HELP sdn_controller_num_egress_firewall_rules The number of egress firewall rules defined
# TYPE sdn_controller_num_egress_firewall_rules gauge
sdn_controller_num_egress_firewall_rules 2
# HELP sdn_controller_num_egress_firewalls The number of egress firewall policies
# TYPE sdn_controller_num_egress_firewalls gauge
sdn_controller_num_egress_firewalls 1
[weliang@weliang verification-tests]$ oc exec -n openshift-sdn sdn-controller-74tzj -- curl localhost:29100/metrics | grep -i egress_fire
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 495 100 495 0 0 483k 0 --:--:-- --:--:-- --:--:-- 483k
[weliang@weliang verification-tests]$ oc exec -n openshift-sdn sdn-controller-m5r5s -- curl localhost:29100/metrics | grep -i egress_fire
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 1017 100 1017 0 0 993k 0 --:--:-- --:--:-- --:--:-- 993k
# HELP sdn_controller_num_egress_firewall_rules The number of egress firewall rules defined
# TYPE sdn_controller_num_egress_firewall_rules gauge
sdn_controller_num_egress_firewall_rules 2
# HELP sdn_controller_num_egress_firewalls The number of egress firewall policies
# TYPE sdn_controller_num_egress_firewalls gauge
sdn_controller_num_egress_firewalls 1
[weliang@weliang verification-tests]$ oc exec -n openshift-sdn sdn-controller-xb4lj -- curl localhost:29100/metrics | grep -i egress_fire
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 495 100 495 0 0 483k 0 --:--:-- --:--:-- --:--:-- 483k
[weliang@weliang verification-tests]$
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:0056 |