Bug 2013646
Summary: | fsync controller will show false positive if gaps in metrics are observed. | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Lili Cosic <lcosic> |
Component: | Etcd | Assignee: | Nobody <nobody> |
Status: | CLOSED ERRATA | QA Contact: | Sandeep <skundu> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.7 | CC: | geliu, skundu |
Target Milestone: | --- | ||
Target Release: | 4.10.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | 2008175 | Environment: | |
Last Closed: | 2022-03-10 16:19:33 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 2008175 |
Description
Lili Cosic
2021-10-13 13:02:17 UTC
oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.10.0-0.nightly-2022-02-16-171622 True False 5h23m Cluster version is 4.10.0-0.nightly-2022-02-16-171622 sh-4.4# etcdctl endpoint status -w table +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | ENDPOINT | ID | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS | +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | https://10.0.0.3:2379 | 6951e8940830cac5 | 3.5.0 | 78 MB | true | false | 10 | 172448 | 172448 | | | https://10.0.0.4:2379 | 5ce2c3932bb984e2 | 3.5.0 | 78 MB | false | false | 10 | 172448 | 172448 | | | https://10.0.0.5:2379 | 41849292a8dfd31b | 3.5.0 | 78 MB | false | false | 10 | 172449 | 172449 | | +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ Changed the leader few times (more than 2). sh-4.4# etcdctl endpoint status -w table +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | ENDPOINT | ID | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS | +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ | https://10.0.0.3:2379 | 6951e8940830cac5 | 3.5.0 | 78 MB | false | false | 10 | 172448 | 172448 | | | https://10.0.0.4:2379 | 5ce2c3932bb984e2 | 3.5.0 | 78 MB | true | false | 10 | 172448 | 172448 | | | https://10.0.0.5:2379 | 41849292a8dfd31b | 3.5.0 | 78 MB | false | false | 10 | 172449 | 172449 | | +-----------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+ the rules:etcd_server_leader_changes_seen_total are getting updated. $ oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- curl -k -H "Authorization: Bearer sha256~lIpa1V2jbdLyrNBeRvihrXphl-b5mglXOmAsyEKYM9o" 'https://prometheus-k8s.openshift-monitoring.svc:9091/api/v1/query?query=etcd_server_leader_changes_seen_total' | jq % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 851 0 851 0 0 47277 0 --:--:-- --:--:-- --:--:-- 47277 { "status": "success", "data": { "resultType": "vector", "result": [ { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.141.21:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-141-21.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311116.534, "3" ] }, { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.170.2:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-170-2.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311116.534, "1" ] }, { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.220.249:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-220-249.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311116.534, "2" ] } ] } } $ oc -n openshift-monitoring exec -c prometheus prometheus-k8s-0 -- curl -k -H "Authorization: Bearer sha256~lIpa1V2jbdLyrNBeRvihrXphl-b5mglXOmAsyEKYM9o" 'https://prometheus-k8s.openshift-monitoring.svc:9091/api/v1/query?query=etcd_server_leader_changes_seen_total' | jq % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 851 0 851 0 0 56733 0 --:--:-- --:--:-- --:--:-- 56733 { "status": "success", "data": { "resultType": "vector", "result": [ { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.141.21:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-141-21.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311282.715, "1" ] }, { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.170.2:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-170-2.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311282.715, "2" ] }, { "metric": { "__name__": "etcd_server_leader_changes_seen_total", "endpoint": "etcd-metrics", "instance": "10.0.220.249:9979", "job": "etcd", "namespace": "openshift-etcd", "pod": "etcd-ip-10-0-220-249.us-east-2.compute.internal", "service": "etcd" }, "value": [ 1637311282.715, "3" ] } ] } } Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:0056 |