Bug 2078988

Summary: network-metrics-daemon makes costly global pod list calls scaling per node
Product: OpenShift Container Platform
Reporter: OpenShift BugZilla Robot <openshift-bugzilla-robot>
Component: Networking
Assignee: Sebastian Scheinkman <sscheink>
Networking sub component: multus
QA Contact: Weibin Liang <weliang>
Status: CLOSED ERRATA
Severity: high    
Priority: high    
Version: 4.8   
Target Milestone: ---   
Target Release: 4.10.z   
Hardware: Unspecified   
OS: Unspecified   
Last Closed: 2022-05-23 13:25:10 UTC
Bug Depends On: 2078954    
Bug Blocks: 2064371, 2087049    

Comment 3 Weibin Liang 2022-05-17 18:39:37 UTC
Scaled a cluster to 18182 pods and didn't see any large range reads on pods in 4.10.15.

[weliang@weliang ~]$ oc get clusterversion
NAME      VERSION   AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.10.15   True        False         3h15m   Cluster version is 4.10.15
[weliang@weliang ~]$ oc get pod --all-namespaces | wc -l
18182
[weliang@weliang ~]$ oc logs etcd-ip-10-0-138-182.us-east-2.compute.internal -c etcd | grep range_response_count
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"734.568921ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" serializable:true keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"406.379952ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" serializable:true keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"481.785683ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/secrets/openshift-kube-apiserver/node-kubeconfigs\" ","response":"range_response_count:1 size:43450"}
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"776.801648ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/persistentvolumeclaims/\" range_end:\"/kubernetes.io/persistentvolumeclaims0\" count_only:true ","response":"range_response_count:0 size:6"}
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"501.658103ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T15:21:00.400Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"590.435979ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/operators.coreos.com/operatorconditions/\" range_end:\"/kubernetes.io/operators.coreos.com/operatorconditions0\" count_only:true ","response":"range_response_count:0 size:8"}
{"level":"warn","ts":"2022-05-17T15:21:00.405Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"323.740459ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/rolebindings/openshift-kube-controller-manager/system:openshift:leader-election-lock-cluster-policy-controller\" ","response":"range_response_count:1 size:729"}
{"level":"warn","ts":"2022-05-17T15:21:00.405Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"316.262942ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/configmaps/openshift-kube-controller-manager-operator/csr-signer-ca\" ","response":"range_response_count:1 size:2691"}
{"level":"warn","ts":"2022-05-17T15:21:00.405Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"209.496173ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/operators.coreos.com/clusterserviceversions/\" range_end:\"/kubernetes.io/operators.coreos.com/clusterserviceversions0\" count_only:true ","response":"range_response_count:0 size:8"}
{"level":"warn","ts":"2022-05-17T15:21:00.405Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"260.770344ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/apiregistration.k8s.io/apiservices/v1.build.openshift.io\" ","response":"range_response_count:1 size:3087"}
{"level":"warn","ts":"2022-05-17T16:01:11.375Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.422025337s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/serviceaccounts/openshift-cluster-csi-drivers/aws-ebs-csi-driver-node-sa\" ","response":"range_response_count:1 size:421"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.168266953s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/configmaps/kube-system/kube-controller-manager\" ","response":"range_response_count:1 size:610"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"536.053839ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"995.567988ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"333.888874ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/apiserver.openshift.io/apirequestcounts/clusterautoscalers.v1.autoscaling.openshift.io\" ","response":"range_response_count:1 size:8551"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.053947255s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"health\" ","response":"range_response_count:0 size:6"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.061312234s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/secrets/openshift-kube-controller-manager/localhost-recovery-client-token\" ","response":"range_response_count:1 size:17769"}
{"level":"warn","ts":"2022-05-17T16:01:11.377Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"711.199346ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/openshift.io/health\" ","response":"range_response_count:0 size:6"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.023969125s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/ranges/servicenodeports\" ","response":"range_response_count:1 size:417"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"769.617625ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/kube-node-lease\" ","response":"range_response_count:1 size:748"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"296.861477ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"785.427626ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"984.031134ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/flowschemas/probes\" ","response":"range_response_count:1 size:1072"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"777.79829ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/operators.coreos.com/operatorgroups/\" range_end:\"/kubernetes.io/operators.coreos.com/operatorgroups0\" count_only:true ","response":"range_response_count:0 size:8"}
{"level":"warn","ts":"2022-05-17T16:07:56.543Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"1.003726702s","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/services/specs/\" range_end:\"/kubernetes.io/services/specs0\" ","response":"range_response_count:79 size:124006"}
{"level":"warn","ts":"2022-05-17T16:07:56.544Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"422.693867ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/events/\" range_end:\"/kubernetes.io/events0\" count_only:true ","response":"range_response_count:0 size:9"}
{"level":"warn","ts":"2022-05-17T16:16:57.534Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"266.245612ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/serviceaccounts/openshift-kube-controller-manager/localhost-recovery-client\" ","response":"range_response_count:1 size:424"}
{"level":"warn","ts":"2022-05-17T16:55:43.017Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"206.476515ms","expected-duration":"200ms","prefix":"read-only range ","request":"limit:1 keys_only:true ","response":"range_response_count:0 size:6"}
{"level":"warn","ts":"2022-05-17T17:41:19.937Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"244.776847ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/health\" ","response":"range_response_count:0 size:6"}
{"level":"warn","ts":"2022-05-17T18:02:58.241Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"258.801481ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/namespaces/default\" keys_only:true ","response":"range_response_count:1 size:53"}
{"level":"warn","ts":"2022-05-17T18:09:12.672Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"267.265282ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/serviceaccounts/openshift-cloud-network-config-controller/cloud-network-config-controller\" ","response":"range_response_count:1 size:1066"}
{"level":"warn","ts":"2022-05-17T18:09:12.676Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"214.171925ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/operator.openshift.io/openshiftcontrollermanagers/cluster\" ","response":"range_response_count:1 size:3731"}
{"level":"warn","ts":"2022-05-17T18:09:12.676Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"240.634034ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/network.openshift.io/clusternetworks/default\" ","response":"range_response_count:1 size:1106"}
{"level":"warn","ts":"2022-05-17T18:09:12.676Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"262.325664ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/minions/\" range_end:\"/kubernetes.io/minions0\" ","response":"range_response_count:27 size:169702"}
{"level":"warn","ts":"2022-05-17T18:15:04.499Z","caller":"etcdserver/util.go:166","msg":"apply request took too long","took":"223.187298ms","expected-duration":"200ms","prefix":"read-only range ","request":"key:\"/kubernetes.io/apiextensions.k8s.io/customresourcedefinitions/operatorconditions.operators.coreos.com\" ","response":"range_response_count:1 size:19449"}
[weliang@weliang ~]$
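
The manual check above (grep for range_response_count, then read through the output for large reads on pods) could be automated with something like the sketch below. It is not part of the original verification. The pod key prefix /kubernetes.io/pods/ is assumed from the other /kubernetes.io/... keys in the log, the min_count threshold of 500 is an arbitrary illustrative value, and the pod_read sample line is hypothetical (no such line appears in the output above).

```python
import json
import re

# Assumption: pod objects live under this etcd key prefix, mirroring the
# /kubernetes.io/... keys visible in the log output above.
POD_PREFIX = "/kubernetes.io/pods/"
COUNT_RE = re.compile(r"range_response_count:(\d+)")


def large_pod_range_reads(lines, min_count=500):
    """Return (ts, took, count) for slow range reads over the pod keyspace.

    min_count is an illustrative threshold, not a value from the bug report.
    """
    hits = []
    for line in lines:
        line = line.strip()
        if not line.startswith("{"):
            continue  # skip shell prompts and other non-JSON output
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or malformed log lines
        match = COUNT_RE.search(entry.get("response", ""))
        if match is None:
            continue
        count = int(match.group(1))
        is_pod_range = f'key:"{POD_PREFIX}' in entry.get("request", "")
        if is_pod_range and count >= min_count:
            hits.append((entry.get("ts"), entry.get("took"), count))
    return hits


# Fields copied from the services/specs warning in the log above.
services_read = json.dumps({
    "ts": "2022-05-17T16:07:56.543Z",
    "took": "1.003726702s",
    "request": 'key:"/kubernetes.io/services/specs/" '
               'range_end:"/kubernetes.io/services/specs0" ',
    "response": "range_response_count:79 size:124006",
})

# Hypothetical line showing what a global pod list would look like.
pod_read = json.dumps({
    "ts": "hypothetical",
    "took": "2s",
    "request": 'key:"/kubernetes.io/pods/" range_end:"/kubernetes.io/pods0" ',
    "response": "range_response_count:18000 size:1",
})

print(large_pod_range_reads([services_read]))
print(large_pod_range_reads([services_read, pod_read]))
```

An empty result for the real log lines would match the observation in this comment: the slow requests are single-key reads or small ranges, none of them over the pod keyspace.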

Comment 5 errata-xmlrpc 2022-05-23 13:25:10 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.10.15 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:2258