Description of problem: etcd timed out, lost connectivity to one of the members briefly, then reconnected. Version-Release number of selected component (if applicable): 3.5.5.31-1 How reproducible: I've not been able to reproduce this Actual results: Sep 29 19:14:04 $node.ec2.internal etcd[74733]: etcdserver: request timed out, possibly due to previous leader failure Sep 29 19:14:06 $node.ec2.internal etcd[74733]: etcdserver: request timed out, possibly due to previous leader failure Sep 29 19:14:11 $node.ec2.internal etcd[74733]: lost the TCP streaming connection with peer 3727844635f090bc (stream MsgApp v2 reader) Expected results: the etcdserver requests should not lose the streaming connection
We need to alert on >25% etcd memory usage on a master and force an upgrade or prune.
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days