Bug 1845369 - Resume a cluster from sleep, Alerts keep firing even though everything is fine
Summary: Resume a cluster from sleep, Alerts keep firing even though everything is fine
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Node
Version: 4.5
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: 4.5.0
Assignee: Ryan Phillips
QA Contact: Sunil Choudhary
URL:
Whiteboard:
Depends On: 1839098
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-06-09 03:30 UTC by OpenShift BugZilla Robot
Modified: 2020-07-13 17:44 UTC (History)
15 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-07-13 17:43:40 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift origin pull 25127 0 None closed Bug 1845369: UPSTREAM: 91500: reduce race risk in kubelet for missing KUBERNETES_SERVICE_HOST 2020-11-03 16:56:01 UTC
Red Hat Product Errata RHBA-2020:2409 0 None None None 2020-07-13 17:44:06 UTC

Comment 6 Sunil Choudhary 2020-06-22 10:21:23 UTC
Verified on 4.5.0-0.nightly-2020-06-20-194346. Stopped cluster for 24 hours by stopping VMs from AWS console. Started after 24 hours, approved CSRs and waited for 15-20 minutes. Do not see any Down alerts from console.

$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.5.0-0.nightly-2020-06-20-194346   True        False         26h     Cluster version is 4.5.0-0.nightly-2020-06-20-194346

$ oc get co
NAME                                       VERSION                             AVAILABLE   PROGRESSING   DEGRADED   SINCE
authentication                             4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
cloud-credential                           4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
cluster-autoscaler                         4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
config-operator                            4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
console                                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      23m
csi-snapshot-controller                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
dns                                        4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
etcd                                       4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
image-registry                             4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
ingress                                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      23m
insights                                   4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
kube-apiserver                             4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
kube-controller-manager                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
kube-scheduler                             4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
kube-storage-version-migrator              4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
machine-api                                4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
machine-approver                           4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
machine-config                             4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
marketplace                                4.5.0-0.nightly-2020-06-20-194346   True        False         False      25m
monitoring                                 4.5.0-0.nightly-2020-06-20-194346   True        False         False      23m
network                                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
node-tuning                                4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
openshift-apiserver                        4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
openshift-controller-manager               4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
openshift-samples                          4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
operator-lifecycle-manager                 4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
operator-lifecycle-manager-catalog         4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
operator-lifecycle-manager-packageserver   4.5.0-0.nightly-2020-06-20-194346   True        False         False      20m
service-ca                                 4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h
storage                                    4.5.0-0.nightly-2020-06-20-194346   True        False         False      26h

$ oc get nodes -o wide
NAME                                         STATUS   ROLES    AGE   VERSION           INTERNAL-IP    EXTERNAL-IP   OS-IMAGE                                                       KERNEL-VERSION                CONTAINER-RUNTIME
ip-10-0-137-200.us-east-2.compute.internal   Ready    worker   26h   v1.18.3+1b98519   10.0.137.200   <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8
ip-10-0-156-81.us-east-2.compute.internal    Ready    master   26h   v1.18.3+1b98519   10.0.156.81    <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8
ip-10-0-177-137.us-east-2.compute.internal   Ready    master   26h   v1.18.3+1b98519   10.0.177.137   <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8
ip-10-0-190-200.us-east-2.compute.internal   Ready    worker   26h   v1.18.3+1b98519   10.0.190.200   <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8
ip-10-0-192-158.us-east-2.compute.internal   Ready    worker   26h   v1.18.3+1b98519   10.0.192.158   <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8
ip-10-0-213-239.us-east-2.compute.internal   Ready    master   26h   v1.18.3+1b98519   10.0.213.239   <none>        Red Hat Enterprise Linux CoreOS 45.82.202006201629-0 (Ootpa)   4.18.0-193.9.1.el8_2.x86_64   cri-o://1.18.2-15.dev.rhaos4.5.git7c4494f.el8

Comment 7 errata-xmlrpc 2020-07-13 17:43:40 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:2409


Note You need to log in before you can comment on or make changes to this bug.