Bug 1912533 - [Assisted-4.6] [Staging][NTP] No event for NTP unreachable
Summary: [Assisted-4.6] [Staging][NTP] No event for NTP unreachable
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: assisted-installer
Version: 4.6
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 4.7.z
Assignee: Lital Alon
QA Contact: Yuri Obshansky
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-01-04 16:36 UTC by Lital Alon
Modified: 2021-05-04 19:51 UTC (History)
4 users (show)

Fixed In Version: OCP-Metal-V1.0.18.1
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-05-04 19:50:50 UTC
Target Upstream Version:
Embargoed:
lalon: needinfo-
lalon: needinfo-


Attachments (Terms of Use)
Unreachable example (166.17 KB, image/png)
2021-01-04 16:41 UTC, Lital Alon
no flags Details
events (111.84 KB, image/png)
2021-01-04 16:42 UTC, Lital Alon
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2021:1365 0 None None None 2021-05-04 19:51:14 UTC

Description Lital Alon 2021-01-04 16:36:45 UTC
Description of problem:
I simulated NTP Unreachable on one master host.
Host turn Insufficient due to missing NTP sources.
The issue is that i expected Event message to display the reason for host insufficient. Instead event described machine connectivity issue

Host master-0-0: updated status from "known" to "insufficient" (Host cannot be installed due to following failing validation(s): No connectivity to the majority of hosts in the cluster)

AND

Host master-0-0: updated status from "discovering" to "insufficient" (Host cannot be installed due to following failing validation(s): )

See attached examples

Version-Release number of selected component (if applicable):
v1.0.14.2

Steps to Reproduce:
1. Block default NTP to one master node, make sure NTP status is Unreachable
2. Look at the events

Actual results:
failing validation(s): No connectivity to the majority of hosts in the cluster OR empty validation list 

Expected results:
failing validation(s): No NTP sources

Comment 1 Lital Alon 2021-01-04 16:41:40 UTC
Created attachment 1744343 [details]
Unreachable example

Comment 2 Lital Alon 2021-01-04 16:42:16 UTC
Created attachment 1744344 [details]
events

Comment 3 Lital Alon 2021-01-05 16:54:32 UTC
@mfilanov @ygoldber
From some reason the empty validation in the events doesn't reproduce (maybe i used an old cluster). 
Looks like NTP Unreachable doesn't log an event, as the host was insufficient due to connectivity and still is insufficient late due to NTP Unreachable, so we only have this event: 
Host master-0-0: updated status from "discovering" to "insufficient" (Host cannot be installed due to following failing validation(s): No connectivity to the majority of hosts in the cluster)

So no event for NTP unreachable.

Comment 4 Ronnie Lazar 2021-02-08 10:34:12 UTC
Should be resolved by https://issues.redhat.com/browse/MGMT-3561

Comment 5 Yoni Bettan 2021-03-24 08:14:43 UTC
This is not related to https://issues.redhat.com/browse/MGMT-3561

The event you describe is part of a transition in the state machine, that change the host status from "known" to "insufficient", probably at the moment of that transition, NTP wasn't failing, once it fails it wasn't part of the transition so you didn't get a new events (with the form of "Host *: updated status from * to * ...").

Anyway, now, there are other events regarding host/cluster validations that will show the issue to the user:
* Host *: validation * that used to succeed is now failing
* Cluster validation * that used to succeed is now failing
* HOst *: validations * is now fixed
* Cluster validation * is now fixed.

Closing it.

Comment 6 Lital Alon 2021-04-04 10:51:12 UTC
Events including NTP:
Host test-infra-cluster-4cef4a83-worker-1: validation 'ntp-synced' that used to succeed is now failing
Host test-infra-cluster-4cef4a83-worker-1: validation 'ntp-synced' is now fixed
Host test-infra-cluster-4cef4a83-worker-1: updated status from "discovering" to "insufficient" (Host cannot be installed due to following failing validation(s): Host couldn't synchronize with any NTP server ; No connectivity to the majority of hosts in the cluster)

Verified in Staging  OCP-Metal-V1.0.18.1

Comment 9 errata-xmlrpc 2021-05-04 19:50:50 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.7.9 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:1365


Note You need to log in before you can comment on or make changes to this bug.