Bug 1992172 - [ovn][metrics/alerts] Expose longer poll intervals on OVN components to determine load
Keywords:
Status: NEW
Alias: None
Product: Red Hat Enterprise Linux Fast Datapath
Classification: Red Hat
Component: OVN
Version: FDP 21.C
Hardware: Unspecified
OS: Unspecified
Priority: medium
Severity: unspecified
Target Milestone: ---
Target Release: ---
Assignee: OVN Team
QA Contact: Jianlin Shi
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2021-08-10 17:18 UTC by Surya Seetharaman
Modified: 2023-07-13 07:25 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Target Upstream Version:
Embargoed:




Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker FD-1482 0 None None None 2021-08-10 17:22:07 UTC

Description Surya Seetharaman 2021-08-10 17:18:27 UTC
Description of problem:

We see many “WARN|Unreasonably long 13636ms poll interval” log messages in scaled-up clusters [OCP-OVN-K]. These can be attributed to sbdb, the controller, or northd being busy with some operation, and could therefore indicate load on the cluster.

It would be good to expose this as a metric or alert, configure a reasonable threshold based on the size of the cluster, and document it, so that there is an easier way to tell whether OVN is busy.
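
To make the request concrete, here is a minimal sketch of how such a signal could be derived from existing logs. It is not an OVN feature: the function names, the summary keys, and the 5000 ms default threshold are all hypothetical, and the pattern only assumes the warning text quoted above.

```python
import re
from typing import Dict, Iterable, List

# Matches only the warning text quoted in this report; the rest of the
# log line format is not assumed.
POLL_RE = re.compile(r"Unreasonably long (\d+)ms poll interval")


def long_poll_intervals_ms(lines: Iterable[str]) -> List[int]:
    """Return the reported poll interval (in ms) for every matching line."""
    durations = []
    for line in lines:
        match = POLL_RE.search(line)
        if match:
            durations.append(int(match.group(1)))
    return durations


def summarize(lines: Iterable[str], threshold_ms: int = 5000) -> Dict[str, int]:
    """Summarize long-poll warnings; threshold_ms is a hypothetical default."""
    durations = long_poll_intervals_ms(lines)
    return {
        "count": len(durations),
        "max_ms": max(durations, default=0),
        "over_threshold": sum(1 for d in durations if d > threshold_ms),
    }


if __name__ == "__main__":
    sample = [
        "WARN|Unreasonably long 13636ms poll interval",
        "INFO|some unrelated message",
        "WARN|Unreasonably long 1200ms poll interval",
    ]
    print(summarize(sample))
```

The three values in the summary could each back a metric or an alert rule; a suitable threshold would still need to be chosen per cluster size, as noted above.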

It's more of a feature/RFE than a bug. This was discussed briefly during the OVN-OVN-K sync-up on August 10th.

