The FDP team is no longer accepting new bugs in Bugzilla. Please report your issues under FDP project in Jira. Thanks.
Bug 1960393 - OVSDB RAFT leader should not miss sending heartbeats due to client handling
Summary: OVSDB RAFT leader should not miss sending heartbeats due to client handling
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux Fast Datapath
Classification: Red Hat
Component: ovsdb
Version: RHEL 8.0
Hardware: Unspecified
OS: Unspecified
high
urgent
Target Milestone: ---
: ---
Assignee: Anton Ivanov
QA Contact: Jianlin Shi
URL:
Whiteboard:
Depends On:
Blocks: 1943631 1959597 1962951
TreeView+ depends on / blocked
 
Reported: 2021-05-13 18:49 UTC by Tim Rozet
Modified: 2023-07-28 19:27 UTC (History)
7 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-07-28 19:27:09 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker FD-1311 0 None None None 2021-08-17 18:56:16 UTC

Description Tim Rozet 2021-05-13 18:49:35 UTC
Description of problem:
We see that at scale with many OVSDB clients that the raft leader is so busy in its event loop that it doesn't have time to send the raft heartbeat message. Due to this the election timer fires off, which leads to a leadership change. However then the new leader has the same problem and thus...endless election thrashing. OVSDB should be able to detect that it is approaching the deadline to send the heartbeat, break out of its client loop and send the heartbeat.

Comment 1 Dan Williams 2021-05-26 16:56:48 UTC
Anton had some PoC code for this at https://github.com/kot-begemot-uk/ovs/commit/056417310763e6a55468bd954ac5c47e45205390

Reassigning to him to finish that patch and push upstream.

More context is at https://bugzilla.redhat.com/show_bug.cgi?id=1943631#c25

Comment 5 Tim Rozet 2022-02-21 19:58:03 UTC
Anton are you still going to work on this patch?

Comment 6 Anton Ivanov 2022-02-22 09:06:45 UTC
There is no interest upstream, no interest in reviewing and no interest in this. So why?

Comment 7 Mark Michelson 2023-07-28 19:27:09 UTC
I am closing this since nothing has been done on the posted patch series in a while and it appears this has withered on the vine. I also suspect that the numerous performance improvements made to OVSDB since May 2021 have lessened the severity of this issue. Tim, if this is still an observed problem, then feel free to re-open.


Note You need to log in before you can comment on or make changes to this bug.