Bug 1313305 - Calamari must filter duplicate events before pushing it to salt event bus.
Summary: Calamari must filter duplicate events before pushing it to salt event bus.
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Calamari
Version: 1.3.1
Hardware: Unspecified
OS: Unspecified
unspecified
medium
Target Milestone: rc
: 2.2
Assignee: Christina Meno
QA Contact: ceph-qe-bugs
URL:
Whiteboard:
Depends On:
Blocks: 1291304
TreeView+ depends on / blocked
 
Reported: 2016-03-01 11:20 UTC by Darshan
Modified: 2022-02-21 18:06 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-01-05 18:41:23 UTC
Embargoed:


Attachments (Terms of Use)

Description Darshan 2016-03-01 11:20:00 UTC
Description of problem:
Some events related to osd, mon, cluster state change are emitted from all the calamari-lite instances. Since USM will be listening to multiple calamari-lite instances for events, it must send event(push event to salt bus) from only one instance. which instance has to send can be decided based on if it is residing on a leader mon node.

Version-Release number of selected component (if applicable):


How reproducible:
Always

Steps to Reproduce:
1. Have a ceph setup with multiple mon nodes(hence multiple calamari-lite instances)
2. Simulate an event for osd state change.
3. 

Actual results:
Same event would be sent from multiple calamari-lite instances to salt event bus.

Expected results:
A single event should be sent from only one calamari-lite instance to salt bus.

Additional info:

Comment 2 Christina Meno 2016-04-06 21:30:18 UTC
I believe that we should address this issue in a different way.
calamari-lite running on all ceph monitors will be a risk to cluster stability and data integrity.

I think that the storage-console should choose a single monitor to enable calamari on in the first release. That way when the inevitable happens we only loose management temporarily and not trigger data-loss

If we organize this way event filtering won't be needed until we design calamari to be HA.

Mrugesh what do you think about this approach as risk-mitigation?


Note You need to log in before you can comment on or make changes to this bug.