Red Hat Bugzilla – Bug 1028090
Implement event queue in the Indication manager
Last modified: 2017-11-01 20:07:20 EDT
The Indication manager we use as a helper/wrapper needs some kind of queue where incoming events will be sent, awaiting further processing. This is to ensure uninterrupted event flow. With the current consecutive watcher() and gather() parts there's a high chance to miss some events as long as watcher() is not always running.
Additionally, when receiving events in batches in some buffer, all events should be fully processed. With no queue, the rest of the buffer is thrown away as long as the watcher needs to end and return success.
Third requirement is a kind of rate-limiting, merging multiple equal events of the same type or for the same object in one, when sent in a short interval (user defined). This will naturally cause slight delay which may be desired for some applications - think of it as a settle timeout.
As a side-effect, the reworked watcher() should deal better with errors when critical issue prevents it from working. A kind of self-recovery would be nice too.
This currently blocks e.g. bug 1026663 to be fully race-free.
Debugging further it turned out there are more rules we should obey to ensure thread safety and play nice with memory allocations:
- any CMPI call that manipulates with instances should be made from the same thread. I.e. don't free allocated instances from other thread than collected.
- hold locks as little time as possible, operate on local variables and then fill shared memory with a quick operation with lock held
- perform proper thread shutdown and cleanup, to free all data, unlock all locks and properly detach threads from CIMOM. No forced thread cancellation.
- handle a scenario when new filters are registered while worker threads are already running (for the first poll case)
- implement a clean way to cancel worker threads from outside, the idea was to use poll()/select() with a side fd acting as a cancellation channel, similar to how GCancellable works. This should be exposed to watcher/gather callbacks and indmanager users should integrate it in their code.
Note to myself: GCancellable principle explained: http://blog.verbum.org/2013/12/03/cancelling-computation-gcancellable-or-sigint-versus-threads-versus-exceptions/
- be sure to check CMPI calls error codes and rc statuses