Description of problem:
Possible live-lock of rbd-mirror daemon's image replayer state machine during stop
Version-Release number of selected component (if applicable):
rare -- integration test rbd_mirror_stress occasionally hits the issue
Steps to Reproduce:
1. mirror an image
2. stop rbd-mirror on the non-primary
3. inject lots of IO events into the primary image
4. restart rbd-mirror on the non-primary site
5. while rbd-mirror is replaying the IO events, attempt a graceful shut down
There is a possibility that the image replayer state machine will get live-locked in the STOPPING state.
The image replayer should cleanly shut down.
verified with version - 10.2.5-29.el7cp.x86_64
followed steps mentioned in description and works for me hence moving to verified
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory, and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.