Description of problem: Possible live-lock of rbd-mirror daemon's image replayer state machine during stop Version-Release number of selected component (if applicable): 10.x How reproducible: rare -- integration test rbd_mirror_stress occasionally hits the issue Steps to Reproduce: 1. mirror an image 2. stop rbd-mirror on the non-primary 3. inject lots of IO events into the primary image 4. restart rbd-mirror on the non-primary site 5. while rbd-mirror is replaying the IO events, attempt a graceful shut down Actual results: There is a possibility that the image replayer state machine will get live-locked in the STOPPING state. Expected results: The image replayer should cleanly shut down. Additional info:
verified with version - 10.2.5-29.el7cp.x86_64 followed steps mentioned in description and works for me hence moving to verified
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2017-0514.html