Description of problem: The leader monitor periordically tells tcmalloc to release memory back to the OS, but follower monitors do not. This can result in follower monitors using more memory than their memory target, and potentially getting oom killed. A workaround is to reset the mon_memory_target config option, which will cause all monitors to ask tcmalloc to release its free memory. Alternately, mon_memory_autotune can be disabled. Version-Release number of selected component (if applicable): 4.0 and later. How reproducible: deterministic, though workload to reproduce is unclear Steps to Reproduce: Set up cluster as in https://bugzilla.redhat.com/show_bug.cgi?id=1825312 Actual results: >1GB RSS memory used by follower monitors Expected results: ~1GB RSS memory used by all monitors
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat Ceph Storage 4.1 Bug Fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:4144