Bug 1718135 - Multiple MDS crashing with assert(mds->sessionmap.get_version() == cmapv) in ESessions::replay while replaying journal
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: CephFS
Version: 3.1
Hardware: x86_64
OS: Linux
Priority: medium
Severity: medium
Target Milestone: rc
Target Release: 3.3
Assignee: Yan, Zheng
QA Contact: subhash
URL:
Whiteboard:
Depends On:
Blocks: 1726135
Reported: 2019-06-07 03:18 UTC by Prashant Dhange
Modified: 2019-08-21 15:11 UTC (History)
CC List: 10 users

Fixed In Version: RHEL: ceph-12.2.12-18.el7cp Ubuntu: ceph_12.2.12-16redhat1xenial
Doc Type: Bug Fix
Doc Text:
.Partially flushed `ESessions` log events no longer cause the MDS to fail
Previously, when a Ceph Metadata Server (MDS) had more than 1024 client sessions, the sessions in an `ESessions` log event could be only partially flushed. The journal replay code expected the sessions in an `ESessions` log event to be either all flushed or not flushed at all, so a partial flush caused the MDS to fail during replay. With this update, the journal replay code can handle a partially flushed `ESessions` log event.
Clone Of:
Environment:
Last Closed: 2019-08-21 15:11:09 UTC
Embargoed:
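
Illustration of the Doc Text above (not Ceph code): the following is a minimal toy model of why an all-or-nothing replay can trip a version assertion when only some of an event's sessions were flushed, and how a replay that skips the already-saved sessions avoids it. It assumes that opening a session bumps the session map version by exactly one. The names `SessionMap`, `replay_strict`, and `replay_tolerant` are hypothetical and do not correspond to the actual MDS implementation or to the actual fix.

```python
class SessionMap:
    """Toy stand-in for a session map: one version bump per opened session."""

    def __init__(self, version=0, sessions=None):
        self.version = version
        self.sessions = set(sessions or ())

    def open_session(self, client):
        self.sessions.add(client)
        self.version += 1


def replay_strict(smap, clients, cmapv):
    """All-or-nothing replay: assumes none of the event's sessions were flushed."""
    if smap.version >= cmapv:
        return  # the flushed map already reflects the whole event
    for client in clients:
        smap.open_session(client)
    # Fails if some sessions were already flushed, because the version overshoots.
    assert smap.version == cmapv, "sessionmap version != cmapv"


def replay_tolerant(smap, clients, cmapv):
    """Replay that skips sessions the flushed map already accounts for."""
    if smap.version >= cmapv:
        return
    pending = cmapv - smap.version              # sessions not yet flushed
    already_saved = max(len(clients) - pending, 0)
    for client in clients[already_saved:]:
        smap.open_session(client)
    smap.version = cmapv                        # converge on the event's version


if __name__ == "__main__":
    clients = ["c1", "c2", "c3", "c4"]          # event opened 4 sessions: v10 -> v14
    partial = SessionMap(12, {"c1", "c2"})      # only the first two were flushed
    try:
        replay_strict(partial, clients, 14)
    except AssertionError as exc:
        print("strict replay failed:", exc)

    partial = SessionMap(12, {"c1", "c2"})
    replay_tolerant(partial, clients, 14)
    print("tolerant replay version:", partial.version)
```

In this model the strict replay re-opens all four sessions on top of a map that already counted two of them, so the version ends at 16 instead of the recorded 14. The tolerant variant derives how many sessions were already saved from the version gap and replays only the remainder.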


Links
System ID | Private | Priority | Status | Summary | Last Updated
Ceph Project Bug Tracker 40211 | 0 | None | None | None | 2019-06-08 13:34:18 UTC
Red Hat Knowledge Base (Solution) 4208321 | 0 | Troubleshoot | None | Why MDS crash with assert(mds->sessionmap.get_version() == cmapv) in RHCS? | 2019-06-21 22:02:28 UTC
Red Hat Product Errata RHSA-2019:2538 | 0 | None | None | None | 2019-08-21 15:11:26 UTC

Comment 18 Yan, Zheng 2019-06-08 13:34:18 UTC
I was wrong.  The issue was not caused by commit 8048865b "mds: properly dirty sessions opened by journal replay". It should be caused by https://tracker.ceph.com/issues/40211

Comment 19 Vikhyat Umrao 2019-06-10 16:06:31 UTC
(In reply to Yan, Zheng from comment #18)
> I was wrong.  The issue was not caused by commit 8048865b "mds: properly
> dirty sessions opened by journal replay". It should be caused by
> https://tracker.ceph.com/issues/40211

Reopening as per comment#18.

Comment 36 Yan, Zheng 2019-06-13 16:45:34 UTC
luminous backport is at https://github.com/ceph/ceph/pull/28536

Comment 51 errata-xmlrpc 2019-08-21 15:11:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2019:2538

