Bug 2223279

Summary: [mgr] ceph-mgr fails with FAILED ceph_assert(pending_service_map.epoch > service_map.epoch)
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Tomas Petr <tpetr>
Component: RADOS Assignee: Radoslaw Zarzynski <rzarzyns>
Status: NEW --- QA Contact: Pawan <pdhiran>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.3 CC: bhubbard, bkunal, ceph-eng-bugs, cephqe-warriors, nojha, vumrao
Target Milestone: ---   
Target Release: 4.3z2   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2095062    
Bug Blocks:    

Description Tomas Petr 2023-07-17 08:34:53 UTC
Description of problem:
ceph-mgr fails with:
    "assert_msg": "/builddir/build/BUILD/ceph-14.2.22/src/mgr/DaemonServer.cc: In function 'DaemonServer::got_service_map()::<lambda(const ServiceMap&)>' thread 7f4e94ae3700 time 2022-11-10 10:47:04.248006\n/builddir/build/BUILD/ceph-14.2.22/src/mgr/DaemonServer.cc: 2883: FAILED ceph_assert(pending_service_map.epoch > service_map.epoch)\n",
    "backtrace": [
        "(()+0x12c20) [0x7f4e9de95c20]",
        "(gsignal()+0x10f) [0x7f4e9c8dc37f]",
        "(abort()+0x127) [0x7f4e9c8c6db5]",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a7) [0x7f4ea07f2f5d]",
        "(()+0x276126) [0x7f4ea07f3126]",
        "(DaemonServer::got_service_map()+0xf2a) [0x55567f31de7a]",
        "(Mgr::handle_service_map(MServiceMap*)+0x145) [0x55567f34f775]",
        "(Mgr::ms_dispatch(Message*)+0x27b) [0x55567f351cab]",
        "(MgrStandby::ms_dispatch(Message*)+0x9b) [0x55567f359c6b]",
        "(Dispatcher::ms_dispatch2(boost::intrusive_ptr<Message> const&)+0x2a) [0x55567f34292a]",
        "(DispatchQueue::entry()+0x134a) [0x7f4ea0a4ec8a]",
        "(DispatchQueue::DispatchThread::entry()+0x11) [0x7f4ea0b055c1]",
        "(()+0x817a) [0x7f4e9de8b17a]",
        "(clone()+0x43) [0x7f4e9c9a1df3]"
    ],
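
For triage, the failing source location and the assert condition can be pulled out of the "assert_msg" field mechanically. The following is a minimal sketch, assuming the crash metadata is available as JSON text; the helper name parse_assert_msg, the variable CRASH_FRAGMENT and the regular expression are illustrative and not part of Ceph. The fragment quoted above is wrapped in braces here only so that it parses as a JSON object.

```python
import json
import re

# Matches the line of a Ceph assert message that has the form
#   <source file>: <line number>: FAILED ceph_assert(<condition>)
ASSERT_RE = re.compile(
    r"^(?P<file>\S+): (?P<line>\d+): FAILED ceph_assert\((?P<cond>.*)\)$",
    re.MULTILINE,
)


def parse_assert_msg(assert_msg):
    """Return (file, line, condition) from an assert_msg string, or None."""
    match = ASSERT_RE.search(assert_msg)
    if match is None:
        return None
    return match.group("file"), int(match.group("line")), match.group("cond")


# The "assert_msg" entry from this report. A raw string keeps the \n
# escapes intact so that json.loads decodes them into real newlines.
CRASH_FRAGMENT = r'''
{
    "assert_msg": "/builddir/build/BUILD/ceph-14.2.22/src/mgr/DaemonServer.cc: In function 'DaemonServer::got_service_map()::<lambda(const ServiceMap&)>' thread 7f4e94ae3700 time 2022-11-10 10:47:04.248006\n/builddir/build/BUILD/ceph-14.2.22/src/mgr/DaemonServer.cc: 2883: FAILED ceph_assert(pending_service_map.epoch > service_map.epoch)\n"
}
'''

if __name__ == "__main__":
    crash = json.loads(CRASH_FRAGMENT)
    print(parse_assert_msg(crash["assert_msg"]))
```

Grouping crash reports by the returned (file, line, condition) tuple is one simple way to check whether other ceph-mgr crashes on the same cluster hit this same assert.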


Version-Release number of selected component (if applicable):
14.2.22-110.el8cp
