Bug 1847685

Summary: monitor crashes due to negative timespan
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Josh Durgin <jdurgin>
Component: RADOSAssignee: Neha Ojha <nojha>
Status: CLOSED ERRATA QA Contact: Manohar Murthy <mmurthy>
Severity: high Docs Contact: Aron Gunn <agunn>
Priority: high    
Version: 4.1CC: agunn, akupczyk, bhubbard, ceph-eng-bugs, dzafman, hfukumot, kchai, mhackett, mkasturi, muagarwa, nojha, pdhange, rzarzyns, sostapov, sseshasa, tdesala, tserlin, vereddy
Target Milestone: z1Keywords: CodeChange
Target Release: 4.1   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: ceph-14.2.8-90.el8cp, ceph-14.2.8-90.el7cp Doc Type: Bug Fix
Doc Text:
.Crashing Ceph Monitors caused by a negative time span Previously, Ceph Monitors could crash when triggered by a monotonic clock going back in time. These crashes caused a negative monotonic time span and triggered an assertion into the Ceph Monitor leading them to crash. The Ceph Monitor code was updated to tolerate this assertion and interprets it as a zero-length interval and not a negative value. As a result, the Ceph Monitor does not crash when this assertion is made.
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-08-19 19:50:24 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1816167, 1846989, 1903969    

Description Josh Durgin 2020-06-16 19:28:58 UTC
Description of problem:
Monitors can hit an assert about timespan durations not being negative.

Version-Release number of selected component (if applicable):
4.x

How reproducible:
sometimes, unreliable but many user reports - may depend on hardware/kernel version.

Comment 1 Neha Ojha 2020-08-03 16:47:55 UTC
*** Bug 1862916 has been marked as a duplicate of this bug. ***

Comment 2 Scott Ostapovicz 2020-08-05 14:59:24 UTC
Is this a regression, or was this problem already present?

Comment 3 Josh Durgin 2020-08-05 15:13:17 UTC
(In reply to Scott Ostapovicz from comment #2)
> Is this a regression, or was this problem already present?

This is not a regression. It's present since 4.0.

Comment 15 errata-xmlrpc 2020-08-19 19:50:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 4.1 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3524