Bug 1895040

Summary: ceph: problems with clusters containing nodes on s390x for some specific configurations and workloads
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Christina Meno <gmeno>
Component: RADOSAssignee: eshiskki
Status: CLOSED ERRATA QA Contact: Manohar Murthy <mmurthy>
Severity: high Docs Contact: Aron Gunn <agunn>
Priority: high    
Version: 4.1CC: agunn, akupczyk, anharris, bhubbard, bniver, ceph-eng-bugs, ceph-qe-bugs, dzafman, eshiskki, gmeno, hannsj_uhl, kchai, kdreyer, madam, mmurthy, nojha, ocs-bugs, pdonnell, rzarzyns, sseshasa, tserlin, uweigand, vereddy, vumrao
Target Milestone: ---   
Target Release: 4.1z3   
Hardware: s390x   
OS: Linux   
Whiteboard:
Fixed In Version: ceph-14.2.8-113.el8cp, ceph-14.2.8-113.el7cp Doc Type: Bug Fix
Doc Text:
.Improvements to the encoding and decoding of messages on storage clusters When deploying a {storage-product} cluster containing a heterogeneous architecture, such as x86_64 and s390, could cause system crashes. Also, under certain workloads for CephFS, Ceph Monitors on s390x nodes could crash unexpectedly. With this release, properly decoding `entity_addrvec_t` with a marker of `1`, properly decoding the `enum` types on big-endian systems by using an intermediate integer variable type, and fixed encoding and decoding `float` types on big-endian systems. As a result, heterogeneous storage clusters, and Ceph Monitors on s390x nodes no longer crash.
Story Points: ---
Clone Of: 1877413 Environment:
Last Closed: 2020-12-02 15:22:34 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1877413    
Bug Blocks: 1816167    

Comment 2 Christina Meno 2020-11-10 22:10:27 UTC
To git.redhat.com:ceph/ceph.git
   d3de268..2e68222  ceph-4.1-rhel-patches -> ceph-4.1-rhel-patches

I checked the PRs and this appears to contain all the required code.
Eduard please double check me when you're back from PTO

Comment 6 eshiskki 2020-11-17 18:03:47 UTC
(In reply to Christina Meno from comment #2)
> To git.redhat.com:ceph/ceph.git
>    d3de268..2e68222  ceph-4.1-rhel-patches -> ceph-4.1-rhel-patches
> 
> I checked the PRs and this appears to contain all the required code.
> Eduard please double check me when you're back from PTO

Yes, all 3 patches are backported. Looks good for me.

Comment 8 errata-xmlrpc 2020-12-02 15:22:34 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 4.1 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:5325

Comment 12 eshiskki 2021-02-03 12:43:16 UTC
Done.
If more technical details are needed, then let me know.

Thanks,
Eduard