Bug 1800803 - Newly-installed RHV hosts with Gluster storage domains go non-operational
Summary: Newly-installed RHV hosts with Gluster storage domains go non-operational
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: vdsm
Version: 4.3.7
Hardware: Unspecified
OS: Unspecified
urgent
urgent
Target Milestone: ovirt-4.4.0
: ---
Assignee: Gobinda Das
QA Contact: SATHEESARAN
URL:
Whiteboard:
Depends On: 1780290
Blocks: 1811398
TreeView+ depends on / blocked
 
Reported: 2020-02-07 21:52 UTC by Gordon Watson
Modified: 2023-03-24 16:59 UTC (History)
14 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 1811398 (view as bug list)
Environment:
Last Closed: 2020-02-18 09:36:53 UTC
oVirt Team: Storage
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHV-46389 0 None None None 2022-06-12 23:12:01 UTC
Red Hat Knowledge Base (Solution) 4815931 0 None None None 2020-02-10 18:35:04 UTC
Red Hat Product Errata RHBA-2020:0534 0 None None None 2020-02-18 09:37:02 UTC

Description Gordon Watson 2020-02-07 21:52:45 UTC
Description of problem:

After installation of new RHEL 7.7 host in a DC with Gluster storage domains, it goes non-operational. 

The vdsm logs are full of;

2020-02-07 16:26:37,577-0500 WARN  (jsonrpc/5) [storage.PersistentDict] Could not parse line `^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@


In debug mode, it's reading beyond the end of the SD metadata;

2020-02-06 05:35:14,744-0500 DEBUG (jsonrpc/2) [storage.PersistentDict] read lines (FileMetadataRW)=['CLASS=Data', 'DESCRIPTION=vmdata21', 'IOOPTIMEOUTSEC=10', 'LEASERETRIES=3', 'LEASETIMESEC=60', 'LOCKPOLICY=', 'LOCKRENEWALINTERVALSEC=5', 'POOL_UUID=00000001-0001-0001-0001-000000000311', 'REMOTE_PATH=mmr14:/vmdata21', 'ROLE=Master', 'SDUUID=22b2b7a2-65f3-47a9-b6f8-168a50451b2d', 'TYPE=GLUSTERFS', 'VERSION=4', '_SHA_CKSUM=842d15901567b9c08654a838b33bea46d5cce9a6', '\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\....
2020-02-06 05:35:14,744-0500 WARN  (jsonrpc/2) [storage.PersistentDict] Could not parse line `^@^@^@^@^@^@^@^@^@^@^@....


So, it appears to be reading and parsing the entire 512-byte block.

Other hosts in the same DC have 'vdsm-4.30.33.1' and are working fine. I downgraded the system in question to 'vdsm-4.30.33.1' and it still went non-operational.


Version-Release number of selected component (if applicable):

RHV 4.3.7
RHEL 7.7 host;
    vdsm-4.30.38-1.el7ev.x86_64
    glusterfs-6.0-29.el7rhgs.x86_64


How reproducible:

Appears to be reproducible on newly-installed/upgraded RHV hosts with Gluster storage.


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 12 Eyal Shenitzky 2020-02-10 15:10:22 UTC
Sandro,

Can you please have a look at it.
It seems like there is some versions problem here.

Comment 22 Nir Soffer 2020-02-11 17:34:38 UTC
Regarding the warnings, if we cannot get the gluster fix for bug 1737141,
we can backport this change in vdsm to 4.3 to avoid the unwanted warnings.
It will be easy to backport since it depends on oop.readFile which does
not exist in 4.3.

commit 0c459b90007947ed303db660b288565969367b59
Author: Nir Soffer <nsoffer>
Date:   Fri Aug 2 23:17:28 2019 +0300

    fileVolume: Strip padding when reading volume metadata
    
    VolumeMetadata.from_lines() does not warn about extra zero padding, but
    there is no reason to return the padding when reading volume metadata
    from storage.
    
    Replacing oop.directReadLines() with oop.readFile(direct=True) simplify
    the code, reading volume metadata the same way we do for domain
    metadata, and allows removing oop.directReadLines().
    
    Change-Id: I6d264e49d53329b2b8eef5a5af4e8fd38d272681
    Signed-off-by: Nir Soffer <nsoffer>

Comment 25 Marina Kalinin 2020-02-11 19:27:59 UTC
(In reply to Nir Soffer from comment #22)
> Regarding the warnings, if we cannot get the gluster fix for bug 1737141,
> we can backport this change in vdsm to 4.3 to avoid the unwanted warnings.
> It will be easy to backport since it depends on oop.readFile which does
> not exist in 4.3.
> 
> commit 0c459b90007947ed303db660b288565969367b59
> Author: Nir Soffer <nsoffer>
> Date:   Fri Aug 2 23:17:28 2019 +0300
> 
>     fileVolume: Strip padding when reading volume metadata
>     
>     VolumeMetadata.from_lines() does not warn about extra zero padding, but
>     there is no reason to return the padding when reading volume metadata
>     from storage.
>     
>     Replacing oop.directReadLines() with oop.readFile(direct=True) simplify
>     the code, reading volume metadata the same way we do for domain
>     metadata, and allows removing oop.directReadLines().
>     
>     Change-Id: I6d264e49d53329b2b8eef5a5af4e8fd38d272681
>     Signed-off-by: Nir Soffer <nsoffer>

Gordon, can you please open a new bug for that?

Comment 26 Gordon Watson 2020-02-11 20:49:38 UTC
> 
> Gordon, can you please open a new bug for that?

Marina, 

I just opened BZ 1801892.

Regards, GFW.

Comment 29 errata-xmlrpc 2020-02-18 09:36:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0534

Comment 31 Sandro Bonazzola 2020-02-28 12:30:40 UTC
(In reply to Eyal Shenitzky from comment #12)
> Sandro,
> 
> Can you please have a look at it.
> It seems like there is some versions problem here.

Looks like this has been already clarified.


Note You need to log in before you can comment on or make changes to this bug.