Bug 1343695 - [Disperse] : Assertion Failed Error messages in rebalance log post add-brick/rebalance.
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: disperse
Version: rhgs-3.1
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: RHGS 3.2.0
Assignee: Pranith Kumar K
QA Contact: Ambarish
URL:
Whiteboard:
Depends On: 1343906
Blocks: 1351522
Reported: 2016-06-07 17:28 UTC by Ambarish
Modified: 2017-03-28 06:56 UTC (History)

Fixed In Version: glusterfs-3.8.4-1
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-03-23 05:35:12 UTC
Embargoed:



Links
System ID: Red Hat Product Errata RHSA-2017:0486
Private: 0
Priority: normal
Status: SHIPPED_LIVE
Summary: Moderate: Red Hat Gluster Storage 3.2.0 security, bug fix, and enhancement update
Last Updated: 2017-03-23 09:18:45 UTC

Description Ambarish 2016-06-07 17:28:48 UTC
Description of problem:
-----------------------

Started with a 1*(4+2) disperse volume. Added bricks, then ran a rebalance.

The rebalance log had the following setattr assertion errors:


[2016-06-06 15:10:53.999653] E [ec-inode-write.c:395:ec_manager_setattr] (-->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_resume+0x91) [0x7fd263b5e621] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(__ec_manager+0x57) [0x7fd263b5e807] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_manager_setattr+0x2c6) [0x7fd263b7be76] ) 0-: Assertion failed: ec_get_inode_size(fop, fop->locks[0].lock->loc.inode, &cbk->iatt[0].ia_size)
[2016-06-06 15:10:54.003509] E [ec-inode-write.c:395:ec_manager_setattr] (-->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_resume+0x91) [0x7fd263b5e621] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(__ec_manager+0x57) [0x7fd263b5e807] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_manager_setattr+0x2c6) [0x7fd263b7be76] ) 0-: Assertion failed: ec_get_inode_size(fop, fop->locks[0].lock->loc.inode, &cbk->iatt[0].ia_size)
[2016-06-06 15:10:54.012540] E [ec-inode-write.c:395:ec_manager_setattr] (-->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_resume+0x91) [0x7fd263b5e621] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(__ec_manager+0x57) [0x7fd263b5e807] -->/usr/lib64/glusterfs/3.7.9/xlator/cluster/disperse.so(ec_manager_setattr+0x2c6) [0x7fd263b7be76] ) 0-: Assertion failed: ec_get_inode_size(fop, fop->locks[0].lock->loc.inode, &cbk->iatt[0].ia_size)
[2016-06-06 15:13:15.333800] E [MSGID: 109023] 



Version-Release number of selected component (if applicable):
-------------------------------------------------------------

3.7.9-8

How reproducible:
-----------------

100%

Steps to Reproduce:
------------------

1. Create a 1*(4+2) EC volume. Mount it via gNFS.

2. Add bricks and start a rebalance. Run I/O from various mounts while this happens.

3. Check rebal logs periodically.
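
For step 3, the periodic log check can be scripted. The following is a minimal sketch, not part of the original report: it assumes the log lines have the same layout as the ones pasted in the description above, and the names find_assertions and summarize are hypothetical helpers chosen for illustration.

```python
import re
from collections import Counter

# Header of a GlusterFS log line, as it appears in the lines quoted in this
# report: [timestamp] LEVEL [file.c:line:function] ...
LOG_HEAD = re.compile(
    r"^\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\]\s+"
    r"(?P<level>[A-Z])\s+"
    r"\[(?P<src>[^\]:]+):(?P<line>\d+):(?P<func>[^\]]+)\]"
)
ASSERT_MARK = "Assertion failed:"


def find_assertions(lines):
    """Return (timestamp, function, expression) for each assertion failure."""
    hits = []
    for raw in lines:
        if ASSERT_MARK not in raw:
            continue
        head = LOG_HEAD.match(raw)
        if head is None:
            continue  # line is not in the assumed format; skip it
        expr = raw.split(ASSERT_MARK, 1)[1].strip()
        hits.append((head.group("ts"), head.group("func"), expr))
    return hits


def summarize(hits):
    """Count assertion failures per reporting function."""
    return Counter(func for _, func, _ in hits)
```

To use it, open the rebalance log of the volume under test, pass the file object to find_assertions(), and print the result of summarize(). Any non-empty result means the issue is still being hit.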

Actual results:
---------------

Assertion Failure error message in logs

Expected results:
----------------

Add-brick/rebal should succeed without any problems

Additional info:
----------------
Ashish raised a bug for the same issue (tracked via https://bugzilla.redhat.com/show_bug.cgi?id=1339465), which was later closed as a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1330997. The fix version given in that BZ is 3.7.9-7. I am able to hit this issue on 3.7.9-8 as well.

Comment 5 Atin Mukherjee 2016-08-30 05:00:31 UTC
The fix http://review.gluster.org/15008 has made it into the release-3.8 branch in gluster upstream, and it should be available in rhgs-3.2.0 as part of the rebase.

Comment 6 Atin Mukherjee 2016-09-17 13:39:18 UTC
Upstream mainline : http://review.gluster.org/14669
Upstream 3.8 : http://review.gluster.org/15008

The fix is available in rhgs-3.2.0 as part of the rebase to GlusterFS 3.8.4.

Comment 9 Ambarish 2016-10-24 08:57:22 UTC
Verified on 3.8.4-2.

Ran a couple of add-brick + rebalance and remove-brick cycles with continuous I/O from different mounts over gNFS.
Could not reproduce the reported issue.

Comment 11 errata-xmlrpc 2017-03-23 05:35:12 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2017-0486.html

