Bug 1221906 - ctdb's ping_pong lock tester fails with input/output error on disperse volume mounted with glusterfs
Summary: ctdb's ping_pong lock tester fails with input/output error on disperse volume...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: GlusterFS
Classification: Community
Component: disperse
Version: 3.7.0
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
Assignee: Pranith Kumar K
QA Contact:
URL:
Whiteboard:
Depends On: 1177167 1221145
Blocks:
TreeView+ depends on / blocked
 
Reported: 2015-05-15 08:28 UTC by Pranith Kumar K
Modified: 2016-05-03 12:33 UTC (History)
7 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of: 1221145
Environment:
Last Closed: 2016-05-03 12:33:30 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:


Attachments (Terms of Use)

Description Pranith Kumar K 2015-05-15 08:28:00 UTC
+++ This bug was initially created as a clone of Bug #1221145 +++

+++ This bug was initially created as a clone of Bug #1177167 +++

Description of problem:
ctdb's ping_pong lock tester fails with input/output error on disperse volume mounted with glusterfs.

It apparently works when ping_pong is launched on various hosts where the volume is mounted. As soon as more then on ping_pong is launched on the same host, the tool shows input/output error.

The problem doesn't appear with replica volumes.

Version-Release number of selected component (if applicable):
3.6.1

How reproducible:
Always

Steps to Reproduce:
1. Create a disperse volume (I used 2+1) and mount it as glusterfs
   Problem shows up if the bricks are on a single host too. One single host
   can be used to reproduce it.
2. cd to mount point. Launch 2 simultaneous "ping_pong test 1"
3. 

Actual results:
$ ping_pong test 1
unlock at 0 failed! - Input/output error
lock at 0 failed! - Input/output error
unlock at 0 failed! - Input/output error
lock at 0 failed! - Input/output error
unlock at 0 failed! - Input/output error


Expected results:
$ ping_pong test 1
nnnnn locks/sec

Additional info:
$ gluster volume info test
 
Volume Name: test
Type: Disperse
Volume ID: c41b2c0b-a876-487f-9bf0-01e83027f9da
Status: Started
Number of Bricks: 1 x (2 + 1) = 3
Transport-type: tcp
Bricks:
Brick1: 192.168.114.177:/gluster/cluster/brick.test
Brick2: 192.168.114.13:/gluster/cluster/brick.test
Brick3: 192.168.114.171:/gluster/cluster/brick.test
Options Reconfigured:
nfs.disable: off

--- Additional comment from Anand Avati on 2015-05-13 07:35:40 EDT ---

REVIEW: http://review.gluster.org/10770 (cluster/ec: Prevent unnecessary self-heals) posted (#1) for review on master by Pranith Kumar Karampuri (pkarampu)

--- Additional comment from Anand Avati on 2015-05-13 10:56:53 EDT ---

REVIEW: http://review.gluster.org/10770 (cluster/ec: Prevent unnecessary self-heals) posted (#2) for review on master by Pranith Kumar Karampuri (pkarampu)

--- Additional comment from Anand Avati on 2015-05-15 04:24:54 EDT ---

COMMIT: http://review.gluster.org/10770 committed in master by Pranith Kumar Karampuri (pkarampu) 
------
commit 503acdb32ca84102d07cd1142eff464152b06690
Author: Pranith Kumar K <pkarampu>
Date:   Wed May 13 16:57:49 2015 +0530

    cluster/ec: Prevent unnecessary self-heals
    
    When a blocking lock is requested, lock request is succeeded even when
    ec->fragment number of locks are acquired successfully in non-blocking locking
    phase. This will lead to fop succeeding only on the bricks where the locks are
    acquired, leading to the necessity of self-heals. To prevent these un-necessary
    self-heals, if the remaining locks fail with EAGAIN in non-blocking lock phase
    try blocking locking phase instead.
    
    Change-Id: I940969e39acc620ccde2a876546cea77f7e130b6
    BUG: 1221145
    Signed-off-by: Pranith Kumar K <pkarampu>
    Reviewed-on: http://review.gluster.org/10770
    Tested-by: Gluster Build System <jenkins.com>
    Reviewed-by: Xavier Hernandez <xhernandez>

Comment 1 Anand Avati 2015-05-15 08:43:49 UTC
REVIEW: http://review.gluster.org/10789 (cluster/ec: Prevent unnecessary self-heals) posted (#1) for review on release-3.7 by Pranith Kumar Karampuri (pkarampu)

Comment 2 Anand Avati 2015-05-15 08:46:20 UTC
REVIEW: http://review.gluster.org/10789 (cluster/ec: Prevent unnecessary self-heals) posted (#2) for review on release-3.7 by Pranith Kumar Karampuri (pkarampu)

Comment 3 Mike McCune 2016-03-28 22:22:31 UTC
This bug was accidentally moved from POST to MODIFIED via an error in automation, please see mmccune with any questions

Comment 4 Kaleb KEITHLEY 2016-05-03 12:33:30 UTC
fix merged in http://review.gluster.org/10926


Note You need to log in before you can comment on or make changes to this bug.