Bug 1396784

Summary: [Ganesha] : EIO in gfapi logs during I/O
Product: Red Hat Gluster Storage Reporter: Ambarish <asoman>
Component: nfs-ganeshaAssignee: Kaleb KEITHLEY <kkeithle>
Status: CLOSED NEXTRELEASE QA Contact: Ambarish <asoman>
Severity: high Docs Contact:
Priority: unspecified    
Version: rhgs-3.2CC: asoman, bturner, jthottan, kkeithle, rhinduja, rhs-bugs, skoduri, storage-qa-internal
Target Milestone: ---Keywords: ZStream
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: rhgs-3.3.0 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-08-23 12:35:56 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Ambarish 2016-11-20 06:15:37 UTC
Description of problem:
-----------------------

4 node Ganesha cluster.Mounted a 2*2 volume on 4 clients via v3.

I see Input/Output Error messages in ganesha gfapi logs during small file creates :

[2016-11-18 15:29:49.840315] E [MSGID: 114031] [client-rpc-fops.c:301:client3_3_mkdir_cbk] 0-testvol-client-2: remote operation failed. Path: /network_shared [Input/output error]
[2016-11-18 15:29:49.933629] E [MSGID: 114031] [client-rpc-fops.c:301:client3_3_mkdir_cbk] 0-testvol-client-3: remote operation failed. Path: /network_shared [Input/output error]

I am hitting it with and without md-cache improvements.

Version-Release number of selected component (if applicable):
--------------------------------------------------------------

glusterfs-ganesha-3.8.4-5.el7rhgs.x86_64
nfs-ganesha-2.4.1-1.el7rhgs.x86_64

How reproducible:
-----------------

Every time I tried.

Steps to Reproduce:
-------------------

1. Create a new 2*2 volume ,mount it via v3 on 4 clients.

2. Run smallfile creates in  a distributed multithreaded way.

3. Check gfapi logs.

Actual results:
---------------

I/O Error in gfapi logs.

Expected results:
------------------

No EIO in logs.

Additional info:
----------------


Volume Name: testvol
Type: Distributed-Replicate
Volume ID: 865c5329-7fa5-4a10-888b-671902b0bca6
Status: Started
Snapshot Count: 0
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: gqas013.sbu.lab.eng.bos.redhat.com:/bricks/testvol_brick0
Brick2: gqas005.sbu.lab.eng.bos.redhat.com:/bricks/testvol_brick1
Brick3: gqas006.sbu.lab.eng.bos.redhat.com:/bricks/testvol_brick2
Brick4: gqas011.sbu.lab.eng.bos.redhat.com:/bricks/testvol_brick3
Options Reconfigured:
ganesha.enable: on
features.cache-invalidation: on
nfs.disable: on
performance.readdir-ahead: on
transport.address-family: inet
performance.stat-prefetch: off
server.allow-insecure: on
diagnostics.latency-measurement: on
diagnostics.count-fop-hits: on
nfs-ganesha: enable
cluster.enable-shared-storage: enable
[root@gqas013 ~]#