Bug 802018 - writes on nfs mount failed with "Input/output error" when quota limit-usage was set
Keywords:
Status: CLOSED DUPLICATE of bug 800755
Alias: None
Product: GlusterFS
Classification: Community
Component: replicate
Version: mainline
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ---
Assignee: Pranith Kumar K
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2012-03-10 09:40 UTC by Shwetha Panduranga
Modified: 2012-03-16 05:49 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2012-03-16 05:49:01 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:


Description Shwetha Panduranga 2012-03-10 09:40:23 UTC
Description of problem:
Write operations on files fail on the NFS mount when there are parallel writes from other mounts on the same volume and quota limit-usage is set.

Version-Release number of selected component (if applicable):
3.3.0qa26

How reproducible:
often

Steps to Reproduce:
1. Create a distribute-replicate volume (2 x 3, 200GB total available space) and start it.
2. Create two gluster mounts and one NFS mount.
3. Start dd on one of the gluster mounts and on the NFS mount.
4. Start ping_pong on a file on the other gluster mount.
5. Enable quota on the volume.
6. Set the quota limit-usage to some value (it was set to 150GB in this run).
7. The dd on the NFS mount fails.
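
The steps above can be sketched as a command plan. This is only an illustration, not the exact commands used in the original run: the host names (server1..server3), brick paths, mount points, the gluster-side dd file name, and the ping_pong lock count are all assumptions. The volume name store1 and the NFS dd command line are taken from the logs and output in this report. The script only builds and prints the commands; it does not run them.

```python
import shlex


def build_repro_commands(volume="store1",
                         servers=("server1", "server2", "server3"),
                         brick_root="/export",
                         limit="150GB"):
    """Return the reproduction steps as a list of argv lists (nothing is executed).

    Host names, brick paths and mount points are hypothetical placeholders.
    """
    # 2 x 3 distribute-replicate: six bricks, grouped so that each run of
    # three consecutive bricks (one per server) forms a replica set.
    bricks = [f"{host}:{brick_root}/brick{n}" for n in (1, 2) for host in servers]
    src = f"{servers[0]}:/{volume}"
    return [
        # Step 1: create and start the volume.
        ["gluster", "volume", "create", volume, "replica", "3", *bricks],
        ["gluster", "volume", "start", volume],
        # Step 2: two gluster (FUSE) mounts and one NFSv3 mount.
        ["mount", "-t", "glusterfs", src, "/mnt/gfsc1"],
        ["mount", "-t", "glusterfs", src, "/mnt/gfsc2"],
        ["mount", "-t", "nfs", "-o", "vers=3", src, "/mnt/nfsc1"],
        # Steps 3-4: these three are meant to run in parallel, e.g. from
        # separate shells. The lock count "3" for ping_pong is an assumption.
        ["dd", "if=/dev/zero", "of=/mnt/gfsc1/gfsf1", "bs=1M", "count=51200"],
        ["dd", "if=/dev/zero", "of=/mnt/nfsc1/nfsf1", "bs=1M", "count=51200"],
        ["ping_pong", "/mnt/gfsc2/pp_file", "3"],
        # Steps 5-6: enable quota and set the limit while the writers are running.
        ["gluster", "volume", "quota", volume, "enable"],
        ["gluster", "volume", "quota", volume, "limit-usage", "/", limit],
    ]


if __name__ == "__main__":
    for argv in build_repro_commands():
        print(shlex.join(argv))
```

Keeping the plan as data makes it easy to adjust the hosts or the limit before pasting the printed commands into the relevant shells.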

Actual results:
[03/10/12 - 17:32:47 root@APP-CLIENT1 nfsc1]# dd if=/dev/zero of=nfsf1 bs=1M count=51200
dd: writing `nfsf1': Invalid argument
4152+0 records in
4151+0 records out
4352638976 bytes (4.4 GB) copied, 387.761 s, 11.2 MB/s


Expected results:
dd should continue to run. 

Additional info:
Brick Log:-
------------
[2012-03-10 17:41:50.374193] W [marker-quota.c:2042:mq_inspect_directory_xattr] 0-store1-marker: cannot add a new contribution node
[2012-03-10 17:41:50.374308] W [marker-quota.c:2042:mq_inspect_directory_xattr] 0-store1-marker: cannot add a new contribution node
[2012-03-10 17:41:50.374719] D [io-threads.c:275:iot_schedule] 0-store1-io-threads: FINODELK scheduled as normal fop
[2012-03-10 17:41:50.374965] E [marker-quota-helper.c:230:mq_dict_set_contribution] (-->/usr/local/lib/glusterfs/3.3.0qa26/xlator/debug/io-stats.so(io_stats_lookup+0x28c) [0x7f2eb8caea02] (-->/usr/local/lib/glusterfs/3.3.0qa26/xlator/features/marker.so(marker_lookup+0x142) [0x7f2eb8eccc45] (-->/usr/local/lib/glusterfs/3.3.0qa26/xlator/features/marker.so(mq_req_xattr+0x123) [0x7f2eb8ed7cdc]))) 0-marker: invalid argument: loc->parent


NFS Server Log:- (Check the log during the time frame: [2012-03-10 17:39:55.205570])
--------------
[2012-03-10 17:40:57.079499] D [nfs3-helpers.c:1765:nfs3_log_rw_call] 0-nfs-nfsv3: XID: 369f3732, WRITE: args: FH: hashcount 1, exportid ed0838af-6b69-4a3b-8357-fc7a71cad4af, gfid 39786995-e612-429e-9939-fc1fe1a055c7, offset: 4233867264,  count: 65536, UNSTABLE
[2012-03-10 17:40:57.080238] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.081214] D [nfs3-helpers.c:1765:nfs3_log_rw_call] 0-nfs-nfsv3: XID: 359f3732, WRITE: args: FH: hashcount 1, exportid ed0838af-6b69-4a3b-8357-fc7a71cad4af, gfid 39786995-e612-429e-9939-fc1fe1a055c7, offset: 4233801728,  count: 65536, UNSTABLE
[2012-03-10 17:40:57.082019] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.082337] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.082556] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.082706] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.082837] D [nfs3-helpers.c:1765:nfs3_log_rw_call] 0-nfs-nfsv3: XID: 349f3732, WRITE: args: FH: hashcount 1, exportid ed0838af-6b69-4a3b-8357-fc7a71cad4af, gfid 39786995-e612-429e-9939-fc1fe1a055c7, offset: 4233736192,  count: 65536, UNSTABLE
[2012-03-10 17:40:57.083509] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.103183] D [nfs3-helpers.c:1765:nfs3_log_rw_call] 0-nfs-nfsv3: XID: 339f3732, WRITE: args: FH: hashcount 1, exportid ed0838af-6b69-4a3b-8357-fc7a71cad4af, gfid 39786995-e612-429e-9939-fc1fe1a055c7, offset: 4233670656,  count: 65536, UNSTABLE
[2012-03-10 17:40:57.103801] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.103974] D [nfs3-helpers.c:1765:nfs3_log_rw_call] 0-nfs-nfsv3: XID: 329f3732, WRITE: args: FH: hashcount 1, exportid ed0838af-6b69-4a3b-8357-fc7a71cad4af, gfid 39786995-e612-429e-9939-fc1fe1a055c7, offset: 4233605120,  count: 65536, UNSTABLE

[2012-03-10 17:40:57.105841] D [afr-self-heal-common.c:148:afr_sh_print_pending_matrix] 0-store1-replicate-0: pending_matrix: [ 3 3 3 ]
[2012-03-10 17:40:57.105868] D [afr-self-heal-common.c:148:afr_sh_print_pending_matrix] 0-store1-replicate-0: pending_matrix: [ 3 3 3 ]
[2012-03-10 17:40:57.105884] D [afr-self-heal-common.c:148:afr_sh_print_pending_matrix] 0-store1-replicate-0: pending_matrix: [ 3 3 3 ]
[2012-03-10 17:40:57.105903] D [afr-self-heal-common.c:753:afr_mark_sources] 0-store1-replicate-0: Number of sources: 3
[2012-03-10 17:40:57.105921] D [afr-self-heal-data.c:799:afr_lookup_select_read_child_by_txn_type] 0-store1-replicate-0: returning read_child: 2
[2012-03-10 17:40:57.105937] D [afr-common.c:1268:afr_lookup_select_read_child] 0-store1-replicate-0: Source selected as 2 for <gfid:00000000-0000-0000-0000-000000000000>
[2012-03-10 17:40:57.105960] D [afr-common.c:1075:afr_lookup_build_response_params] 0-store1-replicate-0: Building lookup response from 2
[2012-03-10 17:40:57.105988] D [afr-common.c:1137:afr_lookup_set_self_heal_params_by_xattr] 0-store1-replicate-0: data self-heal is pending for <gfid:00000000-0000-0000-0000-000000000000>.
[2012-03-10 17:40:57.106008] D [afr-common.c:1137:afr_lookup_set_self_heal_params_by_xattr] 0-store1-replicate-0: data self-heal is pending for <gfid:00000000-0000-0000-0000-000000000000>.
[2012-03-10 17:40:57.106027] D [afr-common.c:1137:afr_lookup_set_self_heal_params_by_xattr] 0-store1-replicate-0: data self-heal is pending for <gfid:00000000-0000-0000-0000-000000000000>.
[2012-03-10 17:40:57.106046] I [afr-common.c:1313:afr_launch_self_heal] 0-store1-replicate-0: background  data self-heal triggered. path: <gfid:00000000-0000-0000-0000-000000000000>, reason: lookup detected pending operations
[2012-03-10 17:40:57.106104] D [afr-self-heal-metadata.c:78:afr_sh_metadata_done] 0-store1-replicate-0: proceeding to data check on <gfid:00000000-0000-0000-0000-000000000000>
[2012-03-10 17:40:57.123809] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-1 returned error (No such file or directory)
[2012-03-10 17:40:57.195300] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-0 returned error (Input/output error)
[2012-03-10 17:40:57.195335] I [dht-layout.c:600:dht_layout_normalize] 0-store1-dht: found anomalies in <gfid:00000000-0000-0000-0000-000000000000>. holes=1 overlaps=0
[2012-03-10 17:40:57.195355] D [dht-layout.c:614:dht_layout_normalize] 0-store1-dht: path=<gfid:00000000-0000-0000-0000-000000000000> err=No such file or directory on subvol=store1-replicate-1
[2012-03-10 17:40:57.195372] D [dht-layout.c:614:dht_layout_normalize] 0-store1-dht: path=<gfid:00000000-0000-0000-0000-000000000000> err=Input/output error on subvol=store1-replicate-0
[2012-03-10 17:40:57.195387] D [dht-common.c:197:dht_discover_complete] 0-store1-dht: normalizing failed on <gfid:00000000-0000-0000-0000-000000000000>
[2012-03-10 17:40:57.195456] E [nfs3-helpers.c:3768:nfs3_fh_resolve_inode_lookup_cbk] 0-nfs-nfsv3: Lookup failed: <gfid:00000000-0000-0000-0000-000000000000>: Invalid argument
[2012-03-10 17:40:57.195543] E [nfs3.c:2150:nfs3_write_resume] 0-nfs-nfsv3: Unable to resolve FH: store1 : 00000000-0000-0000-0000-000000000000
[2012-03-10 17:40:57.195573] W [nfs3-helpers.c:3392:nfs3_log_common_res] 0-nfs-nfsv3: XID: 2e9f3732, WRITE: NFS: 22(Invalid argument for operation), POSIX: 14(Bad address)
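
For triage, the error and warning lines can be pulled out of logs like the ones above with a small parser. This is a minimal sketch that assumes every line follows the "[timestamp] LEVEL [file:line:function] message" shape seen in this report; the helper names parse and summarize are made up for illustration, and the sample lines are copied verbatim from the NFS server log above.

```python
import re
from collections import Counter

# Matches: [timestamp] LEVEL [file:line:function] rest-of-message
LOG_LINE = re.compile(
    r"^\[(?P<ts>[^\]]+)\] (?P<level>[A-Z]) "
    r"\[(?P<file>[^:\]]+):(?P<line>\d+):(?P<func>[^\]]+)\] (?P<msg>.*)$"
)


def parse(lines):
    """Return one dict per line that matches the log format; skip the rest."""
    records = []
    for raw in lines:
        match = LOG_LINE.match(raw.strip())
        if match:
            records.append(match.groupdict())
    return records


def summarize(records, levels=("E", "W")):
    """Count records per (level, function) for the given severity levels."""
    return Counter(
        (r["level"], r["func"]) for r in records if r["level"] in levels
    )


SAMPLE = [
    "[2012-03-10 17:40:57.195300] D [dht-common.c:269:dht_discover_cbk] 0-store1-dht: lookup of <gfid:00000000-0000-0000-0000-000000000000> on store1-replicate-0 returned error (Input/output error)",
    "[2012-03-10 17:40:57.195335] I [dht-layout.c:600:dht_layout_normalize] 0-store1-dht: found anomalies in <gfid:00000000-0000-0000-0000-000000000000>. holes=1 overlaps=0",
    "[2012-03-10 17:40:57.195456] E [nfs3-helpers.c:3768:nfs3_fh_resolve_inode_lookup_cbk] 0-nfs-nfsv3: Lookup failed: <gfid:00000000-0000-0000-0000-000000000000>: Invalid argument",
    "[2012-03-10 17:40:57.195543] E [nfs3.c:2150:nfs3_write_resume] 0-nfs-nfsv3: Unable to resolve FH: store1 : 00000000-0000-0000-0000-000000000000",
    "[2012-03-10 17:40:57.195573] W [nfs3-helpers.c:3392:nfs3_log_common_res] 0-nfs-nfsv3: XID: 2e9f3732, WRITE: NFS: 22(Invalid argument for operation), POSIX: 14(Bad address)",
]

if __name__ == "__main__":
    for (level, func), count in sorted(summarize(parse(SAMPLE)).items()):
        print(level, func, count)
```

On the sample lines this isolates the two nfs3 errors and the WRITE warning from the surrounding debug output, which is the failure chain that follows the Input/output error reported for store1-replicate-0.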

Comment 1 Raghavendra G 2012-03-16 05:49:01 UTC

*** This bug has been marked as a duplicate of bug 800755 ***

