Bug 928054
Summary: | remove-brick operation led to Storage Domain and Data Center being brought down to non-responsive state for a long time | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rejy M Cyriac <rcyriac> |
Component: | glusterfs | Assignee: | shishir gowda <sgowda> |
Status: | CLOSED DUPLICATE | QA Contact: | Rejy M Cyriac <rcyriac> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 2.0 | CC: | amarts, kaushal, nsathyan, vbellur |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2013-05-09 09:52:15 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Rejy M Cyriac
2013-03-26 19:25:39 UTC
Looks like ops are failing with permission denied issues, which is leading to failures. [2013-03-27 00:21:11.045151] W [fuse-bridge.c:725:fuse_fd_cbk] 0-glusterfs-fuse: 2401: OPEN() /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md/ids => -1 (Invalid argument) [2013-03-27 00:21:17.454194] I [afr-self-heal-entry.c:2309:afr_sh_entry_fix] 0-virtVOL-replicate-2: /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md: Performing conservative merge [2013-03-27 00:21:17.454300] I [afr-self-heal-entry.c:2309:afr_sh_entry_fix] 0-virtVOL-replicate-0: /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md: Performing conservative merge [2013-03-27 00:21:17.466916] I [dht-common.c:997:dht_lookup_everywhere_cbk] 0-virtVOL-dht: deleting stale linkfile /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md/ids on virtVOL-replicate-2 [2013-03-27 00:21:17.467590] W [client3_1-fops.c:651:client3_1_unlink_cbk] 0-virtVOL-client-4: remote operation failed: Permission denied [2013-03-27 00:21:17.467630] W [client3_1-fops.c:651:client3_1_unlink_cbk] 0-virtVOL-client-5: remote operation failed: Permission denied [2013-03-27 00:21:17.468303] W [client3_1-fops.c:258:client3_1_mknod_cbk] 0-virtVOL-client-0: remote operation failed: Permission denied. Path: /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md/ids (e5572401-ce56-4c8 2-a4c1-5f54f6948f44) [2013-03-27 00:21:17.468345] W [client3_1-fops.c:258:client3_1_mknod_cbk] 0-virtVOL-client-1: remote operation failed: Permission denied. Path: /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md/ids (e5572401-ce56-4c8 2-a4c1-5f54f6948f44) The problem lies in dht_discover_cbk returning EINVAL on ENOENT errors on newly added bricks. It should trigger selfheal on these bricks. Looks like a duplicate of bug 924572 [2013-03-26 18:15:03.372833] I [dht-layout.c:611:dht_layout_normalize] 1-virtVOL-dht: found anomalies in /79f2acbd-6f1c-4976-a8e6-c82a0073b6bb/dom_md. holes=1 overlaps=1 [2013-03-26 18:15:03.377632] I [dht-layout.c:611:dht_layout_normalize] 1-virtVOL-dht: found anomalies in <gfid:4099f439-fc15-4379-bf25-8c15c401952d>. holes=0 overlaps=1 [2013-03-26 18:15:03.377691] W [fuse-resolve.c:152:fuse_resolve_gfid_cbk] 0-fuse: 4099f439-fc15-4379-bf25-8c15c401952d: failed to resolve (Invalid argument) [2013-03-26 18:15:03.377707] E [fuse-bridge.c:555:fuse_getattr_resume] 0-glusterfs-fuse: 689633: GETATTR 139816465486164 (4099f439-fc15-4379-bf25-8c15c401952d) resolution failed Marking it as duplicate of bug 924572, as the root cause is dht_discover_complete returning EINVAL errors when layout anomalies were found. *** This bug has been marked as a duplicate of bug 924572 *** |