Bug 1308892
Summary: | [geo-rep]: Observed "Operation not supported" error with traceback on slave log | |||
---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rahul Hinduja <rhinduja> | |
Component: | geo-replication | Assignee: | hari gowtham <hgowtham> | |
Status: | CLOSED WONTFIX | QA Contact: | Rahul Hinduja <rhinduja> | |
Severity: | medium | Docs Contact: | ||
Priority: | medium | |||
Version: | rhgs-3.1 | CC: | amukherj, anrobins, apaladug, chrisw, csaba, khiremat, kiyer, nchilaka, nlevinki, redhatbugzilla, rhinduja, sheggodu, sunkumar, vdas | |
Target Milestone: | --- | Keywords: | ZStream | |
Target Release: | --- | |||
Hardware: | x86_64 | |||
OS: | Linux | |||
Whiteboard: | rebase | |||
Fixed In Version: | glusterfs-3.12.2-1 | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1499180 (view as bug list) | Environment: | ||
Last Closed: | 2020-03-10 09:09:11 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1499180, 1500396 |
Description
Rahul Hinduja
2016-02-16 11:33:59 UTC
Analysis: Geo-replication slave logs on Slave node is as follows: [2016-02-12 09:58:34.276675] I [gsyncd(slave):652:main_i] <top>: syncing: gluster://localhost:slave [2016-02-12 09:58:35.422693] I [resource(slave):844:service_loop] GLUSTER: slave listening [2016-02-12 10:00:01.626035] E [repce(slave):117:worker] <top>: call failed: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker res = getattr(self.obj, rmeth)(*in_data[2:]) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 685, in entry_ops [], [ENOTEMPTY, ESTALE, ENODATA]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 475, in errno_wrap return call(*arg) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 662, in recursive_rmdir recursive_rmdir(gfid, entry, fullname) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 650, in recursive_rmdir names = errno_wrap(os.listdir, [path], [ENOENT], [ESTALE]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 475, in errno_wrap return call(*arg) OSError: [Errno 95] Operation not supported: '.gfid/0fd92c95-df7b-4e64-a0c6-04d6cf01e9ee/level68/symlink_to_files' So it is during os.lisdir from python. If we see the corresponding aux-mount logs during same time: ... [2016-02-12 10:00:01.625414] W [fuse-bridge.c:982:fuse_fd_cbk] 0-glusterfs-fuse: 7065: OPENDIR() /.gfid/519f470b-b488-4da2-8a8d-a968d5394fc7 => -1 (Operation not supported) ... We can see OPENDIR on /.gfid/<GFID> is being called, which is not supported. gfid access translator will return with 'Operatoin not supported' on such opendirs. But yet to find out who called OPENDIR on .gfid/<GFID> For reference: Hitting this on 3.1.3 (glusterfs-3.7.9-5) Slave Log: ========== [2016-05-22 15:56:37.597151] E [repce(slave):117:worker] <top>: call failed: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker res = getattr(self.obj, rmeth)(*in_data[2:]) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 714, in entry_ops [], [ENOTEMPTY, ESTALE, ENODATA]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 475, in errno_wrap return call(*arg) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 654, in recursive_rmdir names = errno_wrap(os.listdir, [path], [ENOENT], [ESTALE]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 475, in errno_wrap return call(*arg) OSError: [Errno 95] Operation not supported: '.gfid/201b5726-579d-4158-a72d-8b30ed275553/symlink_to_files' [2016-05-22 15:56:37.631872] I [repce(slave):92:service_loop] RepceServer: terminating on reaching EOF. For reference: Hitting this on 3.2.0 (glusterfs-3.8.4-3.el7rhgs.x86_64) Slave: ====== [2016-11-13 11:36:06.387545] E [repce(slave):117:worker] <top>: call failed: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker res = getattr(self.obj, rmeth)(*in_data[2:]) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 726, in entry_ops [], [ENOTEMPTY, ESTALE, ENODATA]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 495, in errno_wrap return call(*arg) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 666, in recursive_rmdir names = errno_wrap(os.listdir, [path], [ENOENT], [ESTALE]) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 495, in errno_wrap return call(*arg) OSError: [Errno 95] Operation not supported: '.gfid/28693d18-b73b-47c7-986a-b0e1608f8ab5/level410' [2016-11-13 11:36:06.449531] I [repce(slave):92:service_loop] RepceServer: terminating on reaching EOF. Master: ======= [2016-11-13 11:36:06.389225] E [syncdutils(/bricks/brick1/master_brick4):296:log_raise_exception] <top>: FAIL: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 204, in main main_i() File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 747, in main_i local.service_loop(*[r for r in [remote] if r]) File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 1535, in service_loop g2.crawlwrap() File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 571, in crawlwrap self.crawl() File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1132, in crawl self.changelogs_batch_process(changes) File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1107, in changelogs_batch_process self.process(batch) File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 992, in process self.process_change(change, done, retry) File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 933, in process_change failures = self.slave.server.entry_ops(entries) File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 226, in __call__ return self.ins(self.meth, *a) File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 208, in __call__ raise res OSError: [Errno 95] Operation not supported: '.gfid/28693d18-b73b-47c7-986a-b0e1608f8ab5/level410' [2016-11-13 11:36:06.418972] I [syncdutils(/bricks/brick1/master_brick4):237:finalize] <top>: exiting. upstream patch : https://review.gluster.org/18443 Is it worth documenting this issue (as a KCS) or known issue in 3.4 Release notes , so customer can try to remedy the issue instead of opening a support case ? (In reply to Anand Paladugu from comment #20) > Is it worth documenting this issue (as a KCS) or known issue in 3.4 Release > notes , so customer can try to remedy the issue instead of opening a support > case ? No, not required. There is no manual admint intervention required. The geo-rep worker restarts automatically and fixes it self. |