Hide Forgot
Running GlusterFS nfs_beta_rc14 on CentOS 4 sharing a replicate of 5 bricks from 5 nodes. Client is openSUSE 11.3 running kernel 2.6.34-12-desktop. Occasionally, when running ls on a directory in the NFS mounted volume, I would get: ls: reading directory .: Stale NFS file handle total 0 and on the client's syslog: Sep 24 14:05:52 vus-bli kernel: [2505162.929534] NFS: server gluster-nfs error: fileid changed Sep 24 14:05:52 vus-bli kernel: [2505162.929537] fsid 0:14: expected fileid 0xbf4046, got 0x82280d5 Sep 24 14:05:52 vus-bli kernel: [2505163.614948] NFS: server gluster-nfs error: fileid changed Sep 24 14:05:52 vus-bli kernel: [2505163.614952] fsid 0:14: expected fileid 0xbf4046, got 0x82280d5 Sep 24 14:05:52 vus-bli kernel: [2505163.637769] NFS: server gluster-nfs error: fileid changed Sep 24 14:05:52 vus-bli kernel: [2505163.637773] fsid 0:14: expected fileid 0xbf4046, got 0x82280d5 on the server, the log shows: [2010-09-24 14:05:45] E [rpcsvc.c:1230:rpcsvc_program_actor] rpc-service: RPC program not available The glusterfsd on the replicate brick servers AFAIK have stayed up during the whole time, so this should have nothing to do with self-healing.
5 replicas have not been tested. I am looking into it.
Error is similar to the fileid changes Harsha experienced during recent tests.
I forgot to mention that the bricks are running GlusterFS version 3.0.5 and *not* nfs_beta_rc14, not sure if that matters...
(In reply to comment #3) > I forgot to mention that the bricks are running GlusterFS version 3.0.5 and > *not* nfs_beta_rc14, not sure if that matters... That hasnt been tested. Please try with rc14 on all bricks and nfs servers. What is the version of the kernel on the nfs client machine where you saw this error?
Bernard, please re-open if ESTALEs occur even with nfs-beta on the bricks.