Bug 762731 (GLUSTER-999)

Summary: Crash in nfs3_fh_resolve_and_resume
Product: [Community] GlusterFS
Reporter: Anush Shetty <anush>
Component: nfs
Assignee: Shehjar Tikoo <shehjart>
Severity: high
Priority: low
Version: nfs-alpha
CC: amarts, gluster-bugs, lakshmipathi, vijay
Target Milestone: ---
Target Release: ---
Hardware: All
OS: Linux
Doc Type: Bug Fix
Regression: RTNR
Mount Type: nfs
Attachments:
NFS server log (flags: none)

Description Shehjar Tikoo 2010-06-11 07:16:27 EDT
I need the log please.
Comment 1 Anush Shetty 2010-06-11 08:45:14 EDT
Saw this crash in nfs-beta while running iozone 

Vol file:
volume posix1
  type storage/posix
  option directory /gluster/export1
end-volume

volume locks
  type features/posix-locks
  option mandatory on
  subvolumes posix1
end-volume

volume brick1
  type performance/io-threads
  option thread-count 8
  subvolumes locks
end-volume

volume eg
  type debug/error-gen
  option failure 5
  subvolumes brick1
end-volume

volume nfs
  type nfs/server
  subvolumes eg
  option rpc-auth.addr.allow *
end-volume

patchset: git://git.sv.gnu.org/gluster.git
signal received: 11
time of crash: 2010-06-11 17:36:18
configuration details:
argp 1
backtrace 1
dlfcn 1
fdatasync 1
libpthread 1
llistxattr 1
setfsid 1
spinlock 1
epoll.h 1
xattr.h 1
st_atim.tv_nsec 1
package-string: glusterfs 3.0.0git
Comment 2 Shehjar Tikoo 2010-06-15 03:47:13 EDT
The logs please?
Comment 3 Shehjar Tikoo 2010-07-04 03:00:27 EDT
Anush, we have been running terabytes of I/O using iozone over NFS, so if I have to fix this, I need the logs I asked for earlier. This does not look like a trivial bug, given the extensive testing the NFS I/O path is undergoing.
Comment 4 Shehjar Tikoo 2010-07-04 03:17:39 EDT
One thing to try when reproducing this is the access-control translator. It's not in this volume file. Please test with and without access-control. It's just another data point to help fix this.
Comment 5 Shehjar Tikoo 2010-07-29 03:41:46 EDT
Not able to reproduce. Closing.
Comment 6 Anush Shetty 2010-07-29 03:57:40 EDT
I still got the crash with Alpha. Works now with Beta git
Comment 7 Shehjar Tikoo 2010-09-24 03:40:42 EDT
User reported this on 3.1 qa27. Reopening.
Comment 8 Samuli Heinonen 2010-09-24 04:33:33 EDT
Created attachment 315

NFS server crashed during file copying.
Comment 9 Shehjar Tikoo 2010-10-05 04:25:14 EDT
*** Bug 1755 has been marked as a duplicate of this bug. ***
Comment 10 Vijay Bellur 2010-10-12 01:39:44 EDT
PATCH: http://patches.gluster.com/patch/5466 in master (nfs: avoid assignment of structure pointer into serialized buffer)
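
The patch itself is not quoted in this thread, so the following is only a minimal sketch of the class of bug its title names: writing a structure pointer into a serialized buffer instead of the structure's contents. The type `demo_fh` and all function names are hypothetical and are not taken from the GlusterFS sources.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical file-handle layout, for illustration only. */
struct demo_fh {
    uint64_t ino;
    uint64_t gen;
};

/*
 * Unsafe pattern (shown only as a comment): copying the pointer value
 * rather than the pointed-to bytes, e.g.
 *     memcpy(buf, &fh, sizeof(fh));    where fh is a 'struct demo_fh *'
 * The buffer then holds an address that is meaningless once the original
 * object is freed or the bytes reach another process.
 */

/* Safe: copy the structure's bytes into the buffer.
 * Returns the number of bytes written, or 0 if an argument is NULL
 * or the buffer is too small. */
size_t demo_fh_serialize(const struct demo_fh *fh,
                         unsigned char *buf, size_t buflen)
{
    if (fh == NULL || buf == NULL || buflen < sizeof(*fh))
        return 0;
    memcpy(buf, fh, sizeof(*fh));
    return sizeof(*fh);
}

/* Decode by copying out of the buffer; never keep a pointer into it.
 * Returns 0 on success, -1 on bad arguments or a short buffer. */
int demo_fh_deserialize(const unsigned char *buf, size_t buflen,
                        struct demo_fh *out)
{
    if (buf == NULL || out == NULL || buflen < sizeof(*out))
        return -1;
    memcpy(out, buf, sizeof(*out));
    return 0;
}

/* Round trip: the decoded handle must not depend on the original object. */
void demo_fh_roundtrip_check(void)
{
    struct demo_fh in = { 42u, 7u };
    struct demo_fh out;
    unsigned char wire[sizeof(struct demo_fh)];

    assert(demo_fh_serialize(&in, wire, sizeof(wire)) == sizeof(in));
    memset(&in, 0, sizeof(in));   /* original is gone; wire still valid */
    assert(demo_fh_deserialize(wire, sizeof(wire), &out) == 0);
    assert(out.ino == 42u && out.gen == 7u);
}
```

Copying the raw struct bytes like this is only safe within one process or between identical builds; a real wire format would encode each field explicitly.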
Comment 11 Amar Tumballi 2010-10-12 23:27:36 EDT
Issue fixed with http://patches.gluster.com/patch/5466/
Comment 12 Vijay Bellur 2010-10-18 23:25:13 EDT
*** Bug 1909 has been marked as a duplicate of this bug. ***
Comment 13 Shehjar Tikoo 2010-10-19 00:10:18 EDT
Thanks. That's a beautiful fix.