Bug 1509118

Summary: [Ganesha] : Ganesha crashed while exporting volumes in mdc_up_invalidate().
Product: [Red Hat Storage] Red Hat Gluster Storage Reporter: Ambarish <asoman>
Component: nfs-ganeshaAssignee: Kaleb KEITHLEY <kkeithle>
Status: CLOSED ERRATA QA Contact: Manisha Saini <msaini>
Severity: high Docs Contact:
Priority: unspecified    
Version: rhgs-3.3CC: amukherj, bturner, dang, dblack, ffilz, jthottan, kkeithle, mbenjamin, msaini, rhinduja, rhs-bugs, skoduri, ssaha, storage-qa-internal
Target Milestone: ---   
Target Release: RHGS 3.4.0   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: nfs-ganesha-2.5.4-1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-09-04 06:53:36 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1503134    

Description Ambarish 2017-11-03 06:55:54 UTC
Description of problem:
-----------------------

Ganesha crashed while exporting multiple volumes in loop and dumped a core on 3 out of 6 nodes :

Core was generated by `/usr/bin/ganesha.nfsd -L /var/log/ganesha.log -f /etc/ganesha/ganesha.conf -N N'.
Program terminated with signal 11, Segmentation fault.
#0  0x0000562c8dd15ea3 in mdc_up_invalidate (export=0x7fab1cf352e0, handle=0x7fa884002858, flags=271) at /usr/src/debug/nfs-ganesha-2.4.4/src/FSAL/Stackable_FSALs/FSAL_MDCACHE/mdcache_up.c:55
55		key.fsal = export->sub_export->fsal;
(gdb) bt
#0  0x0000562c8dd15ea3 in mdc_up_invalidate (export=0x7fab1cf352e0, handle=0x7fa884002858, flags=271) at /usr/src/debug/nfs-ganesha-2.4.4/src/FSAL/Stackable_FSALs/FSAL_MDCACHE/mdcache_up.c:55
#1  0x0000562c8dc55fe9 in queue_invalidate (ctx=<optimized out>) at /usr/src/debug/nfs-ganesha-2.4.4/src/FSAL_UP/fsal_up_async.c:81
#2  0x0000562c8dcf3889 in fridgethr_start_routine (arg=0x7fa9480019e0) at /usr/src/debug/nfs-ganesha-2.4.4/src/support/fridgethr.c:550
#3  0x00007fabab11fe25 in start_thread (arg=0x7fa80ce3b700) at pthread_create.c:308
#4  0x00007fabaa7ed34d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113
(gdb) 


Version-Release number of selected component (if applicable):
-------------------------------------------------------------

glusterfs-ganesha-3.8.4-50.el7rhgs.x86_64
nfs-ganesha-2.4.4-17.el7rhgs.x86_64


How reproducible:
-----------------

1/1 , but same crash on 3 nodes.

Comment 3 Jiffin 2017-11-03 11:37:37 UTC
On latest code this issue is fixed by Dan https://review.gerrithub.io/#/c/360161/ (15e5c707). Requesting Dan to confirm the same.

Comment 4 Daniel Gryniewicz 2017-11-03 13:28:11 UTC
This should be the correct fix for this, yes.

Comment 5 Ambarish 2017-11-15 08:36:21 UTC
Can this be taken for 3.4.0?

Comment 8 Ambarish 2017-11-17 06:29:43 UTC
Thanks Kaleb!

Atin - Requesting Devel ACK that got lost in collision.

Need Info on Rahul for QA_ACK.

Comment 15 errata-xmlrpc 2018-09-04 06:53:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2018:2610