Bug 1301120
Summary: | smbd crashes with 3.7.6 and VFS module 4.2.3 | ||||||
---|---|---|---|---|---|---|---|
Product: | [Community] GlusterFS | Reporter: | Anders Rydmell <anders> | ||||
Component: | gluster-smb | Assignee: | Raghavendra Talur <rtalur> | ||||
Status: | CLOSED CURRENTRELEASE | QA Contact: | |||||
Severity: | medium | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | 3.7.6 | CC: | anders, anoopcs, bugs, hgowtham | ||||
Target Milestone: | --- | Keywords: | Triaged | ||||
Target Release: | --- | Flags: | anders:
needinfo-
anders: needinfo- anders: needinfo- |
||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | glusterfs-3.7.10 | Doc Type: | Bug Fix | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | |||||||
: | 1315201 (view as bug list) | Environment: | |||||
Last Closed: | 2016-09-02 09:27:08 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Anders Rydmell
2016-01-22 16:10:22 UTC
Created attachment 1117982 [details]
core dump from smbd
Bug 1234877 was fixed in the Samba package, we'll need to find out if samba-4.2.3-11 contains that patch. If this requires a change to the Samba RPM, please update the product (RHEL?) and component. Samba-4.2.3 already contains the fix for issue mentioned in the following upstream bug: https://bugzilla.samba.org/show_bug.cgi?id=11115 Back trace provided here is different from what we have seen from https://bugzilla.redhat.com/show_bug.cgi?id=1234877 and needs some investigation. Therefore https://bugzilla.samba.org/show_bug.cgi?id=11115 is not related to this bug. See my reply to the following thread: http://www.gluster.org/pipermail/gluster-users/2016-February/025293.html From a quick look from the dmesg bt, I suspect a race between some glusterfs timer related threads. But need to find the exact root cause. Hi Anders, Attached core dump file is truncated and its hard to debug from the same. So can you please attach a new complete core dump? I can vaguely suspect an issue regarding the race between gf_timer_proc() and gf_timer_call_cancel() in accessing some already freed content from glusterfs stack. A complete core would help to root cause the issue much easier than from the high-level back trace that we have from /var/log/messages or dmesg. I see that Mukul in bug id 1315201 has provided some new core dumps. Do you still need some from me? It can take a while to get them, because the system I tested on is not currently running gluster as it was configured when I discovered the bug. Hi Anders, Recently uploaded cores were also truncated. There are two options to make sure that cores are not getting truncated: sure way -------- See https://bugzilla.redhat.com/show_bug.cgi?id=1315201#c7 should work (it worked for me) ------------------------------ Use prlimit command to change core limit size for a running process (as soon as we have the pid up and running) as follows: # prlimit --pid=<smb-pid> --core=unlimited Please make sure that you provide the correct pid for the mounted share connection(either CIFS or Windows clients). Because prlimit is always associated with a process id. One among the above mentioned changes will allow Samba to produce complete cores. Hi Anders, Can you please update your glusterfs packages to some version >= 3.7.10 or 3.7.11(which will be available soon)? Because 2 suspected fixes for this issue have merged within 3.7.9 and afterwards. Hi Anders, Were you able to upgrade glusterfs packages to recent version(glusterfs-3.7.11)? If so, do you see crashes post-upgrade? Hi Anders, Any updates on this bug? The following suspected fixes have been present since glusterfs v3.7.10: http://review.gluster.org/#/c/11796/ http://review.gluster.org/#/c/13803/ Since there are no updates from the reporter after upgrading the glusterfs packages to mentioned version we are closing this bug under the assumption that no more crashes were observed. Please feel free to re-open this bug or file a new one as required in case of new issues. |