Bug 683250
| Summary: | slapd crashing when traffic replayed | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | [Retired] 389 | Reporter: | Nathan Kinder <nkinder> | ||||||||||||||||
| Component: | Directory Server | Assignee: | Rich Megginson <rmeggins> | ||||||||||||||||
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Viktor Ashirov <vashirov> | ||||||||||||||||
| Severity: | high | Docs Contact: | |||||||||||||||||
| Priority: | high | ||||||||||||||||||
| Version: | 1.2.8 | CC: | amsharma, andrey.ivanov, dan, diego, jazcek, jgalipea, jmates, martin.donnelly, nhosoi, samh.work+rhb | ||||||||||||||||
| Target Milestone: | --- | Keywords: | VerifiedUpstream | ||||||||||||||||
| Target Release: | --- | ||||||||||||||||||
| Hardware: | i686 | ||||||||||||||||||
| OS: | Linux | ||||||||||||||||||
| Whiteboard: | |||||||||||||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||||||||||||
| Doc Text: | Story Points: | --- | |||||||||||||||||
| Clone Of: | 668619 | ||||||||||||||||||
| : | 684349 (view as bug list) | Environment: | |||||||||||||||||
| Last Closed: | 2015-12-07 16:54:27 UTC | Type: | --- | ||||||||||||||||
| Regression: | --- | Mount Type: | --- | ||||||||||||||||
| Documentation: | --- | CRM: | |||||||||||||||||
| Verified Versions: | Category: | --- | |||||||||||||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||||||||||||
| Embargoed: | |||||||||||||||||||
| Bug Depends On: | |||||||||||||||||||
| Bug Blocks: | 639035, 656390, 684349 | ||||||||||||||||||
| Attachments: |
|
||||||||||||||||||
|
Description
Nathan Kinder
2011-03-08 22:10:49 UTC
Created attachment 483044 [details]
gdb logs
Attachment originally from Jeremy Mates:
gdb traces from two crashes of 389-ds-base-1.2.8-0.2.a2.el5 in response to
replayed production traffic.
Created attachment 483046 [details]
gdb trace from 1.2.8 a3 segfault
Attachment originally from Jeremy Mates:
gdb trace from 1.2.8 a3 segfault added (alpha 3 is harder to segfault than
alpha 2).
Is it possible you could attach your tcpdump so we could replay it in our dev environment? If that is not possible, would you be able to install a debug build in your testing environment? Sorry, the tcpdump contains student account names and other metadata that cannot be shared. I can easily install a debug build. Ok. I'll build you an el5 32-bit package with full debugging enabled. Ok. The new rpms are here: http://rmeggins.fedorapeople.org/ download the base and the -libs package - you don't need the other 2. Install them using rpm -ivh (or upgrade using rpm -Uvh). If/when it crashes, and you run gdb, you'll have to use the gdb 'dir' command to tell it where to find the source code, since there is no debuginfo package. (gdb) dir /usr/src/debug/389-ds-base-VERSION you can use the older .a4 version of the source. Created attachment 483315 [details] gdb trace Crash from http://rmeggins.fedorapeople.org/ packages plus 389-ds-base-1.2.6.a4.tar.bz2 source tree. Thanks. New packages for testing: http://rmeggins.fedorapeople.org/ try these Created attachment 483531 [details]
0001-use-a-big-lock-in-saslbind.patch
Created attachment 483607 [details]
0001-Bug-683250-slapd-crashing-when-traffic-replayed.patch
Created attachment 483608 [details]
0001-Bug-683250-slapd-crashing-when-traffic-replayed.patch
missed a couple of places where I needed to Unlock
Created attachment 483609 [details]
0001-Bug-683250-slapd-crashing-when-traffic-replayed.patch
have to call unlock before send_result
To ssh://git.fedorahosted.org/git/389/ds.git
34f2f30..2c8637c master -> master
commit 2c8637c242ace8a7d61474913c861e336a7809cd
Author: Rich Megginson <rmeggins>
Date: Wed Mar 9 18:27:05 2011 -0700
Reviewed by: nkinder (Thanks!)
Branch: master
Fix Description: There was a race condition in the saslbind.c code if multip
threads and multiple connections were doing gssapi at the same time, with
different points of failure. The solution is to increase the size of the
mutex section in saslbind.c so that all access of pb->pb_conn are protected.
Thanks to Jeremy Mates <jmates> for finding this issue and for his
assistance in testing.
Platforms tested: RHEL6 x86_64, Fedora 14 i386
Flag Day: no
Doc impact: no
To ssh://git.fedorahosted.org/git/389/ds.git
cc578f1..fb7547f 389-ds-base-1.2.8 -> 389-ds-base-1.2.8
Hi, Request you to please share more information on how to test this bug fix. Thanks, Amita (In reply to comment #14) > Hi, > > Request you to please share more information on how to test this bug fix. > > Thanks, > Amita We could never reproduce the crash internally, and we don't have access to the reporter's private data he used to reproduce the crash. So I think we can just run our SASL stress and long duration tests and confirm that this fix did not introduce any regressions. Note: the upstream bug 668619 is verified. Based on Comment 16, marking this bug verified. |