Bug 1307035

Summary: tgtd crash detected via abrtd
Product: Red Hat Enterprise Linux 6 Reporter: michal novacek <mnovacek>
Component: scsi-target-utils Assignee: Andy Grover <agrover>
Status: CLOSED NEXTRELEASE QA Contact: Martin Hoyer <mhoyer>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 6.7 CC: agrover, bgoncalv, cleech, coughlan, jkortus, mnovacek
Target Milestone: rc Keywords: TestBlocker
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: scsi-target-utils-1.0.24-18.el6 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-03-14 18:12:54 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
sosreport from abrtd none

Description michal novacek 2016-02-12 14:20:12 UTC
Description of problem:
The tgtd daemon crashes from time to time. We have a workaround that
restarts it once a crash is detected, but the restart sometimes times out
after 120s.

We use tgtd as the target host for about 300 LVs from a remotely
connected disk array, on a machine that has 250GB of RAM.

Version-Release number of selected component (if applicable):
scsi-target-utils-1.0.24-16.el6.x86_64
2.6.32-573.8.1.el6.x86_64

How reproducible: randomly, about once every three weeks

Steps to Reproduce:
1. tgtd crashes during normal operation and then takes much longer to start (sometimes over two minutes)

Actual results: random crashes from time to time, longer start time than before 6.7

Expected results: no random crashes, short start time

Comment 2 michal novacek 2016-02-12 14:22:04 UTC
Created attachment 1123512 [details]
sosreport from abrtd

Comment 3 Chris Leech 2016-02-12 18:03:13 UTC
# from the sosreport
segfault at 1e4fdc0 ip 0000000000409bb0 sp 00007ffdbdae3fc0 error 4 in tgtd[400000+3e000]
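
For readers unfamiliar with the kernel's segfault line, the fields above can be decoded with a small sketch. It assumes the standard x86 page-fault error-code bits (bit 0: page present, bit 1: write, bit 2: user mode). The names `fault_info`, `decode_fault`, `ip_in_mapping` and `ip_offset` are hypothetical helpers for illustration, not part of tgt or the kernel.

```c
#include <stdbool.h>
#include <stdint.h>

/* Decoded view of the x86 page-fault error code that the kernel
 * prints as "error N" in a segfault line. */
struct fault_info {
    bool page_present; /* bit 0: 0 = page not mapped, 1 = protection violation */
    bool is_write;     /* bit 1: 0 = read access, 1 = write access */
    bool user_mode;    /* bit 2: 1 = fault raised while in user space */
};

static struct fault_info decode_fault(unsigned error_code)
{
    struct fault_info f;
    f.page_present = (error_code & 0x1) != 0;
    f.is_write     = (error_code & 0x2) != 0;
    f.user_mode    = (error_code & 0x4) != 0;
    return f;
}

/* True if ip lies inside the mapping printed as name[base+size]. */
static bool ip_in_mapping(uint64_t ip, uint64_t base, uint64_t size)
{
    return ip >= base && ip - base < size;
}

/* Offset of ip from the start of the mapping. */
static uint64_t ip_offset(uint64_t ip, uint64_t base)
{
    return ip - base;
}
```

With the values from this report, `decode_fault(4)` describes a user-mode read of an unmapped page (the faulting address being 1e4fdc0), and `ip_in_mapping(0x409bb0, 0x400000, 0x3e000)` holds, at offset 0x9bb0 into tgtd. A base of 0x400000 is the usual load address of a non-PIE x86-64 executable, which would explain why the absolute ip can be handed to addr2line unchanged.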

# addr2line -e `which tgtd` 409bb0
/usr/src/debug/fujita-tgt-e039354/usr/iscsi/target.c:410

which is a list_for_each_entry_safe loop in iscsi_target_destroy()
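
As background on that loop idiom, here is a minimal, self-contained sketch of a Linux-style intrusive list with a `list_for_each_entry_safe` macro. It is not the tgt source: `struct session`, `destroy_all` and `demo` are hypothetical names, and the macro is written from the common list.h pattern using the GCC/Clang `__typeof__` extension.

```c
#include <stddef.h>
#include <stdlib.h>

/* Minimal intrusive doubly linked list in the Linux list.h style. */
struct list_head { struct list_head *next, *prev; };

#define list_entry(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

/* "Safe" iteration: n caches the next entry before the body runs,
 * so the body may unlink and free pos itself. */
#define list_for_each_entry_safe(pos, n, head, member)                 \
    for (pos = list_entry((head)->next, __typeof__(*pos), member),     \
         n = list_entry(pos->member.next, __typeof__(*pos), member);   \
         &pos->member != (head);                                       \
         pos = n, n = list_entry(n->member.next, __typeof__(*n), member))

static void list_init(struct list_head *h) { h->next = h->prev = h; }

static void list_add_tail(struct list_head *entry, struct list_head *head)
{
    entry->prev = head->prev;
    entry->next = head;
    head->prev->next = entry;
    head->prev = entry;
}

static void list_del(struct list_head *entry)
{
    entry->prev->next = entry->next;
    entry->next->prev = entry->prev;
}

/* Hypothetical element type, for illustration only. */
struct session { int id; struct list_head link; };

/* Unlink and free every entry; returns how many were freed. */
static int destroy_all(struct list_head *head)
{
    struct session *s, *tmp;
    int freed = 0;
    list_for_each_entry_safe(s, tmp, head, link) {
        list_del(&s->link);
        free(s);
        freed++;
    }
    return freed;
}

/* Build a list of count entries, destroy it, and return the number
 * freed, or -1 if the list is not empty afterwards. */
static int demo(int count)
{
    struct list_head sessions;
    list_init(&sessions);
    for (int i = 0; i < count; i++) {
        struct session *s = malloc(sizeof(*s));
        if (!s)
            break;
        s->id = i;
        list_add_tail(&s->link, &sessions);
    }
    int freed = destroy_all(&sessions);
    return sessions.next == &sessions ? freed : -1;
}
```

The safe variant only protects against removal of the current entry. If the loop body, or anything it calls, frees the cached next entry, or if an entry was freed earlier while still linked, the loop reads stale memory, and that would show up as a fault like the one above. Whether that is what happens in iscsi_target_destroy() is not established by this sketch.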