Bug 1413501
Summary: | rgw_obj_expirer thread segfaults and nfsd process terminates | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | shilpa <smanjara> |
Component: | Documentation | Assignee: | Bara Ancincova <bancinco> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | shilpa <smanjara> |
Severity: | unspecified | Docs Contact: | |
Priority: | unspecified | ||
Version: | 2.1 | CC: | asriram, cbodley, ceph-eng-bugs, hnallurv, kdreyer, mbenjamin, owasserm, smanjara, sweil |
Target Milestone: | rc | ||
Target Release: | 2.2 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2017-03-21 23:48:46 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
shilpa
2017-01-16 08:36:25 UTC
Adding more detailed logging directives (see /etc/ganesha/ganesha.conf) and redirecting logging to a file, I see the following likely root cause:

[root@magna039 ganesha]# /usr/bin/ganesha.nfsd -f /etc/ganesha/ganesha.conf -F
2017-01-16 18:34:50.720992 7ff78f6a20c0 -1 auth: unable to find a keyring on /var/lib/ceph/radosgw/-admin/keyring: (2) No such file or directory
2017-01-16 18:34:50.722071 7ff78f6a20c0 -1 monclient(hunting): authenticate NOTE: no keyring found; disabled cephx authentication
2017-01-16 18:34:50.722536 7ff78f6a20c0 -1 Couldn't init storage provider (RADOS)
*** Caught signal (Segmentation fault) **

That is, I think the problem with the path to the radosgw admin keyring is preventing RGW from starting within the NFS Ganesha instance.

Update with working setup:

1. There is a segfault on shutdown after a failure to initialize RADOS, triggered proximately by misconfiguration (tracker 17638). This won't be fixed in 2.2, but is being worked on.

2. The root cause of the misconfiguration is missing values for the radosgw arguments "--name" and "--cluster". As of 2.1, the correct way to set these values (on an installation that requires them, such as this one) is to pass them as parameters in the RGW FSAL configuration block:

RGW {
    ceph_conf = "/etc/ceph/ceph.conf";
    cluster = "ceph";
    name = "client.rgw.magna039";
    init_args = "-d --debug-rgw=16";
}

It turns out that currently the "init_args" value should be passed as a set of separate, null-terminated strings appended to the librgw_create(...) argv argument, not passed in as a single string.
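The point about "init_args" can be illustrated with a small sketch. It assumes the value is a plain whitespace-separated string with no quoting, and it uses a hypothetical helper, append_init_args, that is not part of librgw or NFS Ganesha. It only shows what "separate, null-terminated strings appended to argv" means in practice; it is not the code the FSAL actually uses.

```c
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper (not part of librgw or NFS Ganesha): split a
 * whitespace-separated init_args string and append each token to argv
 * as its own NUL-terminated string.
 *
 * argv must have room for max_argc pointers, including the terminating
 * NULL entry. Returns the new argc, or -1 on error. Quoted arguments are
 * not handled in this sketch, and the caller owns the allocated tokens.
 */
static int append_init_args(const char *init_args, char **argv,
                            int argc, int max_argc)
{
    const char *p = init_args;

    if (argc >= max_argc)
        return -1;

    for (;;) {
        p += strspn(p, " \t");            /* skip whitespace between tokens */
        if (*p == '\0')
            break;

        size_t len = strcspn(p, " \t");   /* length of the next token */
        if (argc >= max_argc - 1)         /* keep a slot for the final NULL */
            return -1;

        char *tok = malloc(len + 1);
        if (tok == NULL)
            return -1;
        memcpy(tok, p, len);
        tok[len] = '\0';                  /* each argument is its own string */

        argv[argc++] = tok;
        p += len;
    }

    argv[argc] = NULL;                    /* conventional argv terminator */
    return argc;
}
```

With init_args = "-d --debug-rgw=16" and an argv that already holds one entry, the helper returns 3 and leaves argv[1] = "-d" and argv[2] = "--debug-rgw=16" as two distinct strings. The resulting argc and argv are what would then be handed to librgw_create(...), instead of a single "-d --debug-rgw=16" element.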