Bug 1787664
Summary: | Accessing select directories unmounts the filesystem. | ||
---|---|---|---|
Product: | [Community] GlusterFS | Reporter: | Calvin Dunigan <cdunigan> |
Component: | distribute | Assignee: | bugs <bugs> |
Status: | CLOSED NOTABUG | QA Contact: | |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 6 | CC: | bugs, nchilaka, pasik, rhs-bugs, sasundar, storage-qa-internal |
Target Milestone: | --- | Keywords: | ZStream |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2020-01-17 17:13:02 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Calvin Dunigan
2020-01-03 22:12:08 UTC
This bug is moved to Gluster product, as this is not a downstream product - RHGS - related one. Please upload the relevant glusterd.log ( /var/log/glusterfs/glusterd.log ) and brick logs ( /var/log/glusterfs/bricks/* ) If possible, you can also get us the sosreports from these centos nodes. Unfortunately we have a couple of restrictions. First, our customer is a Federal agency and requires governmental clearance to see most data. More unfortunately, when the problem first appeared the logs grew so large that they were filling the root filesystem and were truncated to free up space. Also, due to the nature of the customer, SOS is not an option. I do have some logs that cover a time when the problem was active. I could "scrub" those of sensitive data and forward them if you think that would be helpful. Finally, I have found a potential cure. It seems that the filesystem error only occurs the first time a file is accessed. (The first time since since the onset of the problem on Dec. 22.) So I wrote a shell script that touched every file, and for those that crashed the filesystem, it killed the glusterfs proc, remounted and continued. So far, the problem hasn't recurred for any given file. I have no way of knowing if this is a permanent cure (it's certainly not a fix) or if the problems will come back. I only mention it in the hopes that it may provide some insight into the problem. Let me know if the logs that cover only a portion of the time that the issue was present will be of value. This problem was self-inflicted, we copied files directly to the bricks. |