Bug 1259312
Summary: | Data Tiering:File create and new writes to existing file fails when the hot tier is full instead of redirecting/flushing the data to cold tier | ||||||
---|---|---|---|---|---|---|---|
Product: | [Community] GlusterFS | Reporter: | Nag Pavan Chilakam <nchilaka> | ||||
Component: | tiering | Assignee: | Nithya Balachandran <nbalacha> | ||||
Status: | CLOSED EOL | QA Contact: | bugs <bugs> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | 3.7.5 | CC: | bugs, dlambrig, jbyers, sankarshan, vnosov | ||||
Target Milestone: | --- | Keywords: | Reopened | ||||
Target Release: | --- | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | |||||||
: | 1277112 (view as bug list) | Environment: | |||||
Last Closed: | 2017-03-08 10:47:55 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
Bug Depends On: | |||||||
Bug Blocks: | 1277112 | ||||||
Attachments: |
|
Description
Nag Pavan Chilakam
2015-09-02 11:58:47 UTC
sosreports @ rhsqe-repo.lab.eng.blr.redhat.com:/home/repo/sosreports/bug.1259312/
Created attachment 1069404: qe cli log during bug raising
This is failing on glusterfs-server-3.7.5-0.18:

[root@rhel7-autofuseclient estonia]# for i in {1..100};do touch x.$i;done
touch: cannot touch ‘x.1’: No space left on device
touch: cannot touch ‘x.2’: No space left on device
touch: cannot touch ‘x.3’: No space left on device
touch: cannot touch ‘x.4’: No space left on device
touch: cannot touch ‘x.5’: No space left on device
touch: cannot touch ‘x.6’: No space left on device
touch: cannot touch ‘x.7’: No space left on device
touch: cannot touch ‘x.8’: No space left on device
touch: cannot touch ‘x.9’: No space left on device
touch: cannot touch ‘x.10’: No space left on device
touch: cannot touch ‘x.11’: No space left on device
touch: cannot touch ‘x.12’: No space left on device
touch: cannot touch ‘x.13’: No space left on device
touch: cannot touch ‘x.14’: No space left on device
touch: cannot touch ‘x.15’: No space left on device
touch: cannot touch ‘x.16’: No space left on device
touch: cannot touch ‘x.17’: No space left on device
touch: cannot touch ‘x.18’: No space left on device
touch: cannot touch ‘x.19’: No space left on device
touch: cannot touch ‘x.20’: No space left on device
touch: cannot touch ‘x.21’: No space left on device
touch: cannot touch ‘x.22’: No space left on device
touch: cannot touch ‘x.23’: No space left on device
touch: cannot touch ‘x.24’: No space left on device
touch: cannot touch ‘x.25’: No space left on device
touch: cannot touch ‘x.26’: No space left on device
touch: cannot touch ‘x.27’: No space left on device
touch: cannot touch ‘x.28’: No space left on device
touch: cannot touch ‘x.29’: No space left on device
touch: cannot touch ‘x.30’: No space left on device
touch: cannot touch ‘x.31’: No space left on device
touch: cannot touch ‘x.32’: No space left on device
touch: cannot touch ‘x.33’: No space left on device
touch: cannot touch ‘x.34’: No space left on device
touch: cannot touch ‘x.35’: No space left on device
touch: cannot touch ‘x.36’: No space left on device
touch: cannot touch ‘x.37’: No space left on device
touch: cannot touch ‘x.38’: No space left on device
touch: cannot touch ‘x.39’: No space left on device
touch: cannot touch ‘x.40’: No space left on device
touch: cannot touch ‘x.41’: No space left on device
touch: cannot touch ‘x.42’: No space left on device
touch: cannot touch ‘x.43’: No space left on device
touch: cannot touch ‘x.44’: No space left on device
touch: cannot touch ‘x.45’: No space left on device
touch: cannot touch ‘x.46’: No space left on device
touch: cannot touch ‘x.47’: No space left on device
touch: cannot touch ‘x.48’: No space left on device
touch: cannot touch ‘x.49’: No space left on device
touch: cannot touch ‘x.50’: No space left on device
touch: cannot touch ‘x.51’: No space left on device
touch: cannot touch ‘x.52’: No space left on device
touch: cannot touch ‘x.53’: No space left on device
touch: cannot touch ‘x.54’: No space left on device
touch: cannot touch ‘x.55’: No space left on device
touch: cannot touch ‘x.56’: No space left on device

Refer to bz#1271151.

We cannot do anything about write updates, but my understanding is that DHT has a mechanism to redirect new file creates to a different brick when the hashed subvolume is full. We need to understand why this does not work with tier.

What was the name of the volume on which this failed?

I tried this on the latest master code and could not reproduce the issue where new file creates were still going to the hashed subvol. We cannot do anything about the writes until the file is moved to the cold tier; this is existing DHT behaviour. I am moving this to WorksForMe. Please reopen if seen again.

Hi, I saw this not working on the latest downstream build. I am cloning this to downstream for further tracking.

===========================
[root@mia diskfull]# for i in {1..10};do dd if=/dev/urandom of=bigfi bs=1024 count=300000;done
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device
dd: failed to open ‘bigfi’: No space left on device

fuse mount logs:
==================
d 9 times between [2015-11-02 04:40:08.609241] and [2015-11-02 04:40:08.723177]
The message "E [MSGID: 114031] [client-rpc-fops.c:251:client3_3_mknod_cbk] 0-diskfull-client-6: remote operation failed. Path: /bigfi [No space left on device]" repeated 9 times between [2015-11-02 04:40:08.609351] and [2015-11-02 04:40:08.723197]
^C
[root@mia glusterfs]# tail -f mnt-diskfull.log
[2015-11-02 04:40:08.635409] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30100: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.647871] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30102: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.660484] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30104: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.673046] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30106: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.685465] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30108: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.698471] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30110: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.710918] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30112: /bigfi => -1 (No space left on device)
[2015-11-02 04:40:08.723667] W [fuse-bridge.c:1978:fuse_create_cbk] 0-glusterfs-fuse: 30114: /bigfi => -1 (No space left on device)
The message "E [MSGID: 114031] [client-rpc-fops.c:251:client3_3_mknod_cbk] 0-diskfull-client-7: remote operation failed. Path: /bigfi [No space left on device]" repeated 9 times between [2015-11-02 04:40:08.609241] and [2015-11-02 04:40:08.723177]
The message "E [MSGID: 114031] [client-rpc-fops.c:251:client3_3_mknod_cbk] 0-diskfull-client-6: remote operation failed. Path: /bigfi [No space left on device]" repeated 9 times between [2015-11-02 04:40:08.609351] and [2015-11-02 04:40:08.723197]

Looks like the hashed file creation is failing because there is no space on the hot tier. We should ideally never get to this situation where the hot tier is 100% full, as it will cause DHT to stop working properly.

This bug is getting closed because GlusterFS-3.7 has reached its end-of-life.

Note: This bug is being closed using a script. No verification has been performed to check if it still exists on newer releases of GlusterFS. If this bug still exists in newer GlusterFS releases, please reopen this bug against the newer release.

The bug is reproduced on GlusterFS 3.12.3.
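
The analysis in this thread (creates are redirected when the hashed subvolume is full, yet the hashed file creation still fails once the hot tier is 100% full) can be illustrated with a toy model. This is only a sketch and not GlusterFS code: `Subvol`, `place_new_file`, `min_free_blocks`, the CRC32 hash and the block accounting are all hypothetical, and the model assumes that a redirected create still has to write a small pointer entry on the hashed subvolume, which is one possible reading of the failing mknod calls in the fuse log.

```python
import errno
import zlib
from dataclasses import dataclass


@dataclass
class Subvol:
    """Toy stand-in for one subvolume (hypothetical, not a GlusterFS type)."""
    name: str
    free_blocks: int


def place_new_file(filename, subvols, min_free_blocks):
    """Return (data_subvol_name, pointer_subvol_name_or_None) for a new file.

    Raises OSError(ENOSPC) when the create cannot be satisfied.
    """
    # Pick the hashed subvolume from the file name (CRC32 is an arbitrary
    # stand-in for the real hash function).
    hashed = subvols[zlib.crc32(filename.encode()) % len(subvols)]

    # Normal case: the hashed subvolume has enough room, so use it directly.
    if hashed.free_blocks > min_free_blocks:
        hashed.free_blocks -= 1
        return hashed.name, None

    # Redirect case: choose the other subvolume with the most free space.
    others = [s for s in subvols if s is not hashed]
    target = max(others, key=lambda s: s.free_blocks, default=None)
    if target is None or target.free_blocks <= min_free_blocks:
        raise OSError(errno.ENOSPC, "No space left on device")

    # ASSUMPTION: the redirect needs one block on the hashed subvolume for a
    # pointer entry. If that subvolume is completely full, the create fails
    # even though the redirect target has plenty of space.
    if hashed.free_blocks < 1:
        raise OSError(errno.ENOSPC, "No space left on device")

    hashed.free_blocks -= 1
    target.free_blocks -= 1
    return target.name, hashed.name
```

In this model a nearly full hashed subvolume still accepts new files via the redirect, while a completely full one rejects every create with ENOSPC, which matches the symptom reported here. Whether the real tier translator behaves this way would need to be confirmed against the source.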