Bug 1238068
| Summary: | [geo-rep]: some of the files are not removed from slaves when performed rm -rf * from master | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rahul Hinduja <rhinduja> |
| Component: | geo-replication | Assignee: | Kotresh HR <khiremat> |
| Status: | CLOSED DUPLICATE | QA Contact: | storage-qa-internal <storage-qa-internal> |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | rhgs-3.1 | CC: | avishwan, chrisw, csaba, khiremat, nlevinki, sankarshan, smohan, spalai |
| Target Milestone: | --- | Keywords: | ZStream |
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2018-11-19 09:08:57 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1235633, 1310194 | ||
| Bug Blocks: | |||
|
Description
Rahul Hinduja
2015-07-01 06:44:57 UTC
From the logs we could not concretely say what could have happened. But we could depict the following things from changelogs that were processed and also considering the bricks going offline and coming online. For one of the file which has not removed from slave, the entries are as follows in .processed directory. .processing: 1435675373 UNLINK georep2: b2 normal changelog .processing ========================================================================= georep1: ./764586b145d7206a154a778f64bd2f50/.processed/CHANGELOG.1435606117:E 988e518b-0766-473a-990c-4427577cc413 CREATE 384 0 0 00000000-0000-0000-0000-000000000001%2Fasound.conf ./764586b145d7206a154a778f64bd2f50/.processed/CHANGELOG.1435675375:E c2edfe25-3ba4-4d09-9f27-c9cd5e4c0bfe UNLINK 00000000-0000-0000-0000-000000000001%2Fasound.conf georep2: [root@georep2 ssh%3A%2F%2Froot%4010.70.46.101%3Agluster%3A%2F%2F127.0.0.1%3Aslave]# find . | xargs grep 00000000-0000-0000-0000-000000000001%2Fasound 2>/dev/null ./764586b145d7206a154a778f64bd2f50/.processing/CHANGELOG.1435675373:E c2edfe25-3ba4-4d09-9f27-c9cd5e4c0bfe UNLINK 00000000-0000-0000-0000-000000000001%2Fasound.conf georep3: [root@georep3 ssh%3A%2F%2Froot%4010.70.46.101%3Agluster%3A%2F%2F127.0.0.1%3Aslave]# find . | xargs grep 00000000-0000-0000-0000-000000000001%2Fasound 2>/dev/null Binary file ./764586b145d7206a154a778f64bd2f50/xsync/archive_201506.tar matches ./764586b145d7206a154a778f64bd2f50/xsync/XSYNC-CHANGELOG.1435666036:E c2edfe25-3ba4-4d09-9f27-c9cd5e4c0bfe MKNOD 33188 0 0 00000000-0000-0000-0000-000000000001%2Fasound.conf georep4: [root@georep4 ssh%3A%2F%2Froot%4010.70.46.101%3Agluster%3A%2F%2F127.0.0.1%3Aslave]# find . | xargs grep 00000000-0000-0000-0000-000000000001%2Fasound 2>/dev/null Binary file ./c19b89ac45352ab8c894d210d136dd56/.history/.processed/archive_201506.tar matches ./c19b89ac45352ab8c894d210d136dd56/.history/.processed/CHANGELOG.1435675319:E c2edfe25-3ba4-4d09-9f27-c9cd5e4c0bfe CREATE 384 0 0 00000000-0000-0000-0000-000000000001%2Fasound.conf ======================================================== .processed: CHANGELOG.1435606117 CREATE georep1: b2 normal changelog XSYNC-CHANGELOG.1435666036 MKNOD georep3: b2 xsync (Could be after replace brick. This was added as part of replace brick) The flip between gerep3:/rhs/brick2/b2 to georep4:/rhs/brick1/b1. So history picked and created the file CHANGELOG.1435675319 CREATE georep4: b1 history CHANGELOG.1435675375 DEL georep1: b2 normal changelog Because of ping pong nature of bricks going down and becoming online, and also stime query from mount point which gives max of replica and min of distribute, there could be overlaps of changelog processsing and it could process changelog which is already processed in other brick which went down. By the time it is processing already processed changelog, if this brick also goes down. There could be chances of inconsistent sync. Say in above, if CHANGELOG.1435675319 is processed last, the file would remain. which is what possibly could have happened. This bug was accidentally moved from POST to MODIFIED via an error in automation, please see mmccune with any questions *** This bug has been marked as a duplicate of bug 1400198 *** |