1537480 – quorum-reads option can give inconsistent reads

Bug 1537480 - quorum-reads option can give inconsistent reads

Summary: quorum-reads option can give inconsistent reads

Keywords:
Status:	CLOSED DEFERRED
Alias:	None
Product:	Red Hat Gluster Storage
Classification:	Red Hat Storage
Component:	replicate
Sub Component:
Version:	rhgs-3.3
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Karthik U S
QA Contact:	Nag Pavan Chilakam
Docs Contact:
URL:
Whiteboard:
Depends On:	1541438
Blocks:
TreeView+	depends on / blocked

Reported:	2018-01-23 10:15 UTC by Pranith Kumar K
Modified:	2020-02-07 06:32 UTC (History)
CC List:	7 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Clones:	1541438 (view as bug list)
Environment:
Last Closed:	2020-02-06 07:39:36 UTC
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Bugzilla	1384983	0	high	CLOSED	split-brain observed with arbiter & replica 3 volume.	2021-06-10 11:36:08 UTC

Internal Links: 1384983

Description Pranith Kumar K 2018-01-23 10:15:56 UTC

Description of problem:
For a file, Brick-A has pending operations on Brick-B, Brick-B has pending operations on Brick-C and Brick-C has pending operations on Brick-A. Since no two other bricks are blaming one brick any of these bricks can be considered as a good copy and a heal can be done. Reads will fail until heal happens.
The consistent read issue we found happens when Any one of the bricks go down in this state. If Brick-A goes down, Reads will be served from Brick-B and if Brick-B goes down Reads will be served from Brick-C. If Brick-C goes down reads will be served from Brick-A. All these reads could give different content.


Version-Release number of selected component (if applicable):


How reproducible:
It is extremely difficult to hit this case. We are mostly going to simulate it by putting breakpoints in gdb.

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 2 Karthik U S 2018-02-12 07:30:48 UTC

To fix this issue we need to change the read, write and self-heal transactions, which are the heart of AFR transactions. It is better to give some time for it to become stable upstream before taking it to downstream. So targeting it for the later release.

Comment 3 Karthik U S 2018-02-12 07:32:41 UTC

Upstream patch: https://review.gluster.org/#/c/19477/

Note You need to log in before you can comment on or make changes to this bug.