Bug 1703423
| Summary: | Multiple disconnect events being propagated for the same child | |||
|---|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Ravishankar N <ravishankar> | |
| Component: | rpc | Assignee: | Raghavendra G <rgowdapp> | |
| Status: | CLOSED ERRATA | QA Contact: | Mugdha Soni <musoni> | |
| Severity: | high | Docs Contact: | ||
| Priority: | high | |||
| Version: | rhgs-3.5 | CC: | amukherj, nchilaka, rgowdapp, rhs-bugs, sheggodu, vdas | |
| Target Milestone: | --- | Keywords: | Regression | |
| Target Release: | RHGS 3.5.0 | |||
| Hardware: | Unspecified | |||
| OS: | Linux | |||
| Whiteboard: | ||||
| Fixed In Version: | glusterfs-6.0-7 | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1716979 (view as bug list) | Environment: | ||
| Last Closed: | 2019-10-30 12:20:55 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1696809, 1716979, 1739334, 1739335, 1739336 | |||
|
Comment 6
Raghavendra G
2019-06-04 14:09:56 UTC
upstream patch: https://review.gluster.org/#/c/glusterfs/+/22821/ Tested with the following:
glusterfs-6.0-12.el7rhgs.x86_64
glusterfs-server-6.0-12.el7rhgs.x86_64
glusterfs-libs-6.0-12.el7rhgs.x86_64
glusterfs-fuse-6.0-12.el7rhgs.x86_64
glusterfs-api-6.0-12.el7rhgs.x86_64
glusterfs-rdma-6.0-12.el7rhgs.x86_64
glusterfs-cli-6.0-12.el7rhgs.x86_64
glusterfs-geo-replication-6.0-12.el7rhgs.x86_64
glusterfs-events-6.0-12.el7rhgs.x86_64
glusterfs-debuginfo-6.0-12.el7rhgs.x86_64
glusterfs-client-xlators-6.0-12.el7rhgs.x86_64
The steps performed :
1.Created a replica 3 volume and started it.
2.Installed the GDB and glusterfs-debuginfo packages on the node.
3.Enter into the gdb session (gdb -p `pidof glusterfs`) and set a break point (b __afr_handle_child_down_event)
4.Went to one of the node and killed a brick on the node (kill -9 pid)
5.The following output was seen.There was only 1 event per one disconnect and didnot see any issue that of events generating every 3 sec for one disconnect.
(gdb) b __afr_handle_child_down_event
Breakpoint 1 at 0x7f675b62b940: file afr-common.c, line 5249.
(gdb) c
Continuing.
[Switching to Thread 0x7f675cafe700 (LWP 16524)]
Breakpoint 1, __afr_handle_child_down_event (this=this@entry=0x7f675400ffe0, child_xlator=child_xlator@entry=0x7f675400d5c0, idx=idx@entry=2, child_latency_msec=child_latency_msec@entry=-1,
event=event@entry=0x7f675cafcc5c, call_psh=call_psh@entry=0x7f675cafcc68, up_child=up_child@entry=0x7f675cafcc6c) at afr-common.c:5249
5249 {
.
Based on the above output moving the bug to verified state.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2019:3249 |