Bug 1382686
| Summary: | heal info --xml when bricks are down in a systemic environment is not displaying anything even after more than 30minutes | |||
|---|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Nag Pavan Chilakam <nchilaka> | |
| Component: | replicate | Assignee: | Pranith Kumar K <pkarampu> | |
| Status: | CLOSED WONTFIX | QA Contact: | Nag Pavan Chilakam <nchilaka> | |
| Severity: | urgent | Docs Contact: | ||
| Priority: | low | |||
| Version: | rhgs-3.2 | CC: | amukherj, nchilaka, ravishankar, rcyriac, rhinduja, rhs-bugs, storage-qa-internal | |
| Target Milestone: | --- | Keywords: | ZStream | |
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1395993 (view as bug list) | Environment: | ||
| Last Closed: | 2018-10-17 08:30:13 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1395993, 1396779 | |||
|
Description
Nag Pavan Chilakam
2016-10-07 12:13:12 UTC
[root@dhcp37-150 glusterfs]# tailf glfsheal-distrepvol.log [2016-10-07 11:47:07.332315] E [socket.c:2309:socket_connect_finish] 0-distrepvol-client-7: connection to 10.70.37.150:49155 failed (Connection refused) [2016-10-07 11:47:07.335381] I [rpc-clnt.c:1947:rpc_clnt_reconfig] 0-distrepvol-client-2: changing port to 49154 (from 0) [2016-10-07 11:47:07.340317] E [socket.c:2309:socket_connect_finish] 0-distrepvol-client-2: connection to 10.70.35.3:49154 failed (Connection refused) [2016-10-07 11:47:09.340486] E [name.c:262:af_inet_client_get_remote_sockaddr] 0-distrepvol-snapd-client: DNS resolution failed on host /var/run/glusterd.socket [2016-10-07 11:47:10.349650] I [rpc-clnt.c:1947:rpc_clnt_reconfig] 0-distrepvol-client-7: changing port to 49155 (from 0) [2016-10-07 11:47:10.349914] I [rpc-clnt.c:1947:rpc_clnt_reconfig] 0-distrepvol-client-5: changing port to 49155 (from 0) [2016-10-07 11:47:10.352448] E [socket.c:2309:socket_connect_finish] 0-distrepvol-client-7: connection to 10.70.37.150:49155 failed (Connection refused) [2016-10-07 11:47:10.355497] E [socket.c:2309:socket_connect_finish] 0-distrepvol-client-5: connection to 10.70.37.187:49155 failed (Connection refused) [2016-10-07 11:47:10.355554] I [rpc-clnt.c:1947:rpc_clnt_reconfig] 0-distrepvol-client-2: changing port to 49154 (from 0) [2016-10-07 11:47:10.358502] E [socket.c:2309:socket_connect_finish] 0-distrepvol-client-2: connection to 10.70.35.3:49154 failed (Connection refused) [2016-10-07 11:47:12.356667] E [name.c:262:af_inet_client_get_remote_sockaddr] 0-distrepvol-snapd-client: DNS resolution failed on host /var/run/glusterd.socket As per our discussion last about this bz, there seem to be new entries that are added for healing right? i.e. will the command ever end in that case? statedumps available @ [qe@rhsqe-repo nchilaka]$ chmod -R 0777 /home/repo/sosreports/nchilaka/bug.1382686 [qe@rhsqe-repo nchilaka]$ hostname rhsqe-repo.lab.eng.blr.redhat.com (4 statedumps taken every 30min .....for all 4 nodes) Tested on 3.8.4-13: for a volume which has lot of files in heal pending [root@dhcp35-37 ~]# time gluster v heal distrep info|grep ntries Number of entries: 112484 Number of entries: 113455 Number of entries: 112327 Number of entries: 113872 the heal info has above pending entries. Also heal info keeps streaming the o/p instead of buffering. with heal info --xml , i still see the issue of not streaming the o/p and instead all is dumped at the end. Hence failing the fix(discussed with Pranith) [root@dhcp35-37 ~]# gluster v info distrep g Volume Name: distrep Type: Distributed-Replicate Volume ID: df5319f0-d889-4030-bb39-b8a41936a726 Status: Started Snapshot Count: 0 Number of Bricks: 2 x 2 = 4 Transport-type: tcp Bricks: Brick1: 10.70.35.37:/rhs/brick1/distrep Brick2: 10.70.35.116:/rhs/brick1/distrep Brick3: 10.70.35.37:/rhs/brick2/distrep Brick4: 10.70.35.116:/rhs/brick2/distrep Options Reconfigured: cluster.self-heal-daemon: disable performance.readdir-ahead: on nfs.disable: on [root@dhcp35-37 ~]# gluster v status distrep Status of volume: distrep Gluster process TCP Port RDMA Port Online Pid ------------------------------------------------------------------------------ Brick 10.70.35.37:/rhs/brick1/distrep 49153 0 Y 600 Brick 10.70.35.116:/rhs/brick1/distrep 49152 0 Y 32269 Brick 10.70.35.37:/rhs/brick2/distrep 49154 0 Y 620 Brick 10.70.35.116:/rhs/brick2/distrep 49153 0 Y 32288 Task Status of Volume distrep ------------------------------------------------------------------------------ There are no active volume tasks [root@dhcp35-37 ~]# I'm closing this since the BZ is old and there are no immediate plans to look at this. If the issue occurs in the latest recent RHGS version and you feel it is important to be looked at, please re-open. |