Description of problem:
When a node fails, geo-replication does not fail over gracefully: the affected nodes show a "defunct" status, and an attempt to stop the geo-replication session fails as well.

Version-Release number of selected component (if applicable):
RHS 2.1

Actual results:

[deploy@pprddapglu13400 ~]$ sudo gluster volume geo-replication dp-vol ssh://pprddapglu13300.example.net::dp-vol1 status
NODE                          MASTER    SLAVE                                        HEALTH    UPTIME
-------------------------------------------------------------------------------------------------------
pprddapglu13400.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:49:50
pprddapglu13404.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   defunct   N/A
pprddapglu13418.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13409.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13423.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13405.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13414.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13422.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13421.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13419..example.net  dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13427.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13408.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13410.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:49:48
pprddapglu13407.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13406.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   defunct   N/A
pprddapglu13426.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   defunct   N/A
pprddapglu13412.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13415.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13425.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13417.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13416.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:49:49
pprddapglu13401.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13402.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13424.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13420.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13403.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13411.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14
pprddapglu13413.example.net   dp-vol    ssh://pprddapglu13300.example.net::dp-vol1   Stable    07:58:14

[deploy@pprddapglu13400 ~]$ sudo gluster volume geo-replication dp-vol ssh://pprddapglu13300.example.net::dp-vol1 stop
Staging failed on pprddapglu13406.ie.example.net. Error: geo-replication session b/w dp-vol & ssh://pprddapglu13300.example.net::dp-vol1 is not running on this node.
Staging failed on pprddapglu13404.example.net. Error: geo-replication session b/w dp-vol & ssh://pprddapglu13300.example.net::dp-vol1 is not running on this node.
Staging failed on pprddapglu13426.example.net. Error: geo-replication session b/w dp-vol & ssh://pprddapglu13300.example.net::dp-vol1 is not running on this node.
geo-replication command failed

*** This bug has been marked as a duplicate of bug 1022518 ***