429927 – qdisk does not check the heuristics

Bug 429927 - qdisk does not check the heuristics

Summary: qdisk does not check the heuristics

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 5
Classification:	Red Hat
Component:	cman
Sub Component:
Version:	5.1
Hardware:	All
OS:	Linux
Priority:	urgent
Severity:	urgent
Target Milestone:	rc
Target Release:	---
Assignee:	Lon Hohberger
QA Contact:	GFS Bugs
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	430574
TreeView+	depends on / blocked

Reported:	2008-01-23 20:37 UTC by Thorsten Scherf
Modified:	2009-04-16 22:18 UTC (History)
CC List:	4 users (show)
Fixed In Version:	RHBA-2008-0347
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2008-05-21 15:58:44 UTC
Target Upstream Version:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
Kills ping after 2 seconds (225 bytes, text/plain) 2008-01-23 23:09 UTC, Lon Hohberger	no flags	Details
Fix (779 bytes, patch) 2008-01-24 00:16 UTC, Lon Hohberger	no flags	Details \| Diff
View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHBA-2008:0347	0	normal	SHIPPED_LIVE	cman bug fix and enhancement update	2008-05-20 12:39:41 UTC

Description Thorsten Scherf 2008-01-23 20:37:03 UTC

Description of problem:
qdisk seems to ignore the interval set in cluster.conf to check the heuristics.
I have to restart the qdiskd in order to perform a check. I have this scenario:

cluster.conf on node1 (1vote) and node2 (1vote):

<quorumd device="/dev/sda2" interval="2" min_score="1" tko="2" votes="2">
                <heuristic interval="2" program="ping -c1 -t1 172.16.0.2"
score="1"/>
</quorumd>

when I now set disable access to 172.16.0.2 with an iptables rule, nothing
happens. when I restart for several minutes. when I restart qdiskd the quorum
goes and cluster isn't quorate any more. 

Nodes: 2
Expected votes: 3
Total votes: 2
Quorum: 3 Activity blocked

when I disable the iptables rule, nothing happens. I have to restart the qdiskd
again to check the heuristic and to get aware of the reestablished link.

Nodes: 2
Expected votes: 3
Total votes: 4
Quorum: 3  

several questions:
* why do I have to restart the qdiskd? it should check the heuristic in the
interval specified 
* why does the number of expected votes changes from 4 to 3 when the cluster
isn't quorate
* why does no node gets fenced when the cluster isn't quorate?
 



Version-Release number of selected component (if applicable):
cman-2.0.73-1.el5

How reproducible:
see above

Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 2 Lon Hohberger 2008-01-23 22:53:39 UTC

Ping's not exiting.

Comment 3 Lon Hohberger 2008-01-23 23:03:59 UTC

It gets stuck in a loop:

 recvmsg ...
 -1 / EAGAIN

This only seems to happen for me if I start qdiskd from within the init script.
 If I start it by hand, it works fine.

Comment 4 Lon Hohberger 2008-01-23 23:09:59 UTC

Created attachment 292708 [details]
Kills ping after 2 seconds

I placed this script (ping-wrap) in /sbin - and changed my heuristic from:

  ping -c2 -t2 192.168.79.254

to:

  ping-wrap -c2 -t2 192.168.79.254

... and it worked.

Comment 5 Lon Hohberger 2008-01-23 23:25:06 UTC

I'm wrong - it seems that if you do 'iptables -A OUTPUT -d <ipaddr> -j DROP',
ping always hangs, qdiskd or not!

[root@molly ~]# ping 192.168.79.254
PING 192.168.79.254 (192.168.79.254) 56(84) bytes of data.
ping: sendmsg: Operation not permitted
ping: sendmsg: Operation not permitted
ping: sendmsg: Operation not permitted

[root@molly ~]# iptables -F
[root@molly ~]# ping 192.168.79.254
PING 192.168.79.254 (192.168.79.254) 56(84) bytes of data.
64 bytes from 192.168.79.254: icmp_seq=1 ttl=255 time=13.4 ms

--- 192.168.79.254 ping statistics ---
1 packets transmitted, 1 received, 0% packet loss, time 0ms
rtt min/avg/max/mdev = 13.440/13.440/13.440/0.000 ms

Comment 6 Lon Hohberger 2008-01-23 23:26:15 UTC

However, as Thorsten figured out:

It seems that if you do 'iptables -A OUTPUT -d <ipaddr> -j REJECT',
ping works fine.

Comment 7 Lon Hohberger 2008-01-23 23:38:05 UTC

... it's because ping's using SIGALRM, and qdiskd blocks it.

Comment 8 Lon Hohberger 2008-01-24 00:16:02 UTC

Created attachment 292717 [details]
Fix

Comment 10 Charlie Brady 2008-01-24 16:02:39 UTC

> * why does no node gets fenced when the cluster isn't quorate?

My understanding is that nodes within quorum should fence failed nodes, but
outsiders shouldn't fence anything in a running or trying to form cluster.

Comment 11 Lon Hohberger 2008-01-24 16:35:18 UTC

Normally, that's the case, with some exceptions:

(a) If using a two_node="1" cluster, nodes who cannot see each other will try to
fence each other
(b) if using qdisk, and your heuristics are "good" while a "majority" is "bad",
you can gain quorum and then fence the majority set of nodes

But generally, yes, only the quorate partition fences.

In the case Thorsten was worried about, fencing didn't occur due to a bug in
qdiskd - where it was blocking signals in child processes.  Qdiskd wasn't
declaring the node dead like it should have - so the node appeared "just fine".
 What should have happened is the node with the iptables rule should have
rebooted (or removed its qdisk vote(s) if reboot was set to 0) - allowing the
good node to fence it.

What happened was that because the heuristic hung (and never exited), qdiskd
didn't remove the node at all.  CMAN, however, thought it was alive-and-quorate,
causing a fence-race.

Comment 12 Lon Hohberger 2008-01-28 19:44:19 UTC

Patch to restore signals in CVS

Comment 15 Lon Hohberger 2008-03-27 18:16:45 UTC

cman-2.0.81 fixes this given the test case in comment #6

Comment 16 Lon Hohberger 2008-03-27 18:26:13 UTC

Marking verified.

Comment 18 errata-xmlrpc 2008-05-21 15:58:44 UTC

An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2008-0347.html

Note You need to log in before you can comment on or make changes to this bug.