This is the mail archive of the
cluster-cvs@sources.redhat.com
mailing list for the cluster.
cluster/fence/fenced recover.c
- From: teigland at sourceware dot org
- To: cluster-cvs at sources dot redhat dot com
- Date: 20 Apr 2005 05:47:55 -0000
- Subject: cluster/fence/fenced recover.c
CVSROOT: /cvs/cluster
Module name: cluster
Changes by: teigland@sourceware.org 2005-04-20 05:47:55
Modified files:
fence/fenced : recover.c
Log message:
Improve logic that delays and reduces fencing. When fenced is recovering
for a failed node, the 'post_fail_delay' is used to give victims some
time to rejoin the cluster and avoid being fenced. If this happens once,
then it's likely to happen again and the 'post_join_delay' is more
appropriate, so fenced switches to the 'post_join_delay' value (if it's
larger which is usually the case.)
The common situation where this helps is when multiple nodes fail causing
the cluster to lose quorum and then the failed nodes all rejoin the
cluster at about the same time. The rejoining nodes are more likely
to all avoid being fenced if fenced uses the larger post_join_delay.
Patches:
http://sources.redhat.com/cgi-bin/cvsweb.cgi/cluster/fence/fenced/recover.c.diff?cvsroot=cluster&r1=1.16&r2=1.17