Mailing List Archive

heartbeat
Hello,

I have an Ethernet connection that connects all hosts in a cluster, and
the nodes also have an IB connection. I want the failover host to take
over when the IB connection goes down on a host. Is there an example of
how to do this? (I am using IPMI for shutting down hosts, etc.)

The cluster I am using has 8 nodes, and I want to do failover in pairs of
two. In the ha.cf file, do I list all the hosts, or just each host and
its failover partner, per pair?

thanks,

Ron
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
Re: heartbeat
On 01/20/2015 01:34 PM, Ron Croonenberg wrote:
> Hello,
>
> I have an Ethernet connection that connects all hosts in a cluster, and
> the nodes also have an IB connection. I want the failover host to take
> over when the IB connection goes down on a host. Is there an example of
> how to do this? (I am using IPMI for shutting down hosts, etc.)
>
> The cluster I am using has 8 nodes, and I want to do failover in pairs of
> two. In the ha.cf file, do I list all the hosts, or just each host and
> its failover partner, per pair?

Do you have 4 separate active-passive pairs or a cluster of 8 nodes? If
it's the latter, I think you want pacemaker, not heartbeat. I don't know
what pacemaker might have for monitoring an IB connection; with heartbeat
R1 I'd do something like grep for "LinkUp" in the output of ibstat.
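
The "grep for LinkUp" idea can be sketched in Python as below. This is only
an illustration, assuming ibstat prints a "Physical state: LinkUp" line for
a port whose link is up. The function names are made up for this example,
and how the result is wired into heartbeat (for instance, stopping heartbeat
on the node so its partner takes over) is left out.

```python
import subprocess


def ib_link_is_up(ibstat_output: str) -> bool:
    """Return True if any port in the ibstat output reports LinkUp."""
    for line in ibstat_output.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "Physical state" and value.strip() == "LinkUp":
            return True
    return False


def check_ib_link(command=("ibstat",)) -> bool:
    """Run ibstat and check its output.

    A missing or failing command is treated as "link down" (an assumption
    made for this sketch, not something heartbeat prescribes).
    """
    try:
        result = subprocess.run(
            list(command), capture_output=True, text=True, check=True
        )
    except (OSError, subprocess.CalledProcessError):
        return False
    return ib_link_is_up(result.stdout)
```

Like a plain grep, this reports "up" if any port is up; a host with several
IB ports would need a per-port check instead.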

--
Dimitri Maziuk
Programmer/sysadmin
BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu
Re: heartbeat
Hi Dimitri,

Yes, there are 4 pairs, but all the nodes are active. When a node fails,
the other one in the pair just takes everything over.
A different HA stack is not an option; it has to be heartbeat. I noticed
something called ethmonitor, and I can probably monitor the IB connection
with it (the IB connection has IPoIB on it).
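
For the ha.cf question, here is a rough sketch that assumes each of the 4
pairs runs as its own two-node heartbeat cluster, so each ha.cf lists only
the two hosts of that pair. The host names (node1 to node8), the eth0
interface, the timing values, and the per-pair UDP ports are all
hypothetical. Giving each pair its own port is an assumption intended to
keep pairs that share one Ethernet broadcast domain from seeing each
other's heartbeats.

```python
def pair_nodes(nodes):
    """Group a flat list of host names into consecutive pairs."""
    if len(nodes) % 2:
        raise ValueError("need an even number of nodes")
    return [(nodes[i], nodes[i + 1]) for i in range(0, len(nodes), 2)]


def render_ha_cf(pair, udpport, interface="eth0"):
    """Render a minimal ha.cf for one two-node pair (illustrative values)."""
    lines = [
        "keepalive 2",           # seconds between heartbeats
        "deadtime 30",           # seconds before a silent peer is declared dead
        f"udpport {udpport}",    # hypothetical: one port per pair
        f"bcast {interface}",    # heartbeat over the shared Ethernet
        "auto_failback off",
    ]
    # Only the two hosts of this pair are listed.
    lines += [f"node {name}" for name in pair]
    return "\n".join(lines) + "\n"


# Hypothetical host names; real ones must match `uname -n` on each host.
nodes = [f"node{i}" for i in range(1, 9)]
configs = {
    pair: render_ha_cf(pair, 694 + index)
    for index, pair in enumerate(pair_nodes(nodes))
}
```

Printing configs[("node1", "node2")] shows the file that would go on both
hosts of the first pair.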



On 01/20/2015 12:51 PM, Dimitri Maziuk wrote:
> Do you have 4 separate active-passive pairs or a cluster of 8 nodes? If
> it's the latter, I think you want pacemaker, not heartbeat. Dunno what
> pacemaker might have for monitoring an IB connection, with heartbeat R1
> I'd do something like grep for "LinkUp" in the output of ibstat.