Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 02 Jul 2010 16:59:30 +0900
From:      "hiroshi@soupacific.com" <hiroshi@soupacific.com>
To:        Mikolaj Golub <to.my.trociny@gmail.com>
Cc:        freebsd-fs@freebsd.org, Pawel Jakub Dawidek <pjd@FreeBSD.org>
Subject:   Re: HAST and CARP
Message-ID:  <4C2D9C62.4050105@soupacific.com>
In-Reply-To: <861vbm1hpr.fsf@zhuzha.ua1>
References:  <4C139F9C.2090305@soupacific.com><86iq5oc82y.fsf@kopusha.home.net> <4C14215D.9090304@soupacific.com><20100613003635.GA60012@icarus.home.lan><20100613074921.GB1320@garage.freebsd.pl><4C149A5C.3070401@soupacific.com><20100613102401.GE1320@garage.freebsd.pl><86eigavzsg.fsf@kopusha.home.net><20100614095044.GH1721@garage.freebsd.pl><868w6hwt2w.fsf@kopusha.home.net><20100614153746.GN1721@garage.freebsd.pl><86zkyxvc4v.fsf@kopusha.home.net> <4C2C43D5.1080907@soupacific.com><86mxubndrp.fsf@kopusha.home.net> <4C2D7615.5070606@soupacific.com> <861vbm1hpr.fsf@zhuzha.ua1>

next in thread | previous in thread | raw e-mail | index | archive | help
On 7/2/2010 4:11 PM, Mikolaj Golub wrote:

>
> So you have:
>
> secondary localcnt:	1
> secondary remotecnt:	0
> primary localcnt:	1
> primary remotecnt:	0
>
> This is a split-brain condition as described on wiki: primary's localcnt is
> greater than secondary's remotecnt (primary [fw01A] was modified while fw01B
> wasn't watching) and secondary's localcnt is greater than primary's remotecnt
> (fw01B was modified while fw01A wasn't watching).

So hasctl role secondary xxx does not change cnt values ?

Scenario is this
ServerA failed, then ServerB became MASTER.

Only ServerA is started(say after fixed something) , both servers are 
connected,then ServerB starts,  BUT during failure of ServerA, ServerB 
was MASTER.
ServerA was started before ServerB is started, thus ServerA should be 
MASTER!

On this situation, CARP will set ServerA is MASTER and late comer 
ServerB is set as BACKUP by CARP.

hastctl role secondary xxx set

 > secondary localcnt:	1
 > secondary remotecnt:	0
 > primary localcnt:	1
 > primary remotecnt:	0

above values to NOT split-brain. It sounds more favoritabel way ????

hastctl role is managed by ifstated watching CARP status.

Is this strange idea ?

Thanks

Hiroshi


>
>   h>  Hope this logs can help you ! If you need to make me debug bit more,
>   h>  give me some idea to check!
>
> Actually the logs you have provided are not very interesting as they shows the
> state after bad things happened. It is more interesting to look at the logs
> (both hosts) before split brain.
>
> I would recommend:
>
> 1) Configure hast manually and ensure that both primary and secondary function
> properly and data are synchronized between the nodes. Also make sure the clock
> on both hosts is in sync (needed when comparing logs).
>
> 2) Reboot both servers so your carp/hast setup auto starts and see what
> happens.
>
> 3) If it sets primary and secondary automatically and status is ok on both
> nodes initiate switching to failover.
>
> 4) If after switching (or earlier) split brain is detected, provide logs from
> both nodes since hosts reboot.
>




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4C2D9C62.4050105>